A New Robust Regression Method Based on Minimization of Geodesic Distances on a Probabilistic Manifold: Application to Power Laws †
<p>(<b>a</b>) Illustration of the Poincaré half-plane with several half-circle geodesics in the background, together with the geodesic between the points <span class="html-italic">p</span><sub>1</sub> and <span class="html-italic">p</span><sub>2</sub> and between <span class="html-italic">p</span><sub>3</sub> and <span class="html-italic">p</span><sub>4</sub>, defined in the main text. (<b>b</b>) Probability densities corresponding to the points <span class="html-italic">p</span><sub>1</sub>, <span class="html-italic">p</span><sub>2</sub>, <span class="html-italic">p</span><sub>3</sub> and <span class="html-italic">p</span><sub>4</sub> indicated in (a). The densities associated with some intermediate points on the geodesics between <span class="html-italic">p</span><sub>1</sub> and <span class="html-italic">p</span><sub>2</sub> and between <span class="html-italic">p</span><sub>3</sub> and <span class="html-italic">p</span><sub>4</sub> are also drawn. (<b>c</b>) Rendering of one blade of the tractroid, again with the two geodesics superimposed. The parallels of the tractroid are lines of constant standard deviation σ, while the meridians (the tractrices) are lines of constant mean µ. This representation of the normal manifold is periodic in the µ-direction, and a rescaled version (longer period along µ) is shown in (<b>d</b>).</p> ">
<p>A portion of the pseudosphere together with the regression results on synthetic data with an outlier, as described in the main text.</p> ">
<p>(<b>a</b>) Histograms of the relative error in estimating the regression coefficients β<span class="html-italic"><sub>i</sub></span> by means of OLS, MAP and GLS for a linear regression problem with outliers. Horizontal axes represent the error in percent and vertical axes probability, normalized to one. (<b>b</b>) Similar, for TLS and ROB.</p> ">
<p>Histograms of the relative error in estimating the regression coefficients β<span class="html-italic"><sub>i</sub></span> by means of OLS, MAP and GLS for a power-law regression problem after a logarithmic transformation. Horizontal axes represent the error in percent and vertical axes probability, normalized to one. (<b>b</b>) Similar, for TLS and ROB.</p> ">
Abstract
:1. Introduction
2. Geodesic Least Squares Regression
2.1. Distance in Information Geometry
2.2. Geodesics for the Univariate Normal Distribution
2.3. Geodesic Least Squares Methodology
3. The L-H Power Threshold and Database
4. Numerical Simulations
4.1. Effect of Outliers
4.1.1. Single Predictor Variable
4.1.2. Multiple Predictor Variables
4.2. Effect of Logarithmic Transformation
4.2.1. Single Predictor Variable
4.2.2. Multiple Predictor Variables
5. Power Threshold Scaling
5.1. Linear Scaling
5.2. Nonlinear Scaling
5.3. Influence of Error Bars
5.4. Discussion
6. Conclusions
Acknowledgments
- †This paper is an extended version of our paper published in the 34th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering (MaxEnt 2014), 21–26 September 2014, Amboise, France.
Conflicts of Interest
References and Notes
- Doyle, E.J.; Houlberg, W.A.; Kamada, Y.; Mukhovatov, V.; Osborne, T.H.; Polevoi, A.; Bateman, G.; Connor, J.W.; Cordey, J.G.; Fujita, T.; et al. Chapter 2: Plasma confinement and transport. Nucl. Fusion. 2007, 47, S18–S127. [Google Scholar]
- Xiao, X.; White, E.P.; Hooten, M.B.; Durham, S.L. On the use of log-transformations vs. nonlinear regression for analyzing biological power laws. Ecology 2011, 92, 1887–1894. [Google Scholar]
- McDonald, D.; Meakins, A.J.; Svensson, J.; Kirk, A.; Cordey, J.G. ITPA H-mode Threshold Database WG. The impact of statistical models on scalings derived from multi-machine H-mode threshold experiments. Plasma Phys. Control. Fusion. 2006, 48, A439–A447. [Google Scholar]
- Verdoolaege, G. Geodesic least squares regression on information manifolds. AIP Conf. Proc. 2013, 1636, 43–48. [Google Scholar]
- Verdoolaege, G. Geodesic least squares regression for scaling studies in magnetic confinement fusion. AIP Conf. Proc. 2014, 1641, 564–571. [Google Scholar]
- Basu, A.; Shioya, H.; Park, C. Statistical Inference: The Minimum Distance Approach; Chapman & Hall/CRC: Boca Raton, FL, USA, 2011; Volume 120. [Google Scholar]
- McCullagh, P.; Nelder, J. Generalized Linear Models, 2nd ed; Chapman & Hall/CRC: Boca Raton, FL, USA, 1989; Volume 37. [Google Scholar]
- Amari, S.; Nagaoka, H. Methods of Information Geometry; American Mathematical Society: New York, NY, USA, 2000. [Google Scholar]
- We follow standard notational practice from differential geometry with respect to index placement in the following definitions for the metric, Christoffel symbols and geodesic distance. However, in the remainder of the paper we will revert to subscript indices only, in order to avoid other notational problems.
- Oprea, J. Differential Geometry and Its Applications, 2nd ed; The Mathematical Association of America: Washington, DC, USA, 2007. [Google Scholar]
- Verdoolaege, G.; Scheunders, P. On the geometry of multivariate generalized Gaussian models. J. Math. Imaging Vis. 2011, 43, 180–193. [Google Scholar]
- Kass, R.; Vos, P. Geometrical Foundations of Asymptotic Inference; Wiley: New York, NY, USA, 1997. [Google Scholar]
- Verdoolaege, G.; Scheunders, P. Geodesics on the manifold of multivariate generalized Gaussian distributions with an application to multicomponent texture discrimination. Int. J. Comput. Vis. 2011, 95, 265–286. [Google Scholar]
- Kullback, S. Information Theory and Statistics; Dover Publications: New York, NY, USA, 1968. [Google Scholar]
- Atkinson, C.; Mitchell, A. Rao’s distance measure. Indian J. Stat. 1981, 48, 345–365. [Google Scholar]
- Burbea, J.; Rao, C. Entropy differential metric, distance and divergence measures in probability spaces: A unified approach. J. Multivar. Anal. 1982, 12, 575–596. [Google Scholar]
- Nielsen, F.; Nock, R. Visualizing hyperbolic Voronoi diagrams. Proceedings of the 30th Annual Symposium on Computational Geometry (SOCG’14), Kyoto, Japan, 8–1 June 2014; p. 90.
- Beran, R. Minimum Hellinger distance estimates for parametric models. Ann. Stat. 1977, 5, 445–463. [Google Scholar]
- Pak, R. Minimum Hellinger distance estimation in simple regression models; distribution and efficiency. Stat. Probab. Lett. 1996, 26, 263–269. [Google Scholar]
- Rao, C. Differential metrics in probability spaces. In Differential Geometry in Statistical Inference; Institute of Mathematical Statistics: Hayward, CA, USA, 1987. [Google Scholar]
- Gill, P.; Murray, W.; Wright, M. Numerical Linear Algebra and Optimization; Addison Wesley: Boston, MA, USA, 1991; Volume 1. [Google Scholar]
- Casella, G.; Berger, R. Statistical Inference, 2nd ed; Cengage Learning: Hampshire, UK, 2002. [Google Scholar]
- Snipes, J.A.; Greenwald, M.; Ryter, F.; Kardaun, O.J.W.F.; Stober, J.; Valovic, M.; Valovic, S.J.; Sykes, A.; Dnestrovskij, A.; Walsh, M.; et al. Multi-Machine global confinement and H-mode threshold analysis. Proceedings of the 19th. IAEA Fusion Energy Conference, Lyon, France, 14–19 October 2002.
- Martin, Y.R.; Takizuka, T. The ITPA CDBM H-mode Threshold Database Working Group. Power requirements for accessing the H-mode in ITER. J. Phys. Conf. Ser. 2008, 123, 012033. [Google Scholar]
- Ryter, F. The H-Mode Database Working Group. H Mode power threshold database for ITER. Nucl. Fusion. 1996, 36, 1217–1264. [Google Scholar]
- Ryter, F. The H-Mode Threshold Database Group. Progress of the international H-Mode power threshold database activity. Plasma Phys. Control. Fusion. 2002, 44, A415–A421. [Google Scholar]
- ITPA—Threshold database. Available online: http://efdasql.ipp.mpg.de/threshold accessed on 30 June 2015.
- Whereas the most recent update of the database dates from 2008 [24], we used the earlier version from 2002, because it allows a better illustration of the advantages of GLS with respect to other methods. The reason is that the data in the most recent version is significantly better conditioned, in which case even a simple regression technique such as OLS turns out to be able to provide acceptable estimates of the regression parameters. This point is not relevant for the present discussion, as here our aim is to demonstrate the advantages of GLS in cases where the data are not in the best shape.
- Verdoolaege, G.; Karagounis, G.; Tendler, M.; van Oost, G. Pattern recognition in probability spaces for visualization and identification of plasma confinement regimes and confinement time scaling. Plasma Phys. Control. Fusion. 2012, 54, 124006. [Google Scholar]
- Preuss, R.; Dose, V. Errors in all variables. AIP Conf. Proc. 2005, 803, 448–455. [Google Scholar]
- Markovsky, I.; van Huffel, S. Overview of total least-squares methods. Signal Process. 2007, 87, 2283–2302. [Google Scholar]
- Maronna, R.; Martin, D.; Yohai, V. Robust Statistics: Theory and Methods; Wiley: New York, NY, USA, 2006. [Google Scholar]
- MATLAB and Statistics Toolbox Release 2015a; The Mathworks Inc: Natick, MA, USA, 2015.
- We use the notation η for the response variable instead of Pthr because in this experiment η is generated artificially and therefore it is not necessarily related to the actual power threshold in fusion devices.
- Von Toussaint, U.; Frey, M.; Gori, S. Fitting of functions with uncertainties in dependent and independent variables. AIP Conf. Proc. 2009, 1193, 302–310. [Google Scholar]
- OLS is not repeated here because it does not depend on the error bars.
- Pennec, X. Intrinsic statistics on Riemannian manifolds: Basic tools for geometric measurements. J. Math. Imaging Vis. 2006, 25, 127–154. [Google Scholar]
Original | GLS | OLS | MAP | TLS | ROB |
---|---|---|---|---|---|
β = 3.00 | 3.031 ± 0.035 | 3.68 ± 0.29 | 3.83 ± 0.36 | 4.6 ± 1.0 | 2.992 ± 0.041 |
Parameter | Original | GLS | OLS | MAP | TLS | ROB |
---|---|---|---|---|---|---|
β0 | 0.80 | 0.94 ± 0.47 | 2.2 ± 2.3 | 3.0 ± 1.7 | 0.99 ± 0.70 | 2.72 ± 0.77 |
β1 | 1.40 | 1.39 ± 0.11 | 1.19 ± 0.16 | 1.08 ± 0.26 | 1.41 ± 0.14 | 1.17 ± 0.11 |
Method | |||||||
---|---|---|---|---|---|---|---|
OLS | Average | 0.0507 | 0.485 | 0.873 | 0.843 | 38.0 | 53.2 |
CI | ±0.0060 | ±0.073 | ±0.061 | ±0.041 | ±4.4 | ±8.0 | |
MAP | Average | 0.0449 | 0.567 | 0.867 | 0.901 | 45.6 | 67.6 |
CI | ±0.0051 | ±0.078 | ±0.069 | ±0.039 | ±5.0 | ±9.6 | |
GLS | Average | 0.0426 | 0.660 | 0.795 | 0.946 | 48.3 | 76.4 |
CI | ±0.0042 | ±0.069 | ±0.059 | ±0.034 | ±4.7 | ±9.8 |
ASDEX | AUG | CMOD | DIII-D | JET | JFT-2M | JT-60U | PBXM | |
---|---|---|---|---|---|---|---|---|
Average (%) | 41.8 | 23.0 | 22.0 | 15.7 | 24.6 | 15.9 | 22.8 | 27.6 |
CI (%) | ±5.3 | ±1.4 | ±1.1 | ±1.8 | ±2.0 | ±1.2 | ±2.3 | ±2.9 |
Method | |||||||
---|---|---|---|---|---|---|---|
OLS | Average | 0.0274 | 0.773 | 0.96 | 1.038 | 69 | 118 |
CI | ±0.0083 | ±0.090 | ±0.10 | ±0.071 | ±15 | ±32 | |
MAP | Average | 0.0425 | 0.643 | 0.788 | 0.933 | 44.2 | 69.1 |
CI | ±0.0041 | ±0.074 | ±0.079 | ±0.034 | ±3.8 | ±8.2 | |
GLS | Average | 0.0397 | 0.715 | 0.751 | 0.984 | 51.6 | 84.7 |
CI | ±0.0036 | ±0.071 | ±0.081 | ±0.031 | ±4.0 | ±8.8 |
ASDEX | AUG | CMOD | DIII-D | JET | JFT-2M | JT-60U | PBXM | |
---|---|---|---|---|---|---|---|---|
Average (%) | 35.8 | 21.2 | 20.4 | 15.9 | 22.4 | 15.7 | 22.3 | 27.7 |
CI (%) | ±9.1 | ±4.3 | ±3.4 | ±2.4 | ±3.8 | ±2.2 | ±4.6 | ±8.1 |
Method | ||||||
---|---|---|---|---|---|---|
MAP | 0.0436 | 0.581 | 0.828 | 0.900 | 41.0 | 61.3 |
GLS | 0.0393 | 0.725 | 0.742 | 0.990 | 52.1 | 86.2 |
ASDEX | AUG | CMOD | DIII-D | JET | JFT-2M | JT-60U | PBXM |
---|---|---|---|---|---|---|---|
49.5 | 35.9 | 31.7 | 24.9 | 32.9 | 27.6 | 38.9 | 47.7 |
Method | ||||||
---|---|---|---|---|---|---|
MAP | 0.0488 | 0.552 | 0.807 | 0.862 | 35.1 | 51.5 |
GLS | 0.0429 | 0.647 | 0.780 | 0.938 | 45.7 | 71.5 |
ASDEX | AUG | CMOD | DIII-D | JET | JFT-2M | JT-60U | PBXM |
---|---|---|---|---|---|---|---|
49.5 | 35.9 | 31.7 | 24.9 | 32.9 | 27.6 | 38.9 | 47.7 |
© 2015 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Verdoolaege, G. A New Robust Regression Method Based on Minimization of Geodesic Distances on a Probabilistic Manifold: Application to Power Laws. Entropy 2015, 17, 4602-4626. https://doi.org/10.3390/e17074602
Verdoolaege G. A New Robust Regression Method Based on Minimization of Geodesic Distances on a Probabilistic Manifold: Application to Power Laws. Entropy. 2015; 17(7):4602-4626. https://doi.org/10.3390/e17074602
Chicago/Turabian StyleVerdoolaege, Geert. 2015. "A New Robust Regression Method Based on Minimization of Geodesic Distances on a Probabilistic Manifold: Application to Power Laws" Entropy 17, no. 7: 4602-4626. https://doi.org/10.3390/e17074602
APA StyleVerdoolaege, G. (2015). A New Robust Regression Method Based on Minimization of Geodesic Distances on a Probabilistic Manifold: Application to Power Laws. Entropy, 17(7), 4602-4626. https://doi.org/10.3390/e17074602