Abstract
In this work we give new density estimators by averaging classical density estimators such as the histogram, the frequency polygon and the kernel density estimators obtained over different bootstrap samples of the original data. Using existent results, we prove the \(L^2\)-consistency of these new estimators and compare them to several similar approaches by simulations. Based on them, we give also a way to construct non-parametric pointwise variability band for the target density.
Similar content being viewed by others
References
Botev ZI, Grotowski JF, Kroese DP (2010) Kernel density estimation via diffusion. Ann Stat 38(5):2916–2957
Bourel M, Ghattas B (2013) Aggregating density estimators: an empirical study. Open J Stat 3(5):344–355
Bourel M, Ghattas B, Fraiman R (2014) Random average shifted histograms. Comput Stat Data Anal 79:149–164
Bowman A, Azzalini A (1997) Applied smoothing techniques for data analysis: the kernel approach with S-Plus illustrations. Oxford statistical science series. OUP Oxford, Oxford
Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Efron B (1979) Bootstrap methods: another look at the jackknife. Ann Stat 7(1):1–26
Efron B, Tibshirani R (1993) An introduction to the bootstrap. Monographs on statistics and applied probability. Chapman & Hall, Boca Raton
Fisher R (1932) Statistical methods for research workers. Biological monographs and manuals. Oliver and Boyd, New York
Freund Y, Schapire R (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139
Glodek M, Schels M, Schwenker F (2013) Ensemble gaussian mixture models for probability density estimation. Comput Stat 28(1):127–138
Hall P (1997) The bootstrap and Edgeworth expansion. Springer series in statistics. Springer, New York
Marron J, Wand M (1992) Exact mean integrated square error. Ann Stat 20(2):712–736
R Core Team (2018) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Ridgeway G (2002) Looking for lumps: Boosting and bagging for density estimation. Comput Stat Data Anal 38(4):379–392
Rigollet P, Tsybakov AB (2007) Linear and convex aggregation of density estimators. Math Methods Stat 16(3):260–280
Rosset S, Segal E (2002) Boosting density estimation. In: Advances in neural information processing systems (NIPS), pp 641–648
Scott D (1985a) Averaged shifted histogram: effective nonparametric density estimators inseveral dimensions. Ann Stat 13(3):1024–1040
Scott D (1985b) Frequency polygons: theory and application. J Am Stat Assoc 80(390):348–354
Scott D (2015) Multivariate density estimation: theory, practice, and visualization. Wiley series in probability and statistics. Wiley, Hoboken
Scott DW (1979) On optimal and data-based histograms. Biometrika 66:605–610
Smyth P, Wolpert D (1999) Linearly combining density estimators via stacking. Mach Learn 36(1–2):59–83
Song X, Yang K, Pavel M (2004) Density boosting for gaussian mixtures. Neural Inf Process 3316:508–515
Wasserman L (2006) All of nonparametric statistics. Springer texts in statistics. Springer, New York
Wolpert D (1992) Stacked generalization. Neural Netw 5:241–259
Acknowledgements
We would like to thank project ECOS-2014 Aprendizaje Automático para la Modelización y el Análisis de Recursos Naturales, no. U14E02, the LIA-IFUM and the ANII-Uruguay for their financial support.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: Additional results
Appendix: Additional results
1.1 Quality of the estimation
For sake of completeness we present in this appendix the individual values of Fig. 3. In the following tables (Tables 3, 4, 5, 6, 7), values are \(100\times \)MISE obtained as mean average over 100 replicates. At each line, best results are shown in bold.
Rights and permissions
About this article
Cite this article
Bourel, M., Cugliari, J. Bagging of density estimators. Comput Stat 34, 1849–1869 (2019). https://doi.org/10.1007/s00180-019-00889-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-019-00889-9