[go: up one dir, main page]

Skip to main content
Log in

Bagging of density estimators

  • Original paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

In this work we give new density estimators by averaging classical density estimators such as the histogram, the frequency polygon and the kernel density estimators obtained over different bootstrap samples of the original data. Using existent results, we prove the \(L^2\)-consistency of these new estimators and compare them to several similar approaches by simulations. Based on them, we give also a way to construct non-parametric pointwise variability band for the target density.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Botev ZI, Grotowski JF, Kroese DP (2010) Kernel density estimation via diffusion. Ann Stat 38(5):2916–2957

    Article  MathSciNet  Google Scholar 

  • Bourel M, Ghattas B (2013) Aggregating density estimators: an empirical study. Open J Stat 3(5):344–355

    Article  Google Scholar 

  • Bourel M, Ghattas B, Fraiman R (2014) Random average shifted histograms. Comput Stat Data Anal 79:149–164

    Article  MathSciNet  Google Scholar 

  • Bowman A, Azzalini A (1997) Applied smoothing techniques for data analysis: the kernel approach with S-Plus illustrations. Oxford statistical science series. OUP Oxford, Oxford

    MATH  Google Scholar 

  • Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140

    MATH  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  Google Scholar 

  • Efron B (1979) Bootstrap methods: another look at the jackknife. Ann Stat 7(1):1–26

    Article  MathSciNet  Google Scholar 

  • Efron B, Tibshirani R (1993) An introduction to the bootstrap. Monographs on statistics and applied probability. Chapman & Hall, Boca Raton

    Book  Google Scholar 

  • Fisher R (1932) Statistical methods for research workers. Biological monographs and manuals. Oliver and Boyd, New York

    Google Scholar 

  • Freund Y, Schapire R (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139

    Article  MathSciNet  Google Scholar 

  • Glodek M, Schels M, Schwenker F (2013) Ensemble gaussian mixture models for probability density estimation. Comput Stat 28(1):127–138

    Article  MathSciNet  Google Scholar 

  • Hall P (1997) The bootstrap and Edgeworth expansion. Springer series in statistics. Springer, New York

    Google Scholar 

  • Marron J, Wand M (1992) Exact mean integrated square error. Ann Stat 20(2):712–736

    Article  Google Scholar 

  • R Core Team (2018) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna

    Google Scholar 

  • Ridgeway G (2002) Looking for lumps: Boosting and bagging for density estimation. Comput Stat Data Anal 38(4):379–392

    Article  MathSciNet  Google Scholar 

  • Rigollet P, Tsybakov AB (2007) Linear and convex aggregation of density estimators. Math Methods Stat 16(3):260–280

    Article  MathSciNet  Google Scholar 

  • Rosset S, Segal E (2002) Boosting density estimation. In: Advances in neural information processing systems (NIPS), pp 641–648

  • Scott D (1985a) Averaged shifted histogram: effective nonparametric density estimators inseveral dimensions. Ann Stat 13(3):1024–1040

    Article  Google Scholar 

  • Scott D (1985b) Frequency polygons: theory and application. J Am Stat Assoc 80(390):348–354

    Article  MathSciNet  Google Scholar 

  • Scott D (2015) Multivariate density estimation: theory, practice, and visualization. Wiley series in probability and statistics. Wiley, Hoboken

    Book  Google Scholar 

  • Scott DW (1979) On optimal and data-based histograms. Biometrika 66:605–610

    Article  MathSciNet  Google Scholar 

  • Smyth P, Wolpert D (1999) Linearly combining density estimators via stacking. Mach Learn 36(1–2):59–83

    Article  Google Scholar 

  • Song X, Yang K, Pavel M (2004) Density boosting for gaussian mixtures. Neural Inf Process 3316:508–515

    Article  Google Scholar 

  • Wasserman L (2006) All of nonparametric statistics. Springer texts in statistics. Springer, New York

    MATH  Google Scholar 

  • Wolpert D (1992) Stacked generalization. Neural Netw 5:241–259

    Article  Google Scholar 

Download references

Acknowledgements

We would like to thank project ECOS-2014 Aprendizaje Automático para la Modelización y el Análisis de Recursos Naturales, no. U14E02, the LIA-IFUM and the ANII-Uruguay for their financial support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mathias Bourel.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Additional results

Appendix: Additional results

1.1 Quality of the estimation

For sake of completeness we present in this appendix the individual values of Fig. 3. In the following tables (Tables 3, 4, 5, 6, 7), values are \(100\times \)MISE obtained as mean average over 100 replicates. At each line, best results are shown in bold.

Table 3 MISE for sample size \(n=50\)
Table 4 MISE for sample size \(n=100\)
Table 5 MISE for sample size \(n=200\)
Table 6 MISE for sample size \(n=500\)
Table 7 MISE for sample size \(n=1000\)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bourel, M., Cugliari, J. Bagging of density estimators. Comput Stat 34, 1849–1869 (2019). https://doi.org/10.1007/s00180-019-00889-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-019-00889-9

Keywords

Navigation