Abstract
Parametric nonlinear mixed effects models (NLMEs) are now widely used in biometrical studies, especially in pharmacokinetics research and HIV dynamics models, due to, among other aspects, the computational advances achieved during the last years. However, this kind of models may not be flexible enough for complex longitudinal data analysis. Semiparametric NLMEs (SNMMs) have been proposed as an extension of NLMEs. These models are a good compromise and retain nice features of both parametric and nonparametric models resulting in more flexible models than standard parametric NLMEs. However, SNMMs are complex models for which estimation still remains a challenge. Previous estimation procedures are based on a combination of log-likelihood approximation methods for parametric estimation and smoothing splines techniques for nonparametric estimation. In this work, we propose new estimation strategies in SNMMs. On the one hand, we use the Stochastic Approximation version of EM algorithm (SAEM) to obtain exact ML and REML estimates of the fixed effects and variance components. On the other hand, we propose a LASSO-type method to estimate the unknown nonlinear function. We derive oracle inequalities for this nonparametric estimator. We combine the two approaches in a general estimation procedure that we illustrate with simulations and through the analysis of a real data set of price evolution in on-line auctions.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Bertin, K., Le Pennec, E., Rivoirard, V.: Adaptive Dantzig density estimation. Ann. Inst. Henri Poincaré 47, 43–74 (2011)
Bickel, P.J., Ritov, Y., Tsybakov, A.B.: Simultaneous analysis of lasso and Dantzig selector. Ann. Stat. 37(4), 1705–1732 (2009)
Bühlmann, P., van de Geer, S.: Statistics for High-Dimensional Data. Springer Series in Statistics. Springer, Heidelberg (2011)
Bunea, F.: Consistent selection via the Lasso for high dimensional approximating regression models. In: Pushing the Limits of Contemporary Statistics: Contributions in Honor of Jayanta K. Ghosh. Inst. Math. Stat. Collect., vol. 3, pp. 122–137. Inst. Math. Statist., Beachwood (2008)
Bunea, F., Tsybakov, A.B., Wegkamp, M.H.: Aggregation and sparsity via l 1 penalized least squares. In: Learning Theory. Lecture Notes in Comput. Sci., vol. 4005, pp. 379–391. Springer, Berlin (2006)
Bunea, F., Tsybakov, A., Wegkamp, M.: Sparsity oracle inequalities for the Lasso. Electron. J. Stat. 1, 169–194 (2007a)
Bunea, F., Tsybakov, A.B., Wegkamp, M.H.: Aggregation for Gaussian regression. Ann. Stat. 35(4), 1674–1697 (2007b)
Comte, F., Samson, A.: Nonparametric estimation of random effects densities in linear mixed-effects model. J. Nonparametr. Stat. 24, 951–975 (2012)
Delyon, B., Lavielle, M., Moulines, E.: Convergence of a stochastic approximation version of the EM algorithm. Ann. Stat. 27, 94–128 (1999)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum-likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. B 39, 1–38 (1977)
Ding, A.A., Wu, H.: Assessing antiviral potency of anti-HIV therapies in vivo by comparing viral decay rates in viral dynamic models. Biostatistics 2, 13–29 (2001)
Foulley, J.L., Quaas, R.: Heterogeneous variances in Gaussian linear mixed models. Genet. Sel. Evol. 27, 211–228 (1995)
Ge, Z., Bickel, P., Rice, J.: An approximate likelihood approach to nonlinear mixed effects models via spline approximation. Comput. Stat. Data Anal. 46, 747–776 (2004)
van de Geer, S.: ℓ 1-regularization in high-dimensional statistical models. In: Proceedings of the International Congress of Mathematicians, vol. IV, pp. 2351–2369. Hindustan Book Agency, New Delhi (2010)
Hartford, A., Davidian, M.: Consequences of misspecifying assumptions in nonlinear mixed effects models. Comput. Stat. Data Anal. 34, 139–164 (2000)
Harville, D.: Bayesian inference for variance components using only error contrasts. Biometrika 61, 383–385 (1974)
Jank, W.: Implementing and diagnosing the stochastic approximation EM algorithm. J. Comput. Graph. Stat. 15(4), 803–829 (2006)
Jank, W., Shmueli, G.: Functional data analysis in electronic commerce research. Stat. Sci. 21, 155–166 (2006)
Ke, C., Wang, Y.: Semiparametric nonlinear mixed-effects models and their applications (with discussion). J. Am. Stat. Assoc. 96(456), 1272–1298 (2001)
Kuhn, E., Lavielle, M.: Coupling a stochastic approximation version of EM with an MCMC procedure. ESAIM Probab. Stat. 8, 115–131 (2004)
Kuhn, E., Lavielle, M.: Maximum likelihood estimation in nonlinear mixed effects models. Comput. Stat. Data Anal. 49(4), 1020–1038 (2005)
Liu, B., Müller, H.G.: Functional data analysis for sparse auction data. In: Jank, W., Shmueli, G. (eds.) Statistical Methods in E-commerce Research, pp. 269–290. Wiley, New York (2008)
Liu, W., Wu, L.: Simultaneous inference for semiparametric nonlinear mixed-effects models with covariate measurement errors and missing responses. Biometrics 63, 342–350 (2007)
Liu, W., Wu, L.: A semiparametric nonlinear mixed-effects model with non-ignorable missing data and measurement errors for HIV viral data. Comput. Stat. Data Anal. 53, 112–122 (2008)
Liu, W., Wu, L.: Some asymptotic results for semiparametric nonlinear mixed-effects models with incomplete data. J. Stat. Plan. Inference (2009). doi:10.1016j.jspi.2009.06.006
Luan, Y., Li, H.: Model-based methods for identifying periodically expressed genes based on time course microarray gene expression data. Bioinformatics 20(3), 332–339 (2004)
Meza, C., Jaffrézic, F., Foulley, J.L.: Estimation in the probit normal model for binary outcomes using the SAEM algorithm. Biom. J. 49(6), 876–888 (2007)
Meza, C., Jaffrézic, F., Foulley, J.L.: Reml estimation of variance parameters in nonlinear mixed effects models using the SAEM algorithm. Comput. Stat. Data Anal. 53(4), 1350–1360 (2009)
Patterson, H.D., Thompson, R.: Recovery of inter-block information when block sizes are unequal. Biometrika 58, 545–554 (1971)
Pinheiro, J., Bates, D.: Mixed-Effects Models in S and S-PLUS. Springer, New York (2000)
Ramos, R., Pantula, S.: Estimation of nonlinear random coefficient models. Stat. Probab. Lett. 24, 49–56 (1995)
Reithinger, F., Jank, W., Tutz, G., Shmueli, G.: Modelling price paths in on-line auctions: smoothing sparse and unevenly sampled curves by using semiparametric mixed models. Appl. Stat. 57, 127–148 (2008)
Schelldorfer, J., Bühlmann, P., van de Geer, S.: Estimation for high-dimensional linear mixed-effects models using l1-penalization. Scand. J. Stat. 38, 197–214 (2011)
Shmueli, G., Jank, W.: Visualizing online auctions. J. Comput. Graph. Stat. 14, 299–319 (2005)
Shmueli, G., Russo, R.P., Jank, W.: The BARISTA: a model for bid arrivals in online auctions. Ann. Appl. Stat. 1, 412–441 (2007)
Sklar, J.C., Wu, J., Meiring, W., Wang, Y.: Non-parametric regression with basis selection from multiple libraries. Technometrics (2012, accepted)
Tibshirani, R.: Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. B 58, 267–288 (1996)
Vonesh, E.F.: A note on the use of Laplace’s approximation for nonlinear mixed-effects models. Biometrika 83, 447–452 (1996)
Wang, Y., Brown, M.B.: A flexible model for human circadian rhythms. Biometrics 52, 588–596 (1996)
Wang, Y., Ke, C.: Assist: A suite of s functions implementing spline smoothing techniques (2004). http://wwwpstatucsbedu/faculty/yuedong/assistpdf
Wang, Y., Ke, C., Brown, M.B.: Shape-invariant modeling of circadian rhythms with random effects and smoothing spline ANOVA decompositions. Biometrics 59, 804–812 (2003)
Wang, Y., Eskridge, K., Zhang, S.: Semiparametric mixed-effects analysis of PKPD models using differential equations. J. Pharmacokinet. Pharmacodyn. 35, 443–463 (2008)
Wei, G.C., Tanner, M.A.: A Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithm. J. Am. Stat. Assoc. 85, 699–704 (1990)
Wu, H., Zhang, J.: The study of longterm HIV dynamics using semi-parametric non-linear mixed-effects models. Stat. Med. 21, 3655–3675 (2002)
Yuan, M., Lin, Y.: Model selection and estimation in regression with grouped variables. J. R. Stat. Soc. B 68(1), 49–67 (2006)
Acknowledgements
The authors would like to thank the anonymous Associate Editor and two referees for valuable comments and suggestions.
The research of Ana Arribas-Gil is supported by projects MTM2010-17323 and ECO2011-25706, Spain.
The research of Karine Bertin is supported by projects FONDECYT 1090285 and ECOS/CONICYT C10E03 2010, Chile.
The research of Cristian Meza is supported by project FONDECYT 11090024, Chile.
The research of Vincent Rivoirard is partly supported by the french Agence Nationale de la Recherche (ANR 2011 BS01 010 01 projet Calibration).
Author information
Authors and Affiliations
Corresponding author
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Arribas-Gil, A., Bertin, K., Meza, C. et al. LASSO-type estimators for semiparametric nonlinear mixed-effects models estimation. Stat Comput 24, 443–460 (2014). https://doi.org/10.1007/s11222-013-9380-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11222-013-9380-x