Abstract
Symbolic regression, as addressed in Genetic Programming (GP), very often amounts to approximate interpolation: GP algorithms try to fit the sample as well as possible, but no notion of generalization error is considered. As a consequence, overfitting, code bloat, and noisy data are problems that are not satisfactorily solved under this approach. Motivated by this situation, we revisit symbolic regression from the perspective of Machine Learning, a well-founded mathematical toolbox for predictive learning. We perform empirical comparisons between classical statistical methods (AIC and BIC) and methods based on Vapnik-Chervonenkis (VC) theory for regression problems under genetic training. The empirical comparisons suggest practical advantages of VC-based model selection. We conclude that VC theory provides a methodological framework for complexity control in Genetic Programming, even when its technical results do not seem to be directly applicable. As the main practical outcome, precise penalty functions founded on the notion of generalization error are proposed for evolving GP trees.
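To make the comparison concrete, the sketch below illustrates the standard penalization formulas commonly associated with the three criteria named in the abstract (AIC, BIC, and Vapnik-style VC-based penalization), not necessarily the paper's exact procedure. The function names, the noise-variance estimate noise_var, and the use of an estimated VC dimension h for a candidate GP tree are assumptions introduced for illustration only.

```python
import math

def aic_risk(emp_risk, d, n, noise_var):
    # Akaike-style penalized risk: empirical risk + 2*(d/n)*sigma^2,
    # where d is the number of free parameters and n the sample size.
    return emp_risk + 2.0 * (d / n) * noise_var

def bic_risk(emp_risk, d, n, noise_var):
    # Schwarz/BIC-style penalized risk: empirical risk + ln(n)*(d/n)*sigma^2.
    return emp_risk + math.log(n) * (d / n) * noise_var

def vc_penalized_risk(emp_risk, h, n):
    # Vapnik-style multiplicative penalization with p = h/n,
    # where h is an estimated VC dimension of the candidate model.
    p = h / n
    inner = 1.0 - math.sqrt(p - p * math.log(p) + math.log(n) / (2.0 * n))
    if inner <= 0.0:
        return float("inf")  # model too complex for the available sample
    return emp_risk / inner
```

In a model-selection loop over evolved GP trees, each candidate's raw training error would be replaced by one of these penalized risks, and the tree minimizing the penalized risk retained instead of the best interpolant.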
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Montaña, J.L., Alonso, C.L., Borges, C.E., de la Dehesa, J. (2011). Penalty Functions for Genetic Programming Algorithms. In: Murgante, B., Gervasi, O., Iglesias, A., Taniar, D., Apduhan, B.O. (eds) Computational Science and Its Applications - ICCSA 2011. Lecture Notes in Computer Science, vol 6782. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21928-3_40
DOI: https://doi.org/10.1007/978-3-642-21928-3_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21927-6
Online ISBN: 978-3-642-21928-3