Abstract
This work analyzes the problem of selecting an adequate neural network architecture for a given function, comparing existing approaches and introducing a new one based on the complexity of the function under analysis. Numerical simulations are carried out on a large set of Boolean functions, and the architectures suggested by the different techniques are compared in terms of the generalization ability obtained in each case. The results show that a procedure based on function complexity can achieve nearly optimal results, even though the generalization ability varies somewhat within classes of functions of similar complexity.
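To make the idea of "complexity of the function" concrete, the following is a minimal sketch of one such measure for Boolean functions: the fraction of input pairs at Hamming distance 1 on which the function's outputs disagree (a first-order term of the kind of generalization complexity measure the abstract alludes to). The function names and the exact normalization here are illustrative assumptions, not the paper's definition.

```python
from itertools import product

def c1_complexity(f, n):
    """Illustrative first-order complexity term for an n-input Boolean
    function f: the fraction of ordered Hamming-distance-1 input pairs
    on which f disagrees. Higher values indicate a "rougher" function
    (e.g. parity attains the maximum of 1.0)."""
    inputs = list(product([0, 1], repeat=n))
    disagree = total = 0
    for x in inputs:
        fx = f(x)
        for i in range(n):          # flip each bit in turn
            y = list(x)
            y[i] ^= 1
            total += 1
            disagree += (fx != f(tuple(y)))
    return disagree / total

parity = lambda x: sum(x) % 2       # output flips on every bit flip
and_fn = lambda x: int(all(x))      # output changes only near the all-ones input

print(c1_complexity(parity, 4))     # maximal: 1.0
print(c1_complexity(and_fn, 4))     # much lower
```

Under a complexity-based selection procedure, a measure of this kind would be computed from the training examples and mapped to a suggested number of hidden units, with more complex functions allotted larger architectures.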
Cite this article
Gómez, I., Franco, L. & Jerez, J.M. Neural Network Architecture Selection: Can Function Complexity Help?. Neural Process Lett 30, 71–87 (2009). https://doi.org/10.1007/s11063-009-9108-2