Abstract
In some classification problems the feature space is heterogeneous: the best features on which to base the classification differ in different parts of the feature space. In other problems the classes can be divided into subsets such that distinguishing one subset of classes from another and classifying examples within such subsets require very different decision rules, involving different sets of features. In such heterogeneous problems, many modeling techniques (including decision trees, rules, and neural networks) evaluate the performance of alternative decision rules by averaging over the entire problem space, and are prone to generating a model that is suboptimal in any of the regions or subproblems. Better overall models can be obtained by splitting the problem appropriately and modeling each subproblem separately.
This paper presents a new measure to determine the degree of dissimilarity between the decision surfaces of two given problems, and suggests a way to search for a strategic splitting of the feature space that identifies regions with different characteristics. We illustrate the concept using a multiplexor problem.
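The multiplexor problem is a standard illustration of this kind of heterogeneity. The following sketch (not the paper's implementation; the encoding of the 6-bit multiplexer as a tuple `(a1, a0, d0, d1, d2, d3)` is an assumption made here for illustration) shows that each setting of the address bits defines a region of the feature space in which the class is determined by a different single data bit:

```python
# A minimal sketch of why a multiplexer task is "heterogeneous": in the
# 6-bit multiplexer, the two address bits (a1, a0) select which of the
# four data bits d0..d3 determines the class, so the relevant feature
# differs in each region of the feature space.
from itertools import product

def multiplexer(bits):
    """6-bit multiplexer: bits = (a1, a0, d0, d1, d2, d3)."""
    a1, a0, d0, d1, d2, d3 = bits
    address = 2 * a1 + a0          # which data line the address selects
    return (d0, d1, d2, d3)[address]

# Group examples by address (the "region" of the feature space) and check
# that within each region the class equals exactly one data bit.
for address in range(4):
    region = [b for b in product((0, 1), repeat=6)
              if 2 * b[0] + b[1] == address]
    relevant = [i for i in range(4)
                if all(multiplexer(b) == b[2 + i] for b in region)]
    print(f"address={address}: class is determined by d{relevant[0]}")
```

A learner that ranks features by averaging over all examples sees each data bit as only weakly predictive, whereas splitting on the address bits first makes each subproblem trivial.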
© 1997 Springer-Verlag
Cite this paper
Apte, C., Hong, S.J., Hosking, J.R.M., Lepre, J., Pednault, E.P.D., Rosen, B.K. (1997). Decomposition of heterogeneous classification problems. In: Liu, X., Cohen, P., Berthold, M. (eds) Advances in Intelligent Data Analysis Reasoning about Data. IDA 1997. Lecture Notes in Computer Science, vol 1280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0052826
Print ISBN: 978-3-540-63346-4
Online ISBN: 978-3-540-69520-2