[go: up one dir, main page]

Skip to main content

The Use of k-Means Algorithm to Improve Kernel Method via Instance Selection

  • Conference paper
  • First Online:
Knowledge Science, Engineering and Management (KSEM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9403))

  • 2952 Accesses

Abstract

The kernel method is well known for its success in solving the curse of dimension of linearly inseparable problems. But as an instance- based learning algorithm it suffers from high memory requirement and low efficiency in that it needs to store all of the training instances. And when there are noisy instances classification accuracy can suffer. In this paper we present an approach to alleviate both of the problems mentioned above by using k-means algorithm to select only k representativeness instances of the training data. And we view the selected k instances as the new data set, where the choice of the value of k is influenced by the size and the character of the data set. It turn out that with a carefully selected k we can still get a good performance while the number of the instances stored are greatly decreased.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Wu, X.: Top 10 algorithms in data mining. Knowledge and Information Systems 14(1), 1–37 (2008)

    Article  Google Scholar 

  2. Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press (2004)

    Google Scholar 

  3. Gretton, A., et al.: A kernel method for the two-sample-problem. In: Advances in Neural Information Processing Systems (2006)

    Google Scholar 

  4. Brighton, H., Mellish, C.: Advances in instance selection for instance-based learning algorithms. Data Mining and Knowledge Discovery 6(2), 153–172 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  5. Aha, D.W., Kibler, D., Albert, M.K.: Instance-based learning algorithms. Machine Learning 6(1), 37–66 (1991)

    Article  Google Scholar 

  6. Lange, T., Buhmann, J.M.: Fusion of similarity data in clustering. In: Proceeding of Advances in Neural Information Processing Systems (2005)

    Google Scholar 

  7. Zavrel, J., Daelemans, W.: Memory-based learning: using similarity for smoothing. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lulu Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Wang, L. (2015). The Use of k-Means Algorithm to Improve Kernel Method via Instance Selection. In: Zhang, S., Wirsing, M., Zhang, Z. (eds) Knowledge Science, Engineering and Management. KSEM 2015. Lecture Notes in Computer Science(), vol 9403. Springer, Cham. https://doi.org/10.1007/978-3-319-25159-2_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25159-2_50

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25158-5

  • Online ISBN: 978-3-319-25159-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics