skip to main content
10.1145/1015330.1015427acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

A hierarchical method for multi-class support vector machines

Published:04 July 2004Publication History

ABSTRACT

We introduce a framework, which we call Divide-by-2 (DB2), for extending support vector machines (SVM) to multi-class problems. DB2 offers an alternative to the standard one-against-one and one-against-rest algorithms. For an N class problem, DB2 produces an N − 1 node binary decision tree where nodes represent decision boundaries formed by N − 1 SVM binary classifiers. This tree structure allows us to present a generalization and a time complexity analysis of DB2. Our analysis and related experiments show that, DB2 is faster than one-against-one and one-against-rest algorithms in terms of testing time, significantly faster than one-against-rest in terms of training time, and that the cross-validation accuracy of DB2 is comparable to these two methods.

References

  1. Bennett, K. P., Cristianini, N., Shawe-Taylor, J., & Wu, D. (2000). Enlarging the margins in perceptron decision trees. Machine Learning, 41, 295--313. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Blake, C., & Merz, C. (1998). UCI repository of machine learning databases.Google ScholarGoogle Scholar
  3. Crammer, K., & Singer, Y. (2000). On the learnability and design of output codes for multiclass problems. Computational Learing Theory (pp. 35--46). Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Dumais, S. T., & Chen, H. (2000). Hierarchical classification of Web content. Proceedings of SIGIR-00, 23rd ACM International Conference on Research and Development in Information Retrieval (pp. 256--263). Athens, GR: ACM Press, New York, US. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dy, J. G., & Brodley, C. E. (2000). Visualization and interactive feature selection for unsupervised data. Knowledge Discovery and Data Mining (pp. 360--364). Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Forgy, E. (1965). Cluster analysis of multivariate data: Efficiency vs interpretability of classifications. Biometrics, 21, 768--780.Google ScholarGoogle Scholar
  7. Friedman, J. (1996). Another approach to polychotomous classification (Technical Report). Stanford University, Department of Statistics.Google ScholarGoogle Scholar
  8. Fukunaga, k. (1990). Introduction to statistical pattern recognition. Boston, MA: Academic Press. 2 edition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Hsu, C., & Lin, C. A. comparison of methods for multiclass support vector machines. Technical report, Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, 2001, 19.Google ScholarGoogle Scholar
  10. Knerr, S., Personnaz, L., & Dreyfus, G. (1990). Single-layer learning revisited: A stepwise procedure for building and training a neural network. Neurocomputing: Algorithms, Architectures and Applications, NATO ASI Series. Springer.Google ScholarGoogle Scholar
  11. Platt, J. (1998). Sequential minimal optimization: A fast algorithm for training support vector machines. Technical Report 98-14, Microsoft Research, Redmond, Washington, April 1998. http://www.research.microsoft.com/jplatt/smo.html.Google ScholarGoogle Scholar
  12. Platt, J., Cristianini, N., & Shawe-Taylor, J. (2000). Large margin dags for multiclass classification. Advances in Neural Information Processing Systems 12 (pp. 547--553).Google ScholarGoogle Scholar
  13. Vapnik, V. (1995). The nature of statistical learning theory. New York: Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Vapnik, V. (1998). Statistical learning theory. New York: Wiley. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Weston, J., & Watkins, C. (1999). Support vector machines for multiclass pattern recognition. Proceedings of the Seventh European Symposium On Artificial Neural Networks.Google ScholarGoogle Scholar
  1. A hierarchical method for multi-class support vector machines

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICML '04: Proceedings of the twenty-first international conference on Machine learning
        July 2004
        934 pages
        ISBN:1581138385
        DOI:10.1145/1015330
        • Conference Chair:
        • Carla Brodley

        Copyright © 2004 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 July 2004

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate140of548submissions,26%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader