Efficient multiclass boosting classification with active learning

Jian Huang, Seyda Ertekin, Yang Song, Hongyuan Zha, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

16 Scopus citations

Abstract

We propose a novel multiclass classification algorithm Gentle Adaptive Multiclass Boosting Learning (GAMBLE). The algorithm naturally extends the two class Gentle AdaBoost algorithm to multiclass classification by using the multiclass exponential loss and the multiclass response encoding scheme. Unlike other multiclass algorithms which reduce the K-class classification task to K binary classifications, GAMBLE handles the task directly and symmetrically, with only one committee classifier. We formally derive the GAMBLE algorithm with the quasi-Newton method, and prove the structural equivalence of the two regression trees in each boosting step. To scale up to large datasets, we utilize the generalized Query By Committee (QBC) active learning framework to focus learning on the most informative samples. Our empirical results show that with QBC-style active sample selection, we can achieve faster training time and potentially higher classification accuracy. GAMBLE'S numerical superiority, structural elegance and low computation complexity make it highly competitive with state-of-the-art multiclass classification algorithms.

Original languageEnglish (US)
Title of host publicationProceedings of the 7th SIAM International Conference on Data Mining
Pages297-308
Number of pages12
StatePublished - Dec 1 2007
Event7th SIAM International Conference on Data Mining - Minneapolis, MN, United States
Duration: Apr 26 2007Apr 28 2007

Publication series

NameProceedings of the 7th SIAM International Conference on Data Mining

Other

Other7th SIAM International Conference on Data Mining
CountryUnited States
CityMinneapolis, MN
Period4/26/074/28/07

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Fingerprint Dive into the research topics of 'Efficient multiclass boosting classification with active learning'. Together they form a unique fingerprint.

  • Cite this

    Huang, J., Ertekin, S., Song, Y., Zha, H., & Giles, C. L. (2007). Efficient multiclass boosting classification with active learning. In Proceedings of the 7th SIAM International Conference on Data Mining (pp. 297-308). (Proceedings of the 7th SIAM International Conference on Data Mining).