Siirry päänavigointiin Siirry hakuun Siirry pääsisältöön

TerseSVM : A scalable approach for learning compact models in Large-scale classification

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

Abstrakti

For large-scale multi-class classification problems, consisting of tens of thousand target categories, recent works have emphasized the need to store billions of parameters. For instance, the classical l2-norm regularization employed by a state-of-the-art method results in the model size of 17GB for a training set whose size is only 129MB. To the contrary, by using a mixed-norm regularization approach, we show that around 99.5% of the stored parameters is dispensable noise. Using this strategy, we can extract the information relevant for classification, which is constituted in remaining 0.5% of the parameters, and hence demonstrate drastic reduction in model sizes. Furthermore, the proposed method leads to improvement in generalization performance compared to state-of-the-art methods, especially for under-represented categories. Lastly, our method enjoys easy parallelization, and scales well to tens of thousand target categories.
AlkuperäiskieliEnglanti
OtsikkoProceedings of the 2016 SIAM International Conference on Data Mining (SDM)
KustantajaSociety for Industrial and Applied Mathematics
Sivut234-242
Sivumäärä9
ISBN (elektroninen)978-1-61197-434-8
DOI - pysyväislinkit
TilaJulkaistu - 2016
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaSIAM INTERNATIONAL CONFERENCE ON DATA MINING - Miami, Yhdysvallat
Kesto: 5 toukok. 20167 toukok. 2016
https://archive.siam.org/meetings/sdm16/

Conference

ConferenceSIAM INTERNATIONAL CONFERENCE ON DATA MINING
LyhennettäSDM
Maa/AlueYhdysvallat
KaupunkiMiami
Ajanjakso05/05/201607/05/2016
www-osoite

Sormenjälki

Sukella tutkimusaiheisiin 'TerseSVM : A scalable approach for learning compact models in Large-scale classification'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä