Abstract
In text classification, term frequency and term co-occurrence factors are dominantly used in weighting term features. Category relevance factors have recently been used to propose term weighting approaches. However, these approaches are mainly based on their own-designed text classifiers to adapt to category information, where the advantages of popular text classifiers have been ignored. This paper proposes a term weighting framework for text classification tasks. The framework firstly inherits the benefits of provided category information to estimate the weighting of features. Secondly, based on the feedback information, it is able to continuously adjust feature weightings to find the best representations for documents. Thirdly, the framework robustly makes it possible to work with different text classifiers on classifying the text representations, based on category information. On several corpora with SVM classifier, experiments show that given predicted information from TFxIDF method as initial status, the proposed approach leverages accuracy results and outperforms current text classification approaches.
Original language | English |
---|---|
Title of host publication | Internationoal Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2011) |
Subtitle of host publication | Lecture Notes in Computer Science |
Editors | Alexander F Gelbukh |
Place of Publication | Berlin Heidelberg |
Publisher | Springer |
Pages | 254-265 |
Number of pages | 12 |
Volume | 6609 |
ISBN (Electronic) | 9783642194375 |
ISBN (Print) | 9783642194368 |
DOIs | |
Publication status | Published - 2011 |
Event | International Conference on Intelligent Text Processing and Computational Linguistics - Tokyo, Japan Duration: 20 Feb 2011 → 26 Feb 2011 https://www.cicling.org/2011/ |
Conference
Conference | International Conference on Intelligent Text Processing and Computational Linguistics |
---|---|
Abbreviated title | CICLING |
Country/Territory | Japan |
City | Tokyo |
Period | 20/02/11 → 26/02/11 |
Internet address |