Fragments and Text Categorization
Authors | |
---|---|
Year of publication | 2004 |
Type | Article in Proceedings |
Conference | The Companion Volume to the Proceedings of 42st Annual Meeting of the Association for Computational Linguistics |
MU Faculty or unit | |
Citation | |
Field | Informatics |
Keywords | text classification; fragments |
Description | We introduce two novel methods of text categorization in which documents are split into fragments. We conducted experiments on English, French and Czech. In all cases, the problems referred to a binary document classification. We find that both methods increase the accuracy of text categorization. For the Naive Bayes classifier this increase is significant. |
Related projects: |