Building Big Czech Corpus : Collecting and Converting Czech Corpora
Authors | |
---|---|
Year of publication | 2008 |
Type | Article in Proceedings |
Conference | RASLAN 2008 |
MU Faculty or unit | |
Citation | |
Web | https://nlp.fi.muni.cz/raslan/2008/papers/11.pdf |
Field | Linguistics |
Keywords | corpus; desamb; vertjoin; |
Description | This paper describes a creating of a big Czech corpus from many Czech corpora kept on the NLP Centre server. It describes new tools developed for this purpose, difficulties which may come up and a way how solve them. |
Related projects: |