Prague Dependency Treebank Annotation Errors: A Preliminary Analysis
Authors | |
---|---|
Year of publication | 2009 |
Type | Article in Proceedings |
Conference | RASLAN 2009 : Recent Advances in Slavonic Natural Language Processing |
MU Faculty or unit | |
Citation | |
Web | http://nlp.fi.muni.cz/raslan/2009/ |
Field | Informatics |
Keywords | error in text; annotation; Prague Dependency Treebank; PDT |
Description | This paper presents a basic analysis of syntactic annotation errors and inconsistencies in the Prague Dependency Treebank, the largest corpus of Czech with manual syntactic annotation. The corpus is used for developing and testing many syntactic analysers of Czech, so problems in its annotation have a significant impact on the evaluation of these parsers' quality and on the results of precision measurements. We identify some of the basic annotation problems and, in some cases, outline possible solutions. |