Prague Dependency Treebank Annotation Errors: A Preliminary Analysis

Warning

This publication doesn't include Faculty of Sports Studies. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

KOVÁŘ Vojtěch JAKUBÍČEK Miloš

Year of publication 2009
Type Article in Proceedings
Conference RASLAN 2009 : Recent Advances in Slavonic Natural Language Processing
MU Faculty or unit

Faculty of Informatics

Citation
Web http://nlp.fi.muni.cz/raslan/2009/
Field Informatics
Keywords error in text; annotation; Prague Dependency Treebank; PDT
Description This paper presents a basic analysis of syntactic annotation errors and inconsistencies in the Prague Dependency Treebank, the biggest corpus of Czech with manual syntactic annotation. The corpus is used for developing and testing of many syntactic analysers of Czech and the problems in the annotation have an essential impact on the evaluation of the quality of these parsers and the results of precision measurements. We identify some of the basic annotation problems and in some cases, we outline possible solutions.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.

More info