Detecting Annotation Errors in a Corpus by Induction of Syntactic Patterns
Authors | |
---|---|
Year of publication | 2003 |
Type | Article in Proceedings |
Conference | Text, Speech and Dialogue: Sixth International Conference, TSD 2003 |
MU Faculty or unit | |
Citation | |
Field | Informatics |
Keywords | error detection; morphological tagging; relational rule induction; syntactic patterns |
Description | This paper presents a new method for acquiring syntactic patterns capable of detecting errors in annotated corpora. The patterns are acquired semi-automatically: an inductive logic programming (relational data mining) system induces them, and a human expert then reviews them. The acquired patterns have been used for automatic detection and subsequent manual correction of annotation errors in DESAM, a morphologically annotated corpus of written Czech. |
Related projects: