KernelTagger – a PoS Tagger for Very Small Amount of Training Data
Autoři | |
---|---|
Rok publikování | 2017 |
Druh | Článek ve sborníku |
Konference | Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017 |
Fakulta / Pracoviště MU | |
Citace | RYCHLÝ, Pavel. KernelTagger – a PoS Tagger for Very Small Amount of Training Data. In Aleš Horák, Pavel Rychlý, Adam Rambousek. Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017. Brno: Tribun EU, 2017, s. 107-110. ISBN 978-80-263-1340-3. |
www | |
Obor | Informatika |
Klíčová slova | PoS tagging; morphological tagging; language model; Czech |
Popis | The paper describes a new Part of speech (PoS) tagger which can learn a PoS tagging language model from very short annotated text with the use of much bigger non-annotated text. Only several sentences could be used for training to achieve much better accuracy than a baseline. The results cannot be compared to the results of state-of-the-art taggers but it could be used during the annotation process for a pre-annotation. |
Související projekty: |