A Distributional Multi-word Thesaurus in Sketch Engine
Autoři | |
---|---|
Rok publikování | 2019 |
Druh | Článek ve sborníku |
Konference | Proceedings of the Thirteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2019 |
Fakulta / Pracoviště MU | |
Citace | |
www | https://nlp.fi.muni.cz/raslan/2019/paper17-jakubicek.pdf |
Klíčová slova | text corpus; Sketch Engine; MWE; multi-word expressions; thesaurus |
Popis | In this paper we present an extension of the current distribu-tional thesaurus as available in the Sketch Engine corpus managementsystem towards multi-word units. We explain how multi-word sketches areused to generate multi-word unit candidates, thus preserving access to theunderlying corpus texts. Finally we present sample results on the BritishNational Corpus and discuss future development as well as difficulties inevaluation. |
Související projekty: |