Large Scale Keyword Extraction using a Finite State Backend

Authors

JAKUBÍČEK Miloš, ŠMERK Pavel

Year of publication 2016
Type Article in Proceedings
Conference Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016
MU Faculty or unit

Faculty of Informatics

Citation
Web https://nlp.fi.muni.cz/raslan/2016/paper17-Jakubicek_Smerk.pdf
Field Informatics
Keywords terminology extraction; keyword extraction; fsa; Sketch Engine
Description We present a novel method for fast keyword extraction from large text corpora using a finite state backend. The FSA3 package has been adopted for this purpose. We outline the basic approach and compare it with the previous hash-based method used in Sketch Engine.