Automatic Structuring of Written Texts
Authors | |
---|---|
Year of publication | 1999 |
Type | Article in Proceedings |
Conference | Proceedings of 2nd International Conference on Text, Speech, and Dialogue (TSD 1999) |
MU Faculty or unit | |
Citation | |
Web | http://nlp.fi.muni.cz/publications/tsd1999_mara_hales_julinek_smrz/ |
Field | Use of computers, robotics and its application |
Keywords | text structure |
Description | This paper deals with automatic structuring and sentence boundary labelling in natural language texts. We describe the implemented structure tagging algorithm and heuristic rules that are used for automatic or semiautomatic labelling. Inside the detected sentence the algorithm performs a decomposition to clauses and then marks the parts of text which do not form a sentence, i.e. headings, signatures, tables and other structured data. We also pay attention to the processing of matched symbols in the text, especially to the analysis of direct speech notation. |
Related projects: |