Use this url to cite publication: https://hdl.handle.net/20.500.12259/36623
Identification of multiword expressions for Latvian and Lithuanian: hybrid approach
Type of publication
Straipsnis recenzuojamoje užsienio tarptautinės konferencijos medžiagoje / Article in peer-reviewed foreign international conference proceedings (P1d)
Author(s)
Author | Affiliation | |||
---|---|---|---|---|
Baltijos pažangiųjų technologijų institutas | LT | Vilniaus universitetas | LT | |
Title [en]
Identification of multiword expressions for Latvian and Lithuanian: hybrid approach
Is part of
EACL 2017: 13th workshop on multiword expressions, April 4, 2017 Valencia, Spain: proceedings of the workshop. Stroudsburg : Association for Computational Linguistics, 2017
Date Issued
Date |
---|
2017 |
Publisher
Stroudsburg : Association for Computational Linguistics, 2017
Extent
p. 97-101
Abstract (en)
We discuss an experiment on automatic identification of bi-gram multiword expressions in parallel Latvian and Lithuanian corpora. Raw corpora, lexical association measures (LAMs) and supervised machine learning (ML) are used due to deficit and quality of lexical resources (e.g., POS-tagger, parser) and tools. While combining LAMs with ML is rather effective for other languages, it has shown some nice results for Lithuanian and Latvian as well. Combining LAMs with ML we have achieved 92,4% precision and 52,2% recall for Latvian and 95,1% precision and 77,8% recall for Lithuanian.
Type of document
type::text::journal::journal article::research article
Language
Anglų / English (en)
Coverage Spatial
Jungtinės Amerikos Valstijos / United States of America (US)
Description
This research was funded by a grant (No. LIP- 027/2016) from the Research Council of Lithuania
File(s)
ISBN (of the container)
9781945626487
Other Identifier(s)
VDU02-000022094
Access Rights
Atviroji prieiga / Open Access
Creative Commons License