Please use this identifier to cite or link to this item:
Type of publication: Article in Clarivate Analytics Web of Science or Scopus DB conference proceedings (P1a);Straipsnis konferencijos medžiagoje Clarivate Analytics Web of Science ar/ir Scopus (P1a)
Field of Science: Filologija (H004);Philology (H004)
Author(s): Utka, Andrius;Boizou, Loic;Grigonytė, Gintarė;Rimkutė, Erika
Title: Automatic inference of base forms for multiword terms in Lithuanian
Is part of: Human language technologies - the Baltic perspective: the 5th international conference Baltic HLT, Tartu, Estonia, October 4–5, 2012: proceedings. Amsterdam : IOS press, 2012
Extent: p. 27-35
Date: 2012
Series/Report no.: (Frontiers in artificial inteligence and applications, v. 247)
Keywords: Lietuvių kalba;Syntagmatic lemmatisation;Lithuanian;Terminų nustatymas;Sintagminis lemavimas;Term extraction
ISBN: 9781614991328
Abstract: This paper reports on a specific problem of automatic terminology extraction in Lithuanian – base form inference. While the process of lemmatisation is properly carried out by existing tools, problems arise with normalizing multiword terms. It can be described as the discrepancy between the base form (i. e. lemma) of a term and the sequence of the base forms of constituent lexical items within a term. Lithuanian is a strongly inflected language and the lemmatisation of each word separately within a multiword term breaks the syntactic relations expressed by inflection (case, gender, number) which need to be kept in order to ensure the cohesion of the term
Affiliation(s): Humanitarinių mokslų fakultetas
Kompiuterinės lingvistikos centras
Vytauto Didžiojo universitetas
Appears in Collections:Universiteto mokslo publikacijos / University Research Publications

Files in This Item:
marc.xml11.12 kBXMLView/Open

MARC21 XML metadata

Show full item record

Page view(s)

checked on Aug 18, 2019


checked on Aug 18, 2019

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.