Please use this identifier to cite or link to this item:https://hdl.handle.net/20.500.12259/34331
Type of publication: Article in the peer-reviewed Lithuanian conference proceedings (P1f);Straipsnis recenzuojamoje Lietuvos konferencijos medžiagoje (P1f)
Field of Science: Computer science (N009);Informatika (N009)
Author(s): Man, Ka Lok;Medelis, Žygimantas;Krilavičius, Tomas;Širvinskas, Raimondas
Title: Language identification for Lithuanian, Russian and Azeri languages
Is part of: Informacinės technologijos : 19-oji tarpuniversitetinė tarptautinė magistrantų ir doktorantų konferencija "Informacinė visuomenė ir universitetinės studijos" (IVUS 2014) : konferencijos pranešimų medžiaga. Kaunas : Technologija, 2014, 19
Extent: p. 167-171
Date: 2014
Keywords: Language identification;Computer systems;Machine learning;Natural language processing;Kalbos atpažinimas
Abstract: Language identification is an important part of Natural Language Processing, because most of techniques are language sensitive, and therefore in multi-language systems language should be identified before further processing steps. Techniques and tools for more popular languages are well defined and are available in commercial and open source tools, but are not defined for less popular languages. In this work we investigate techniques for Lithuanian, Russian and Azeri (Azjerbaijani) languages. Corpora for these and similar languages (Latvian, Ukrainian, Belarusian, Turkish and Turkmen (Turkman) were collected and prepared. Selected approaches were trained. Results were evaluated using precision, recall and F-score
Internet: https://hdl.handle.net/20.500.12259/34331
https://hdl.handle.net/20.500.12259/34331
Affiliation(s): Informatikos fakultetas
Taikomosios informatikos katedra
Vytauto Didžiojo universitetas
Appears in Collections:3. Konferencijų medžiaga / Conference materials
Universiteto mokslo publikacijos / University Research Publications

Files in This Item:
marc.xml8.61 kBXMLView/Open

MARC21 XML metadata

Show full item record

Page view(s)

82
checked on May 19, 2019

Download(s)

10
checked on May 19, 2019

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.