Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.12259/34329
Type of publication: Article in peer-reviewed Lithuanian conference proceedings (P1f)
Field of Science: Computer science (N009)
Author(s): Ciganaitė, Greta; Mackutė-Varoneckienė, Aušra; Krilavičius, Tomas
Title: Text documents clustering
Is part of: Informacinės technologijos : 19-oji tarpuniversitetinė tarptautinė magistrantų ir doktorantų konferencija "Informacinė visuomenė ir universitetinės studijos" (IVUS 2014) : konferencijos pranešimų medžiaga. Kaunas : Technologija, 19 (2014)
Extent: p. 90-93
Date: 2014
Keywords: Clustering; Text document clustering; Similarity measures
Abstract: Large amounts of textual information are generated every day, and existing techniques can hardly cope with such an information flow. However, users expect fast and exact information management and retrieval tools. Clustering is a well-known technique for grouping similar data, thereby making it more manageable and usable. Text clustering is an adaptation of clustering to a very specific kind of data: documents. However, it is not directly transferable to any language, i.e. the specifics of a language influence performance considerably, as results for English and other well-investigated languages show. In this paper we apply different distances and clustering approaches to Lithuanian data, discuss the results, and provide recommendations for clustering documents in Lithuanian.
Internet: https://hdl.handle.net/20.500.12259/34329
https://eltalpykla.vdu.lt/1/34329
Affiliation(s): Baltijos pažangių technologijų institutas, Vilnius
Informatikos fakultetas
Taikomosios informatikos katedra
Vytauto Didžiojo universitetas
Appears in Collections: 3. Konferencijų medžiaga / Conference materials
Universiteto mokslo publikacijos / University Research Publications

