Please use this identifier to cite or link to this item:
Type of publication: research article
Type of publication (PDB): Straipsnis konferencijos medžiagoje kitose duomenų bazėse / Article in conference proceedings in other databases (P1c)
Field of Science: Informatika / Informatics (N009)
Author(s): Mandravickaitė, Justina;Krilavičius, Tomas
Title: Quantitative analysis of textual genres: comparison of English and Lithuanian
Is part of: CEUR Workshop proceedings [electronic resource]: IVUS 2018, International conference on information technologies, Kaunas, Lithuania, 27 April, 2018. Aachen : CEUR-WS, 2018, Vol. 2145
Extent: p. 61-67
Date: 2018
Keywords: Quantitative genre analysis;Frequency structure of text;Vocabulary richness;Stylometry
Abstract: We report an ongoing study on quantitative characteristics of texts written in different genres. At this stage, we compared Lithuanian and English texts in terms of genres. We used 16 indices which describe frequency structure of text as well as indicate several other characteristics of written texts. Initial study showed significant differences of indices calculated for genre pairs of the same language. Hierarchical clustering revealed possible applications in using them as features for text categorization/classification by genre, though better results were achieved for Lithuanian texts
Affiliation(s): Baltijos pažangių technologijų institutas, Vilnius
Baltijos pažangiųjų technologijų institutas
Taikomosios informatikos katedra
Vilniaus universitetas
Vytauto Didžiojo universitetas
Appears in Collections:Universiteto mokslo publikacijos / University Research Publications

Files in This Item:
marc.xml6.65 kBXMLView/Open

MARC21 XML metadata

Show full item record
Export via OAI-PMH Interface in XML Formats
Export to Other Non-XML Formats

CORE Recommender

Page view(s)

checked on May 1, 2021


checked on May 1, 2021

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.