Please use this identifier to cite or link to this item:https://hdl.handle.net/20.500.12259/36092
Full metadata record
DC FieldValueLanguage
dc.contributor.authorStanikūnas, Daumantas-
dc.contributor.authorMandravickaitė, Justina-
dc.contributor.authorKrilavičius, Tomas-
dc.coverage.spatialDE-
dc.date.accessioned2018-05-08T07:29:39Z-
dc.date.available2018-05-08T07:29:39Z-
dc.date.issued2017-
dc.identifier.issn16130073-
dc.identifier.otherVDU02-000021477-
dc.identifier.urihttps://eltalpykla.vdu.lt/handle/1/36092-
dc.identifier.urihttp://ceur-ws.org/Vol-1852/p01.pdf-
dc.description.abstractConstant developments in information and computer technologies make it possible to handle constantly increasing amount of data, thereby expanding the research possibilities. In this article, we discuss and compare distance and similarity measures used in stylometric analysis which could be applied to analyze Lithuanian texts. As corpus for the analysis, transcripts of parliamentary debates by two politicians of the Lithuanian Parliament were chosen. Furthermore, comparison of distance measures, stylometric analysis and visualization were performed. Objective of the experiment was to identify what measures would perform better when executing stylometric analysis of Lithuanian texts and explore where these differences in the performance occur. Summarizing the experiment results, the recommendations are as follow: number of Most Frequent Words used should be at least 1000, Eder's Simple Delta measure can be used in general stylometric analysis of transcriptions of parliamentary debates of Lithuanian Parliament, in a case when Most Frequent Words are limited to 2000, Binomial Index shows an increase in performance over Eder's Simple Delta and thus it is more suitableen
dc.description.sponsorshipBaltijos pažangių technologijų institutas, Vilnius-
dc.description.sponsorshipBaltijos pažangiųjų technologijų institutas-
dc.description.sponsorshipMatematikos ir statistikos katedra-
dc.description.sponsorshipTaikomosios informatikos katedra-
dc.description.sponsorshipVilniaus universitetas-
dc.description.sponsorshipVytauto Didžiojo universitetas-
dc.format.extentp. 1-7-
dc.language.isoen-
dc.relation.ispartofCEUR Workshop proceedings [electronic resource]: ICYRIME 2017 : proceedings of the symposium for young researchers in informatics, mathematics and engineering, Kaunas, Lithuania, April 28, 2017. Aachen : CEUR-WS, 2017, Vol. 1852-
dc.relation.isreferencedbyScopus-
dc.rightsSutarties data 2018-05-07, nr. B000326, laisvai prieinamas internetelt_LT
dc.rights.urihttp://www.sherpa.ac.uk/romeo/search.php?issn=1613-0073-
dc.subjectStylometryen
dc.subjectComputational stylisticsen
dc.subjectStatistical analysisen
dc.subjectData visualizationen
dc.subject.classificationStraipsnis konferencijos medžiagoje kitose duomenų bazėse / Article in conference proceedings in other databases (P1c)-
dc.subject.otherMatematika / Mathematics (N001)-
dc.titleComparison of distance and similarity measures for stylometric analysis of Lithuanian textsen
dc.typeresearch article-
dcterms.bibliographicCitation16-
dc.date.updated2020-03-25T12:18Z-
local.object{"source": {"code": "vdu", "handle": "21477"}, "publisher": {"other": ["CEUR-WS"], "list": false}, "db": {"clarivate": false, "scopus": true, "list": true}, "issn": ["1613-0073"], "code": "P1c", "subject": ["N001"], "url": ["https://eltalpykla.vdu.lt/handle/1/36092", "http://ceur-ws.org/Vol-1852/p01.pdf"], "country": "DE", "language": "en", "area": "N", "original": true, "pages": 7, "sheets": 0.5, "timestamp": "20200325121801.0", "account": {"year": 2017, "late": false}, "na": 3, "nip": 0, "affiliation": [{"contribution": 0.33333333333333, "aip": 1, "country": ["LT"], "rel": "aut", "org": [{"create": true, "contribution": 0.33333333333333, "name": "Vytauto Didžiojo universitetas", "id": "111950396", "level": "0", "type": "uni", "research": "1", "status": "0", "unit": {"name": "Informatikos fakultetas", "id": "04", "level": "1", "type": "fak", "research": "1", "status": "0", "unit": {"name": "Matematikos ir statistikos katedra", "id": "0402", "level": "2", "type": "kat", "research": "1", "status": "0"}}}], "id": "5568306F5057F59C78E42746BEB72FE9", "lname": "Stanikūnas", "fname": "Daumantas", "status": "0", "name": "Stanikūnas, Daumantas"}, {"contribution": 0.33333333333333, "aip": 2, "country": ["LT"], "rel": "aut", "org": [{"create": false, "contribution": 0.16666666666667, "name": "Baltijos pažangiųjų technologijų institutas", "id": "301846141"}, {"create": false, "contribution": 0.16666666666667, "name": "Vilniaus universitetas", "id": "211950810"}], "id": "FD69D62BBC3F4592C35919291E35CCA8", "lname": "Mandravickaitė", "fname": "Justina", "status": "1", "name": "Mandravickaitė, Justina"}, {"contribution": 0.33333333333333, "aip": 2, "country": ["LT"], "rel": "aut", "org": [{"create": true, "contribution": 0.16666666666667, "name": "Vytauto Didžiojo universitetas", "id": "111950396", "level": "0", "type": "uni", "research": "1", "status": "1", "unit": {"name": "Informatikos fakultetas", "id": "04", "level": "1", "type": "fak", "research": "1", "status": "1", "unit": {"name": "Taikomosios informatikos katedra", "id": "0401", "level": "2", "type": "kat", "research": "1", "status": "1"}}}, {"create": false, "contribution": 0.16666666666667, "name": "Baltijos pažangių technologijų institutas, Vilnius", "id": "301846141"}], "id": "DD5A5F9F9ADFA0BC37D24E1184ED5391", "lname": "Krilavičius", "fname": "Tomas", "status": "1", "name": "Krilavičius, Tomas"}]}-
local.typeP-
item.fulltextWith Fulltext-
item.grantfulltextopen-
crisitem.author.deptMatematikos ir statistikos katedra-
crisitem.author.deptTaikomosios informatikos katedra-
crisitem.author.deptTaikomosios informatikos katedra-
Appears in Collections:3. Konferencijų medžiaga / Conference materials
Universiteto mokslo publikacijos / University Research Publications
Files in This Item:
Show simple item record
Export via OAI-PMH Interface in XML Formats
Export to Other Non-XML Formats


CORE Recommender

Page view(s)

112
checked on Jun 6, 2021

Download(s)

20
checked on Jun 6, 2021

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.