Type of publication: Article in Clarivate Analytics Web of Science or Scopus DB conference proceedings (P1a)
Field of Science: Informatics (N009)
Author(s): Krilavičius, Tomas; Medelis, Žygimantas; Kapočiūtė-Dzikienė, Jurgita; Žalandauskas, Tomas
Title: News media analysis using focused crawl and natural language processing : case of Lithuanian news websites
Is part of: Information and software technologies : 18th international conference, ICIST 2012, Kaunas, Lithuania, September 13-14, 2012 : proceedings. Heidelberg : Springer, 2012
Extent: p. 48-61
Date: 2012
Series/Report no.: Communications in computer and information science, vol. 319, ISSN 1865-0929
Keywords: Natural language processing; Focused crawl; Lithuanian language
ISBN: 978364233071
Abstract: The amount of information that is created, used or stored is growing exponentially, and the types of data sources are diverse. Most of it is available as unstructured text. Moreover, a considerable part of it is available online, usually accessible as Internet resources. It is too expensive or even impossible for humans to analyze all these resources for the required information. Classical Information Technology techniques are not sufficient to process such amounts of information and render it in a form convenient for further analysis. Information Retrieval (IR) and Natural Language Processing (NLP) provide a number of instruments for information analysis and retrieval. In this paper we present a combined application of NLP and IR for Lithuanian media analysis. We demonstrate that a combination of IR and NLP tools, with appropriate changes, can be successfully applied to Lithuanian media texts.
Appears in Collections: University Research Publications






Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.