Use this URL to cite the dataset: https://hdl.handle.net/20.500.12259/274280
Lithuanian morphologically annotated corpus - MATAS
Type of document
dataset::survey data
Title [en]
Lithuanian morphologically annotated corpus - MATAS
Art Work Nature
0.2
Publisher
Vytauto Didžiojo universitetas / Vytautas Magnus University
Date Issued
2016-11-17
Abstract (en)
MATAS v0.2 - Morphologically annotated Lithuanian corpus (manually checked).
Contains 4 parts: documents (21%), fiction (19%), periodicals (36%), scientific texts (24%).
Wordform count: 1,641,263
Files: 92
Encoding: UTF-8
Tagset: Human-readable (Lithuanian tags), e.g. <word="liepos" lemma="liepa" type="dktv mot.gim vnsk K">
Date: 2014.08.06
Please use the following text to cite this item: Rimkutė E., Daudaravičius V., Utka A. 2007: Morphological Annotation of the Lithuanian Corpus. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics; Workshop Balto-Slavonic Natural Language Processing 2007, Prague, 94–99.
Licence: CLARIN-LT ACA
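The tagset example in the abstract can be read with a few lines of Python. This is a minimal sketch, not a tool shipped with the corpus: it assumes each token is stored in exactly the `<word="..." lemma="..." type="...">` form shown above and that attribute values never contain a double quote. The actual file layout is not described in this record, and the function name `parse_token` and the `tags` key are hypothetical.

```python
import re

# Matches attribute pairs such as word="liepos".
# Assumption: values never contain a double quote.
ATTR_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_token(line: str) -> dict:
    """Parse one annotation line into a dict (hypothetical helper).

    The space-separated 'type' attribute is split into a list
    stored under the key 'tags'.
    """
    attrs = dict(ATTR_RE.findall(line))
    attrs["tags"] = attrs.pop("type", "").split()
    return attrs


# The sample line is the one quoted in the abstract.
sample = '<word="liepos" lemma="liepa" type="dktv mot.gim vnsk K">'
token = parse_token(sample)
print(token)
# {'word': 'liepos', 'lemma': 'liepa', 'tags': ['dktv', 'mot.gim', 'vnsk', 'K']}
```

Splitting the `type` value on whitespace keeps each morphological tag as a separate item, which makes it straightforward to filter tokens by a single tag.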
Is Referenced by
CLARIN-LT
Coverage Spatial
LT
Language
Lithuanian (lt)
URI
URI | Access Rights |
---|---|
https://hdl.handle.net/20.500.12259/274280 | |
https://doi.org/10.7220/20.500.12259/274280 | |
Lithuanian morphologically annotated corpus - MATAS v1.0 | |
Lithuanian morphologically annotated corpus - MATAS v3.0 | |
Data on the CLARIN-LT platform | Dataset (Only Metadata) |
Affiliation(s)
Funding(s)
European Regional Development Fund, No. VP2-3.1-IVPK-12-K: Syntactic and Semantic Analysis System of the Lithuanian Language for Corpus, Internet, and Public Sector (EU funds)