Use this URL to cite the publication: https://hdl.handle.net/20.500.12259/240983
Building of parallel and comparable cybersecurity corpora for bilingual terminology extraction
Type of publication
Article in other database (S4)
Author(s)
Author | Affiliation |
---|---|
Mockienė, Liudmila | Mykolo Romerio universitetas |
Laurinaitis, Marius | Mykolo Romerio universitetas |
Rackevičienė, Sigita | Mykolo Romerio universitetas |
Title [en]
Building of parallel and comparable cybersecurity corpora for bilingual terminology extraction
Is part of
Selected papers from the CLARIN annual conference 2021 virtual event, 2021, 27–29 September / edited by M. Monachini and M. Eskevich
Date Issued
Date | Volume | Start Page | End Page |
---|---|---|---|
2022 | 189 | 126 | 138 |
Publisher
Linköping : Linköping University Electronic Press
Abstract
The paper presents English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain within the framework of the DVITAS project. It argues that a system of parallel, comparable, and training corpora for BiTE is particularly useful for less-resourced languages, as it makes it possible to combine the strengths and avoid the weaknesses of comparable and parallel resources efficiently. Special attention is given to the availability of sources in the cybersecurity domain and to issues related to copyright-protected publications, as well as to the data curation performed for building the corpora and depositing them in the CLARIN-LT repository.
Series/Report no.
Linköping electronic conference proceedings
Type of document
type::text::journal::journal article::research article
Language
English (en)
Coverage Spatial
Sweden (SE)
Date Reporting
2022
ISBN (of the container)
9789179294441
ISSN (of the container)
1650-3686
1650-3740
Access Rights
Open Access