Type of publication: master thesis
Field of Science: Informatika / Informatics (N009)
Author(s): Gebremichael Tesfagergish, Senait
Supervisor: Kapociute Dzikiene, Jurgita
Title: Deep learning-based part-of-speech tagging of the Ethiopic language
Extent: 69 p.
Date: 17-Jun-2020
Keywords: Deep neural networks; Feed forward neural networks; Convolutional neural networks; Long short-term memory; Bidirectional long short-term memory; Word2Vec embeddings; Nagaoka corpus; Tigrinya part-of-speech tagging
Abstract: Deep Neural Networks have proven highly effective in many NLP tasks for various languages. Unfortunately, some resource-scarce languages, such as Tigrinya, still receive too little attention, so many NLP applications for them, such as part-of-speech tagging, are still in their early stages. Consequently, the main objective of this research is to offer effective part-of-speech tagging solutions for the Tigrinya language given a rather small training corpus. In this work, Deep Neural Network classifiers (i.e., Feed Forward Neural Network, Long Short-Term Memory, Bidirectional LSTM, and Convolutional Neural Network) are investigated by applying them on top of separately trained distributional neural Word2Vec embeddings. To find the most accurate solutions, the DNN models are optimized both manually and automatically. Although automatic hyperparameter optimization performs well with the Convolutional Neural Network, the manually optimized Bidirectional Long Short-Term Memory method achieves the highest overall accuracy, 91%.
Appears in Collections: 2020 m. (IF mag.)

Files in This Item:
senait_gebremichael_tesfagergish_md.pdf (2.14 MB, Adobe PDF, Restricted Access)

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.