Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.12259/34243
Type of publication: Article
Author(s): Kazlauskienė, Asta; Raškinis, Gailius
Title: From speech corpus to intonation corpus: clustering phrase pitch contours of Lithuanian
Is part of: NEALT proceedings series. ISSN 1736-6305. 2013, [Vol.] 16, p. 353-363
Date: 2013
Keywords: Corpus; Prosody; Intonation; Pitch; Syllable; Dynamic time warping; K-means clustering
Abstract: This paper presents research carried out in preparation for compiling a Lithuanian intonation corpus. The main objective was to discover characteristic patterns of Lithuanian intonation by clustering the pitch contours of intermediate intonation phrases. The paper covers the set of procedures used to extend an ordinary speech corpus so that it becomes suitable for intonation analysis. The intonation analysis included pitch extraction, pitch normalization, estimation of the representative frequency of a syllable, measurement of inter-phrase similarity, k-means phrase clustering, and visualisation of the clustering results. These computational procedures were applied to 23 hours of read speech containing 41417 phrases. The clustering results revealed some interesting intonation patterns of Lithuanian that could be related to well-known linguistic-prosodic phenomena. Language independence is an important feature of the computational procedures covered by this paper: given speech waveforms and knowledge of phone and phrase boundaries, they can be used to analyse the intonation of other languages.
Internet: https://eltalpykla.vdu.lt/1/34243
https://hdl.handle.net/20.500.12259/34243
Appears in Collections: 1. Articles
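
The abstract and keywords name dynamic time warping as the basis of inter-phrase similarity and k-means as the clustering method. As a rough illustration only, the sketch below computes a textbook DTW cost between two phrases and performs the assignment step of k-means under that cost. It is not taken from the paper: the function names `dtw_distance` and `nearest_centroid`, the absolute-difference local cost, and the representation of a phrase as a list of one normalized pitch value per syllable are all assumptions made for this example.

```python
from math import inf


def dtw_distance(a, b):
    """Textbook dynamic time warping cost between two numeric sequences.

    Assumption: each sequence holds one normalized pitch value per syllable.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both contours must be non-empty")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])  # assumed local cost
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b[j-1] is reused
                cost[i][j - 1],      # b advances, a[i-1] is reused
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def nearest_centroid(contour, centroids):
    """Assignment step of k-means: index of the closest centroid under DTW."""
    return min(
        range(len(centroids)),
        key=lambda k: dtw_distance(contour, centroids[k]),
    )


# Hypothetical contours: a rise over three syllables compared with
# a fall and with a slower rise over four syllables.
rising = [1.0, 2.0, 3.0]
centroids = [[3.0, 2.0, 1.0], [1.0, 2.0, 2.0, 3.0]]
cluster = nearest_centroid(rising, centroids)
```

Because DTW lets one syllable align with several, phrases with different syllable counts can be compared directly, which is presumably why a warping-based similarity suits phrase contours. A full k-means loop would also need a rule for updating centroids from sequences of unequal length, which this sketch leaves out.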

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.