Type of publication: research article
Field of Science: Informatika / Informatics (N009)
Author(s): Norkevičius, Giedrius;Raškinis, Gailius
Title: Inter-speaker speech rate normalization for phone duration modeling of Lithuanian
Is part of: The third Baltic Conference on Human Language Technologies, Kaunas, Lithuania, October 4-5, 2007 : proceedings. Kaunas : Vytauto Didžiojo universitetas, 2007
Extent: p. 219-225
Date: 2007
Keywords: Inter speaker speech;Rate normalization;Duration model;Phone duration;CART;Speech rate;Variability;Lithuanian
ISBN: 9789955704539
Abstract: Previous research on CART-based phone duration modeling of Lithuanian showed that a model built on multi-speaker data performs worse than a model built on single-speaker data. This indicates that inter-speaker normalization is needed before multi-speaker corpora can be used to train duration models. A three-step procedure was applied for speech rate normalization: 1. Correlation-based clustering of vectors of duration averages, calculated for each speaker, was employed to identify language-specific groups of phones; 2. Speech rate coefficients were calculated, one for every speaker and every group of phones; 3. The data were normalized according to the calculated coefficients. Experiments were performed on the VDU-AB20 corpus, which contains 300 thousand samples of vowels and 400 thousand samples of consonants, and were evaluated using CART-based duration modeling. The results achieved by the model built on multi-speaker data after inter-speaker normalization (correlation: 0.8603 and 0.787; RMSE: 0.0228 and 0.0179 for vowels and consonants, respectively) were better than the results of the model built on single-speaker data.
Affiliation(s): Vytauto Didžiojo universitetas
Appears in Collections: Universiteto mokslo publikacijos / University Research Publications
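
The three-step procedure named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give the clustering algorithm, the coefficient formula, or the normalization formula. The sketch assumes a greedy threshold clustering on Pearson correlation, a coefficient defined as a speaker's mean duration in a phone group divided by the overall mean duration in that group, and normalization by dividing each duration by its coefficient. All function names, the threshold value, and the toy durations are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5 if vx and vy else 0.0


def speaker_phone_means(samples):
    """Map (speaker, phone, duration) samples to {phone: {speaker: mean duration}}."""
    acc = defaultdict(lambda: defaultdict(list))
    for speaker, phone, dur in samples:
        acc[phone][speaker].append(dur)
    return {p: {s: mean(d) for s, d in by_spk.items()} for p, by_spk in acc.items()}


def cluster_phones(means, speakers, threshold=0.8):
    """Step 1 (assumed greedy variant): a phone joins the first group whose
    first member correlates with it at or above the threshold.
    Assumes every speaker has at least one sample of every phone."""
    groups = []
    for phone in sorted(means):
        vec = [means[phone][s] for s in speakers]
        for group in groups:
            seed = [means[group[0]][s] for s in speakers]
            if pearson(vec, seed) >= threshold:
                group.append(phone)
                break
        else:
            groups.append([phone])
    return groups


def rate_coefficients(samples, groups):
    """Step 2 (assumed definition): speaker mean / overall mean, per phone group."""
    group_of = {p: i for i, g in enumerate(groups) for p in g}
    per_spk, per_grp = defaultdict(list), defaultdict(list)
    for speaker, phone, dur in samples:
        g = group_of[phone]
        per_spk[(speaker, g)].append(dur)
        per_grp[g].append(dur)
    coeffs = {k: mean(v) / mean(per_grp[k[1]]) for k, v in per_spk.items()}
    return coeffs, group_of


def normalize(samples, coeffs, group_of):
    """Step 3 (assumed): divide each duration by its speaker/group coefficient."""
    return [(s, p, d / coeffs[(s, group_of[p])]) for s, p, d in samples]


# Toy durations in seconds, invented for illustration only.
samples = [
    ("s1", "a", 0.08), ("s2", "a", 0.10), ("s3", "a", 0.12),
    ("s1", "o", 0.09), ("s2", "o", 0.11), ("s3", "o", 0.13),
    ("s1", "k", 0.06), ("s2", "k", 0.05), ("s3", "k", 0.06),
    ("s1", "t", 0.07), ("s2", "t", 0.06), ("s3", "t", 0.07),
]
speakers = ["s1", "s2", "s3"]

groups = cluster_phones(speaker_phone_means(samples), speakers)
coeffs, group_of = rate_coefficients(samples, groups)
normalized = normalize(samples, coeffs, group_of)
print(groups)
```

With these definitions, every speaker's mean duration within a phone group equals the overall group mean after normalization, which removes the per-speaker rate offset before a duration model is trained on the pooled data.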


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.