UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis

Birkholz, P; Martin, L; Xu, Y; Scherbaum, S; Neuschaefer-Rube, C; (2017) Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis. Computer Speech & Language , 41 pp. 116-127. 10.1016/j.csl.2016.06.004. Green open access

[thumbnail of Birkholz-CSL.pdf]
Preview
Text
Birkholz-CSL.pdf - Accepted Version

Download (1MB) | Preview

Abstract

Vocal emotions, as well as different speaking styles and speaker traits, are characterized by a complex interplay of multiple prosodic features. Natural sounding speech synthesis with the ability to control such paralinguistic aspects requires the manipulation of the corresponding prosodic features. With traditional concatenative speech synthesis it is easy to manipulate the “primary” prosodic features pitch, duration, and intensity, but it is very hard to individually control “secondary” prosodic features like phonation type, vocal tract length, articulatory precision and nasality. These secondary features can be controlled more directly with parametric synthesis methods. In the present study we analyze the ability of articulatory speech synthesis to control secondary prosodic features by rule. To this end, nine German words were re-synthesized with the software VocalTractLab 2.1 and then manipulated in different ways at the articulatory level to vary vocal tract length, articulatory precision and degree of nasality. Listening tests showed that most of the intended prosodic manipulations could be reliably identified with recognition rates between 77% and 96%. Only the manipulations to increase articulatory precision were hardly recognized. The results suggest that rule-based manipulations in articulatory synthesis are generally sufficient for the convincing synthesis of secondary prosodic features at the word level.

Type: Article
Title: Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.csl.2016.06.004
Publisher version: http://dx.doi.org/10.1016/j.csl.2016.06.004
Language: English
Additional information: Copyright © 2016 Elsevier Ltd. All rights reserved. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
Keywords: Science & Technology, Technology, Computer Science, Artificial Intelligence, Computer Science, Prosody, Feature manipulation, Articulatory synthesis, Speech Synthesis System, Vowel Reduction, Voice Quality, Emotion, Expression, Personality, Mandarin, Speaking, English, Stress
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Speech, Hearing and Phonetic Sciences
URI: https://discovery.ucl.ac.uk/id/eprint/1503252
Downloads since deposit
296Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item