UCL Discovery

Modelling microprosodic effects can lead to an audible improvement in articulatory synthesis

Krug, PK; Gerazov, B; van Niekerk, DR; Xu, A; Xu, Y; Birkholz, P; (2021) Modelling microprosodic effects can lead to an audible improvement in articulatory synthesis. Journal of the Acoustical Society of America, 150 (2) pp. 1209-1217. 10.1121/10.0005876. Green open access

Abstract

When pitch is explicitly modelled for parametric speech synthesis, microprosodic variations of the fundamental frequency f0 are usually disregarded by current intonation models. While there are numerous studies dealing with the nature and the origin of microprosody, little research has been done on its audibility and its effect on the naturalness of synthetic speech. In this work, the influence of obstruent-related microprosodic variations on the perceived naturalness of articulatory speech synthesis was studied. A small corpus of 20 German words and sentences was re-synthesized using the state-of-the-art articulatory synthesizer VocalTractLab. The pitch contours of the real utterances were extracted and fitted with the Target-Approximation-Model. After the real microprosodic variations were removed from the obtained pitch contours, synthetic variations were applied based on a microprosody model. Subsequently, multiple stimuli with different microprosody amplitudes were synthesized and evaluated in a listening experiment. The results indicate that microprosodic variations are barely audible, but can lead to a greater perceived naturalness of the synthesized speech in certain cases.
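The step of applying synthetic variations at several amplitudes can be pictured with a minimal sketch. It is not the microprosody model used in the paper, whose details are not given in this record. It only assumes that an obstruent release adds a small f0 offset that decays exponentially, and that an amplitude factor scales this offset to produce different stimuli. The function name, the parameters and all numeric values (peak offset in semitones, decay time, the flat 120 Hz contour) are hypothetical.

```python
import math


def add_microprosody(f0_hz, times_ms, onsets_ms, amplitude=1.0,
                     peak_semitones=1.0, decay_ms=30.0):
    """Return a copy of f0_hz with a decaying f0 offset after each onset.

    Illustrative assumption, not the paper's model: every obstruent
    release at onsets_ms adds an offset of peak_semitones that decays
    exponentially with time constant decay_ms. A negative peak_semitones
    lowers f0 instead of raising it.
    """
    out = []
    for t, f0 in zip(times_ms, f0_hz):
        offset_st = 0.0
        for onset in onsets_ms:
            if t >= onset:
                offset_st += peak_semitones * math.exp(-(t - onset) / decay_ms)
        # Scale the offset by the amplitude factor, then convert
        # semitones to a frequency ratio.
        out.append(f0 * 2.0 ** (amplitude * offset_st / 12.0))
    return out


# Flat 120 Hz contour as a stand-in for a smooth, fitted pitch contour.
times_ms = list(range(0, 205, 5))        # 0 to 200 ms in 5 ms steps
smooth_f0 = [120.0] * len(times_ms)

# One stimulus per amplitude factor; 0.0 leaves the smooth contour unchanged.
stimuli = {a: add_microprosody(smooth_f0, times_ms, onsets_ms=[50], amplitude=a)
           for a in (0.0, 1.0, 2.0)}
```

With an amplitude of 0.0 the smooth contour is returned unchanged, which gives a baseline without microprosody; larger factors exaggerate the perturbation.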

Type: Article
Title: Modelling microprosodic effects can lead to an audible improvement in articulatory synthesis
Open access status: An open access version is available from UCL Discovery
DOI: 10.1121/10.0005876
Publisher version: https://doi.org/10.1121/10.0005876
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Speech, Hearing and Phonetic Sciences
URI: https://discovery.ucl.ac.uk/id/eprint/10135037