Training a Vocal Tract Synthesiser to imitate speech using Distal Supervised Learning

Howard, I; Huckvale, M; (2005) Training a Vocal Tract Synthesiser to imitate speech using Distal Supervised Learning. In: Proceedings of the 10th International Conference on Speech and Computer (SPECOM 2005). (pp. 159-162). University of Patras, Wire Communications Laboratory: Patras, Greece. Green open access

Abstract

Imitation is a powerful mechanism by which both animals and people can learn useful behavior by copying the actions of others. We adopt this approach as a means to control an articulatory speech synthesizer. The goal of our project is to build a system that can learn to mimic speech using its own vocal tract. We approach this task by training an inverse mapping between the synthesizer’s control parameters and their auditory consequences. In this paper we compare the direct estimation of this inverse model with the distal supervised learning scheme proposed by Jordan & Rumelhart (1992). Both of these approaches involve a babbling phase, which is used to learn the auditory consequences of the articulatory controls. We show that both schemes perform well on speech generated by the synthesizer itself, when no normalization is needed, but that distal learning provides slightly better performance with speech generated by a real human subject.
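
The two schemes compared in the abstract can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the "synthesizer" is a hypothetical linear map `A` from 3 control parameters to 2 auditory features, both models are linear, and the names `synthesize`, `W_direct`, `W_distal` and `G` are invented here. Direct estimation regresses controls on sounds from the babbling data. Distal learning first fits a forward model `G` from the same data, then trains the inverse model by pushing the auditory error back through the frozen forward model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for the synthesizer: 3 controls -> 2 auditory features.
# The real articulatory synthesizer is nonlinear; this matrix is an assumption.
A = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])

def synthesize(controls):
    """Map control parameters (rows) to auditory features (rows)."""
    return controls @ A

# Babbling phase: issue random controls and record their auditory consequences.
X_babble = rng.uniform(-1.0, 1.0, size=(500, 3))
Y_babble = synthesize(X_babble)

# Direct inverse: least-squares regression from sounds back to controls.
W_direct, *_ = np.linalg.lstsq(Y_babble, X_babble, rcond=None)

# Distal learning, step 1: fit a forward model (controls -> sounds).
G, *_ = np.linalg.lstsq(X_babble, Y_babble, rcond=None)

# Distal learning, step 2: train the inverse model W so that the forward
# model's prediction for the proposed controls matches the target sound.
# The error is measured in auditory space and propagated back through G,
# which stays fixed during this step.
W_distal = np.zeros((2, 3))
C = Y_babble.T @ Y_babble / len(Y_babble)
step = 1.0 / (2.0 * np.linalg.eigvalsh(C).max() * np.linalg.eigvalsh(G.T @ G).max())
for _ in range(500):
    auditory_error = Y_babble - Y_babble @ W_distal @ G
    grad = -2.0 * Y_babble.T @ auditory_error @ G.T / len(Y_babble)
    W_distal -= step * grad

def imitation_error(W, targets):
    """Mean squared auditory error when the synthesizer imitates the targets."""
    return float(np.mean((targets - synthesize(targets @ W)) ** 2))

# Targets produced by the synthesizer itself, so no normalization is needed.
targets = synthesize(rng.uniform(-1.0, 1.0, size=(100, 3)))
err_direct = imitation_error(W_direct, targets)
err_distal = imitation_error(W_distal, targets)
print(err_direct, err_distal)
```

In this linear, noise-free toy both inverse models reproduce the synthesizer's own output almost exactly, so it cannot show the difference the abstract reports for human speech. It only illustrates where the training signal comes from in each scheme.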

Type: Proceedings paper
Title: Training a Vocal Tract Synthesiser to imitate speech using Distal Supervised Learning
Event: SpeCom: 10th International Conference on Speech and Computer 2005, Patras, Greece.
Open access status: An open access version is available from UCL Discovery
Publisher version: http://www.meta-net.eu/meta-system/kb/communicatio...
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Speech, Hearing and Phonetic Sciences
URI: https://discovery.ucl.ac.uk/id/eprint/74334