UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Intermediate features are not useful for tone perception

Chen, Y; Xu, Y; (2020) Intermediate features are not useful for tone perception. In: Proceedings of the 10th International Conference on Speech Prosody 2020. (pp. pp. 513-517). ISCA: Tokyo, Japan. Green open access

[thumbnail of Chen_Xu_SP2020.pdf]
Preview
Text
Chen_Xu_SP2020.pdf - Published Version

Download (726kB) | Preview

Abstract

Many theories assume that speech perception is done by first extracting features like the distinctive features, tonal features or articulatory gestures before recognizing phonetic units such as segments and tones. But it is unclear how exactly extracted features can lead to effective phonetic recognition. In this study we explore this issue by using support vector machine (SVM), a supervised machine learning model, to simulate the recognition of Mandarin tones from F0 in continuous speech. We tested how well a five-level system or a binary distinctive features system can identify Mandarin tones by training the SVM model with F0 trajectories with reduced temporal and frequency resolutions. At full resolution, the recognition rates were 97% and 86% based on the semitone and Hertz scales, respectively. At reduced temporal resolution, there was no clear decline in recognition rate until two points per syllable. At reduced frequency resolution, the recognition rate dropped rapidly: by the level with 5 bands, the accuracy was around 40% based on both Hertz and semitone scales. These results suggest that intermediate featural representations provide no benefit for tone recognition, and are unlikely to be critical for tone perception.

Type: Proceedings paper
Title: Intermediate features are not useful for tone perception
Event: 10th International Conference on Speech Prosody 2020
Open access status: An open access version is available from UCL Discovery
DOI: 10.21437/SpeechProsody.2020-105
Publisher version: http://dx.doi.org/10.21437/SpeechProsody.2020-105
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Speech, Hearing and Phonetic Sciences
URI: https://discovery.ucl.ac.uk/id/eprint/10114793
Downloads since deposit
90Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item