Deep learning assessment of syllable affiliation of intervocalic consonants

Liu, Zirui; Xu, Yi (2023) Deep learning assessment of syllable affiliation of intervocalic consonants. The Journal of the Acoustical Society of America, 153 (2), pp. 848-866. DOI: 10.1121/10.0017117. Green open access

Liu_Xu_JASA2023_accepted.pdf - Accepted Version


Abstract

In English, a sentence like “He made out our intentions.” could be misperceived as “He may doubt our intentions.” because the coda /d/ sounds like it has become the onset of the next syllable. The nature and occurrence conditions of this resyllabification phenomenon are unclear, however. Previous empirical studies mainly relied on listener judgment, limited acoustic evidence such as voice onset time, or average formant values to determine the occurrence of resyllabification. This study tested the hypothesis that resyllabification is a coarticulatory reorganisation that realigns the coda consonant with the vowel of the next syllable. Deep learning in conjunction with dynamic time warping (DTW) was used to assess syllable affiliation of intervocalic consonants. The results suggest that convolutional neural network- and recurrent neural network-based models can detect cases of resyllabification using Mel-frequency spectrograms. DTW analysis shows that sequences inferred by the neural networks to be resyllabified are acoustically more similar to their onset counterparts than to their canonical productions. A binary classifier further suggests that, similar to the genuine onsets, the inferred resyllabified coda consonants are coarticulated with the following vowel. These results are interpreted with an account of resyllabification as a speech-rate-dependent coarticulatory reorganisation mechanism in speech.
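
For readers unfamiliar with DTW, the comparison described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`frame_distance`, `dtw_distance`), the Euclidean frame distance, and the two-dimensional toy frames are all assumptions made for illustration, standing in for the Mel-frequency spectrogram frames used in the study. The sketch only shows the general idea of asking whether a candidate sequence aligns more cheaply with an onset template than with a canonical coda template.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two equal-length feature frames (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Classic DTW alignment cost between two sequences of frames.

    No warping window and no length normalisation; both are simplifications.
    """
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] = cheapest alignment of the first i frames of seq_a
    # with the first j frames of seq_b.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # seq_b frame j is stretched over another seq_a frame
                cost[i][j - 1],      # seq_a frame i is stretched over another seq_b frame
                cost[i - 1][j - 1],  # both sequences advance together
            )
    return cost[n][m]


# Hypothetical toy data: each inner list is one feature frame.
onset_template = [[0.0, 1.0], [1.0, 1.0], [2.0, 0.0]]
canonical_template = [[2.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
# A slower rendition of the onset pattern (first frame repeated).
candidate = [[0.0, 1.0], [0.0, 1.0], [1.0, 1.0], [2.0, 0.0]]

d_onset = dtw_distance(candidate, onset_template)
d_canonical = dtw_distance(candidate, canonical_template)
print("closer to onset" if d_onset < d_canonical else "closer to canonical")
```

Because DTW permits one frame to align with several frames of the other sequence, the slowed-down candidate still matches the onset template despite the difference in duration, which is why the technique suits comparisons across speech rates.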

Type: Article
Title: Deep learning assessment of syllable affiliation of intervocalic consonants
Open access status: An open access version is available from UCL Discovery
DOI: 10.1121/10.0017117
Publisher version: https://doi.org/10.1121/10.0017117
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Speech, Hearing and Phonetic Sciences
URI: https://discovery.ucl.ac.uk/id/eprint/10165808