TY - JOUR
T1 - PERCEPT-US
T2 - 26th Interspeech Conference 2025
AU - Eads, Amanda
AU - Kabakoff, Heather
AU - Benway, Nina
AU - Hitchcock, Elaine
AU - Preston, Jonathan
AU - McAllister, Tara
N1 - Publisher Copyright: © 2025 International Speech Communication Association. All rights reserved.
PY - 2025
Y1 - 2025
N2 - We present PERCEPT-US, a multimodal corpus of audio and ultrasound data from 126 American English-speaking children aged 8 to 17. Collected during clinical trials investigating biofeedback in speech therapy for residual speech sound disorder (RSSD), it includes 80 children with no history of speech-language-hearing challenges and 46 children with no challenges other than RSSD. The corpus is balanced by sex (58 females, 68 males) and stratified by age and speech therapy history. Participants completed syllable-, word-, and sentence-level speech production tasks, totaling 24,699 utterances, with a focus on American English rhotics. The corpus demonstration uses mixed-effects linear regression on segmented acoustic and labeled ultrasound data from 69 children to show that rhotic tongue shape categories significantly predict the acoustics of higher formant frequency values. This is the first study to demonstrate this acoustic-articulatory relationship in a large pediatric sample.
KW - acoustic-articulatory relationships
KW - child speech corpus
KW - English rhotics
KW - ultrasound
UR - https://www.scopus.com/pages/publications/105020062177
U2 - 10.21437/Interspeech.2025-2407
DO - 10.21437/Interspeech.2025-2407
M3 - Conference article
AN - SCOPUS:105020062177
SN - 2308-457X
SP - 2805
EP - 2809
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Y2 - 17 August 2025 through 21 August 2025
ER -