TY - JOUR
T1 - Automatically measuring L2 speech fluency without the need of ASR
T2 - 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018
AU - Fontan, Lionel
AU - Le Coz, Maxime
AU - Detey, Sylvain
N1 - Funding Information:
This research has been supported by the Japanese Society for the Promotion of Sciences through Grants-in-Aid for Scientific Research (B) No. 23320121 and No. 15H03228 to Sylvain Detey. We wish to thank Yuji Kawaguchi, Mariko Kondo, Corentin Barcat, Isabelle Racine, Jacques Durand, Mito Matsuzawa, Jean-Luc Nespoulous, Tsuyoshi Umeno, Kaori Ohmura, Xavier Aumont, as well as all the students who participated in the study.
Publisher Copyright:
© 2018 International Speech Communication Association. All rights reserved.
PY - 2018
Y1 - 2018
N2 - This research work investigates the possibility of using automatic acoustic measures to assess speech fluency in the context of second language (L2) acquisition. To this end, three experts rated speech recordings of Japanese learners of French who were instructed to read aloud a 21-sentence-long text. A Forward-Backward Divergence Segmentation (FBDS) algorithm was used to segment speech recordings (sentences) into acoustically homogeneous units at a subphonemic scale. The FBDS processing results were used - along with more classic measures such as raw percentage of speech and length/standard deviation of silent pauses - to estimate speech rate and regularity of speech rate, while a formant tracking algorithm was used to estimate speech fluidity (i.e., quality of coarticulation). A step-by-step multiple linear regression was finally computed to predict the experts' mean fluency ratings. Results show that FBDS-derived measures, raw percentage of speech, and standard deviation of the first formant curve derivative can be combined together to calculate accurate estimates of speakers' fluency scores (R = .92; P < .001). As only low-level signal features were used in the study, the method could also be relevant for the assessment of speakers of other target languages, as well as for the assessment of disordered speech.
AB - This research work investigates the possibility of using automatic acoustic measures to assess speech fluency in the context of second language (L2) acquisition. To this end, three experts rated speech recordings of Japanese learners of French who were instructed to read aloud a 21-sentence-long text. A Forward-Backward Divergence Segmentation (FBDS) algorithm was used to segment speech recordings (sentences) into acoustically homogeneous units at a subphonemic scale. The FBDS processing results were used - along with more classic measures such as raw percentage of speech and length/standard deviation of silent pauses - to estimate speech rate and regularity of speech rate, while a formant tracking algorithm was used to estimate speech fluidity (i.e., quality of coarticulation). A step-by-step multiple linear regression was finally computed to predict the experts' mean fluency ratings. Results show that FBDS-derived measures, raw percentage of speech, and standard deviation of the first formant curve derivative can be combined together to calculate accurate estimates of speakers' fluency scores (R = .92; P < .001). As only low-level signal features were used in the study, the method could also be relevant for the assessment of speakers of other target languages, as well as for the assessment of disordered speech.
KW - Automatic measures
KW - French
KW - Japanese
KW - Learner corpus
KW - Second language acquisition
KW - Speech fluency
UR - http://www.scopus.com/inward/record.url?scp=85054984906&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054984906&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2018-1336
DO - 10.21437/Interspeech.2018-1336
M3 - Conference article
AN - SCOPUS:85054984906
SN - 2308-457X
VL - 2018-September
SP - 2544
EP - 2548
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Y2 - 2 September 2018 through 6 September 2018
ER -