TY - JOUR
T1 - Prediction of L2 speech proficiency based on multi-level linguistic features
AU - De Fino, Verdiana
AU - Fontan, Lionel
AU - Pinquier, Julien
AU - Ferrané, Isabelle
AU - Detey, Sylvain
N1 - Funding Information:
5ANR-18-LCV3-0001-01, https://www.irit.fr/SAMOVA/ site/projects/current/labcom-alaia/ 6Detey, S. (dir.) (2020-2024). JSPS: Grant-in-Aid for Scientific Research (B) 20H01291. 7Detey, S. (dir.) (2011-2018). JSPS: Grant-in-Aid for Scientific Research (B) 23320121 & 15H03227
Publisher Copyright:
Copyright © 2022 ISCA.
PY - 2022
Y1 - 2022
N2 - This study investigates the possibility to use automatic, multi-level features for the prediction of L2 speech proficiency. The method was applied on a corpus containing audio recordings and transcripts for 38 Japanese learners of French who participated in a semi-spontaneous oral production task. Each learner's speech proficiency level was assessed by three experienced French teachers. Audio recordings were processed to extract features related to the pronunciation skills and phonetic fluency of the learners, while the transcripts were used to measure their lexical, syntactic, and discursive abilities in French. A Lasso regression using a leave-one-out cross-validation procedure was used to select relevant features and to accurately predict speech proficiency scores. The results show that five features related to the phonetic fluency (speech rate), lexical abilities (lexical density), discourse planning and elaboration skills (number of hesitation and false starts, mean utterance length) of the learners can be used to predict speech proficiency ratings (r = 0.71, mean absolute error on a 5-point scale: 0.53).
AB - This study investigates the possibility to use automatic, multi-level features for the prediction of L2 speech proficiency. The method was applied on a corpus containing audio recordings and transcripts for 38 Japanese learners of French who participated in a semi-spontaneous oral production task. Each learner's speech proficiency level was assessed by three experienced French teachers. Audio recordings were processed to extract features related to the pronunciation skills and phonetic fluency of the learners, while the transcripts were used to measure their lexical, syntactic, and discursive abilities in French. A Lasso regression using a leave-one-out cross-validation procedure was used to select relevant features and to accurately predict speech proficiency scores. The results show that five features related to the phonetic fluency (speech rate), lexical abilities (lexical density), discourse planning and elaboration skills (number of hesitation and false starts, mean utterance length) of the learners can be used to predict speech proficiency ratings (r = 0.71, mean absolute error on a 5-point scale: 0.53).
KW - automatic assessment
KW - linguistic levels
KW - non-native speech
KW - prediction
KW - semi-spontaneous speech
UR - http://www.scopus.com/inward/record.url?scp=85140077068&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140077068&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2022-10369
DO - 10.21437/Interspeech.2022-10369
M3 - Conference article
AN - SCOPUS:85140077068
SN - 2308-457X
VL - 2022-September
SP - 4043
EP - 4047
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
Y2 - 18 September 2022 through 22 September 2022
ER -