TY - GEN
T1 - Unsupervised learning of vowels from continuous speech based on self-organized phoneme acquisition model
AU - Miyazawa, Kouki
AU - Kikuchi, Hideaki
AU - Mazuka, Reiko
N1 - Funding Information:
This work was supported by KAKENHI 21610028.
PY - 2010
Y1 - 2010
N2 - All normal humans can acquire their native phoneme systems simply by living in their native language environment. However, it is unclear as to how infants learn the acoustic expression of each phoneme of their native languages. In recent studies, researchers have inspected phoneme acquisition by using a computational model. However, these studies have used read speech that has a limited vocabulary as input and do not handle a continuous speech that is almost comparable to a natural environment. Therefore, in this study, we use natural continuous speech and build a self-organization model that simulates the cognitive ability of the humans, and we analyze the quality and quantity of the speech information that is necessary for the acquisition of the native vowel system. Our model is designed to learn values of the acoustic characteristic of a natural continuous speech and to estimate the number and boundaries of the vowel categories without using explicit instructions. In the simulation trial, we investigate the relationship between the quantity of learning and the accuracy for the vowels in a single Japanese speaker's natural speech. As a result, it is found that the vowel recognition accuracy of our model is comparable to that of an adult.
AB - All normal humans can acquire their native phoneme systems simply by living in their native language environment. However, it is unclear as to how infants learn the acoustic expression of each phoneme of their native languages. In recent studies, researchers have inspected phoneme acquisition by using a computational model. However, these studies have used read speech that has a limited vocabulary as input and do not handle a continuous speech that is almost comparable to a natural environment. Therefore, in this study, we use natural continuous speech and build a self-organization model that simulates the cognitive ability of the humans, and we analyze the quality and quantity of the speech information that is necessary for the acquisition of the native vowel system. Our model is designed to learn values of the acoustic characteristic of a natural continuous speech and to estimate the number and boundaries of the vowel categories without using explicit instructions. In the simulation trial, we investigate the relationship between the quantity of learning and the accuracy for the vowels in a single Japanese speaker's natural speech. As a result, it is found that the vowel recognition accuracy of our model is comparable to that of an adult.
KW - Language acquisition
KW - Neural network
KW - Vowels
UR - http://www.scopus.com/inward/record.url?scp=79959816529&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79959816529&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:79959816529
T3 - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
SP - 2914
EP - 2917
BT - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
PB - International Speech Communication Association
ER -