TY - CONF
T1 - THE DESIGN OF THE NEWSPAPER-BASED JAPANESE LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION CORPUS
AU - Itou, Katunobu
AU - Yamamoto, Mikio
AU - Takeda, Kazuya
AU - Takezawa, Toshiyuki
AU - Matsuoka, Tatsuo
AU - Kobayashi, Tetsunori
AU - Shikano, Kiyohiro
AU - Itahashi, Shuichi
N1 - Funding Information:
The prompting texts and the bigram language models for the Mainichi Newspaper article sentences were prepared by Akinori Ito (Yamagata Univ.), Takehito Utsuro (NAIST), Tatsuya Kawahara (Kyoto Univ.), Toru Shimizu (KDD), Masafumi Tamoto, Kazuhiro Arai (NTT), and Nobuaki Minematsu (TUT). We used the NIST SPHERE package to attach headers to the wave files and for the “shorten” compression technique used to reduce the number of CD-ROMs. The NIST SPHERE package was implemented by the Spoken Natural Language processing group, National Institute of Standards and Technology, U.S.A. The'shorten' compression technique was developed by Tony Robinson at Cambrigde University and SoftSound Limited, UK. The speech data was collected by the efforts of many volunteers at the 39 research institutes. We would like to thank all of the above groups and individuals.
Publisher Copyright:
© 1998. 5th International Conference on Spoken Language Processing, ICSLP 1998. All rights reserved.
PY - 1998
Y1 - 1998
N2 - In this paper we present the first public Japanese speech corpus for large vocabulary continuous speech recognition (LVCSR) technology, which we have titled JNAS (Japanese Newspaper Article Sentences). We designed it to be comparable to the corpora used in the American and European LVCSR projects. The corpus contains speech recordings (60 hrs.) and their orthographic transcriptions for 306 speakers (153 males and 153 females) reading excerpts from the newspaper's articles and phonetically balanced (PB) sentences. This corpus contains utterances of about 45,000 sentences as a whole with each speaker reading about 150 sentences. JNAS is being distributed on 16 CD-ROMs.
AB - In this paper we present the first public Japanese speech corpus for large vocabulary continuous speech recognition (LVCSR) technology, which we have titled JNAS (Japanese Newspaper Article Sentences). We designed it to be comparable to the corpora used in the American and European LVCSR projects. The corpus contains speech recordings (60 hrs.) and their orthographic transcriptions for 306 speakers (153 males and 153 females) reading excerpts from the newspaper's articles and phonetically balanced (PB) sentences. This corpus contains utterances of about 45,000 sentences as a whole with each speaker reading about 150 sentences. JNAS is being distributed on 16 CD-ROMs.
UR - http://www.scopus.com/inward/record.url?scp=85128361526&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85128361526&partnerID=8YFLogxK
M3 - Paper
AN - SCOPUS:85128361526
T2 - 5th International Conference on Spoken Language Processing, ICSLP 1998
Y2 - 30 November 1998 through 4 December 1998
ER -