TY - JOUR
T1 - Zipf's law in phonograms and Weibull distribution in ideograms
T2 - Comparison of English with Japanese
AU - Nabeshima, Terutaka
AU - Gunji, Yukio Pegio
PY - 2004/2
Y1 - 2004/2
N2 - The frequency distribution of word usage in a word sequence generated by capping is estimated from the number of "hits" returned when retrieving web pages, in order to evaluate the semantic structure proper not to a particular text but to a language. In particular, we compare the distributions of English sequences with those of Japanese ones and find that, for English and for Japanese phonograms, the frequency of word usage against rank follows a power-law function with exponent 1, whereas for Japanese ideograms it follows a stretched exponential (Weibull distribution) function. We also discuss how such a difference can result from the difference between a phonogram-based language (English) and an ideogram-based language (Japanese).
AB - The frequency distribution of word usage in a word sequence generated by capping is estimated from the number of "hits" returned when retrieving web pages, in order to evaluate the semantic structure proper not to a particular text but to a language. In particular, we compare the distributions of English sequences with those of Japanese ones and find that, for English and for Japanese phonograms, the frequency of word usage against rank follows a power-law function with exponent 1, whereas for Japanese ideograms it follows a stretched exponential (Weibull distribution) function. We also discuss how such a difference can result from the difference between a phonogram-based language (English) and an ideogram-based language (Japanese).
KW - Ideogram
KW - Phonogram
KW - Weibull distribution
UR - http://www.scopus.com/inward/record.url?scp=0842328616&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0842328616&partnerID=8YFLogxK
U2 - 10.1016/j.biosystems.2003.11.002
DO - 10.1016/j.biosystems.2003.11.002
M3 - Article
C2 - 15013225
AN - SCOPUS:0842328616
SN - 0303-2647
VL - 73
SP - 131
EP - 139
JO - BioSystems
JF - BioSystems
IS - 2
ER -