Zipf's law in phonograms and Weibull distribution in ideograms: Comparison of English with Japanese

Terutaka Nabeshima, Yukio Pegio Gunji*

*この研究の対応する著者

研究成果: Article査読

10 被引用数 (Scopus)

抄録

Frequency distribution of word usage in a word sequence generated by capping is estimated in terms of the number of "hits" in retrieval of web-pages, to evaluate structure of semantics proper not to a particular text but to a language. Especially we compare distribution of English sequences with Japanese ones and obtain that, for English and Japanese phonogram, frequency of word usage against rank follows power-law function with exponent 1 and, for Japanese ideogram, it follows stretched exponential (Weibull distribution) function. We also discuss that such a difference can result from difference of phonogram based- (English) and ideogram-based language (Japanese).

本文言語English
ページ(範囲)131-139
ページ数9
ジャーナルBioSystems
73
2
DOI
出版ステータスPublished - 2004 2月
外部発表はい

ASJC Scopus subject areas

  • 統計学および確率
  • モデリングとシミュレーション
  • 生化学、遺伝学、分子生物学(全般)
  • 応用数学

フィンガープリント

「Zipf's law in phonograms and Weibull distribution in ideograms: Comparison of English with Japanese」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル