Decline of Pearson’s r with categorization of variables: a large-scale simulation

Takahiro Onoshima*, Kenpei Shiina, Takashi Ueda, Saori Kubo

*この研究の対応する著者

研究成果: Article査読

1 被引用数 (Scopus)

抄録

It is often said that correlation coefficients computed from categorical variables are biased and thus should not be used. However, practitioners often ignore this longstanding caveat from statisticians. Although some studies have examined the bias, the true extent is still unknown. This study is an extensive attempt to determine the range and degree of the biases. In our simulation, continuous variables were categorized according to various thresholds and used to compute Pearson’s r. The results indicated that there were more serious biases than highlighted in previous studies. The results also revealed that increasing data size did not reduce the biases. Possible ways to cope with the biases are discussed.

本文言語English
ページ(範囲)389-399
ページ数11
ジャーナルBehaviormetrika
46
2
DOI
出版ステータスPublished - 2019 10月 1

ASJC Scopus subject areas

  • 実験心理学および認知心理学
  • 分析
  • 臨床心理学
  • 応用数学

フィンガープリント

「Decline of Pearson’s r with categorization of variables: a large-scale simulation」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル