TY - JOUR
T1 - Item difficulty parameter estimation using the idea of the graded response model and computerized adaptive testing
AU - Ozaki, Koken
AU - Toyoda, Hideki
PY - 2009/3/1
Y1 - 2009/3/1
N2 - In test operations using IRT (item response theory), items are included in a test before being used to rate subjects and the response data is used to estimate their item parameters. However, this method of test operation may lead to item content leakage and an adequate test operation can become difficult. To address this problem, Ozaki and Toyoda (2005, 2006) developed item difficulty parameter estimation methods that use paired comparison data from the perspective of the difficulty of items as judged by raters familiar with the field. In the present paper, an improved method of item difficulty parameter estimation is developed. In this new method, an item for which the difficulty parameter is to be estimated is compared with multiple items simultaneously, from the perspective of their difficulty. This is not a one-to-one comparison but a one-to-many comparison. In the comparisons, raters are informed that items selected from an item pool are ordered according to difficulty. The order will provide insight to improve the accuracy of judgment.
AB - In test operations using IRT (item response theory), items are included in a test before being used to rate subjects and the response data is used to estimate their item parameters. However, this method of test operation may lead to item content leakage and an adequate test operation can become difficult. To address this problem, Ozaki and Toyoda (2005, 2006) developed item difficulty parameter estimation methods that use paired comparison data from the perspective of the difficulty of items as judged by raters familiar with the field. In the present paper, an improved method of item difficulty parameter estimation is developed. In this new method, an item for which the difficulty parameter is to be estimated is compared with multiple items simultaneously, from the perspective of their difficulty. This is not a one-to-one comparison but a one-to-many comparison. In the comparisons, raters are informed that items selected from an item pool are ordered according to difficulty. The order will provide insight to improve the accuracy of judgment.
KW - Computerized adaptive testing
KW - Difficulty parameter estimation
KW - Equating
KW - Graded response model
KW - Test operation
UR - http://www.scopus.com/inward/record.url?scp=62549104196&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=62549104196&partnerID=8YFLogxK
U2 - 10.1111/j.1468-5884.2009.00383.x
DO - 10.1111/j.1468-5884.2009.00383.x
M3 - Article
AN - SCOPUS:62549104196
SN - 0021-5368
VL - 51
SP - 1
EP - 12
JO - Japanese Psychological Research
JF - Japanese Psychological Research
IS - 1
ER -