TY - GEN
T1 - The impact of intent selection on diversified search evaluation
AU - Sakai, Tetsuya
AU - Dou, Zhicheng
AU - Clarke, Charles L.A.
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2013
Y1 - 2013
N2 - To construct a diversified search test collection, a set of possible subtopics (or intents) needs to be determined for each topic, in one way or another, and per-intent relevance assessments need to be obtained. In the TREC Web Track Diversity Task, subtopics are manually developed at NIST, based on results of automatic click log analysis; in the NTCIR INTENT Task, intents are determined by manually clustering "subtopics strings" returned by participating systems. In this study, we address the following research question: Does the choice of intents for a test collection affect relative performances of diversified search systems? To this end, we use the TREC 2012 Web Track Diversity Task data and the NTCIR-10 INTENT-2 Task data, which share a set of 50 topics but have different intent sets. Our initial results suggest that the choice of intents may affect relative performances, and that this choice may be far more important than how many intents are selected for each topic.
AB - To construct a diversified search test collection, a set of possible subtopics (or intents) needs to be determined for each topic, in one way or another, and per-intent relevance assessments need to be obtained. In the TREC Web Track Diversity Task, subtopics are manually developed at NIST, based on results of automatic click log analysis; in the NTCIR INTENT Task, intents are determined by manually clustering "subtopics strings" returned by participating systems. In this study, we address the following research question: Does the choice of intents for a test collection affect relative performances of diversified search systems? To this end, we use the TREC 2012 Web Track Diversity Task data and the NTCIR-10 INTENT-2 Task data, which share a set of 50 topics but have different intent sets. Our initial results suggest that the choice of intents may affect relative performances, and that this choice may be far more important than how many intents are selected for each topic.
KW - Diversity
KW - Evaluation
KW - Intents
KW - Subtopics
KW - Test collections
UR - http://www.scopus.com/inward/record.url?scp=84883100429&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883100429&partnerID=8YFLogxK
U2 - 10.1145/2484028.2484105
DO - 10.1145/2484028.2484105
M3 - Conference contribution
AN - SCOPUS:84883100429
SN - 9781450320344
T3 - SIGIR 2013 - Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval
SP - 921
EP - 924
BT - SIGIR 2013 - Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval
T2 - 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2013
Y2 - 28 July 2013 through 1 August 2013
ER -