TY - GEN
T1 - Estimating intent types for search result diversification
AU - Tsukuda, Kosetsu
AU - Sakai, Tetsuya
AU - Dou, Zhicheng
AU - Tanaka, Katsumi
PY - 2013
Y1 - 2013
N2 - Given an ambiguous or underspecified query, search result diversification aims at accommodating different user intents within a single Search Engine Result Page (SERP). While automatic identification of different intents for a given query is a crucial step for result diversification, also important is the estimation of intent types (informational vs. navigational). If it is possible to distinguish between informational and navigational intents, search engines can aim to return one best URL for each navigational intent, while allocating more space to the informational intents within the SERP. In light of the observations, we propose a new framework for search result diversification that is intent importance-aware and type-aware. Our experiments using the NTCIR-9 INTENT Japanese Subtopic Mining and Document Ranking test collections show that: (a) our intent type estimation method for Japanese achieves 64.4% accuracy; and (b) our proposed diversification method achieves 0.6373 in D#-nDCG and 0.5898 in DIN#-nDCG over 56 topics, which are statistically significant gains over the top performers of the NTCIR-9 INTENT Japanese Document Ranking runs. Moreover, our relevance oriented model significantly outperforms our diversity oriented model and the original model by Dou et al..
AB - Given an ambiguous or underspecified query, search result diversification aims at accommodating different user intents within a single Search Engine Result Page (SERP). While automatic identification of different intents for a given query is a crucial step for result diversification, also important is the estimation of intent types (informational vs. navigational). If it is possible to distinguish between informational and navigational intents, search engines can aim to return one best URL for each navigational intent, while allocating more space to the informational intents within the SERP. In light of the observations, we propose a new framework for search result diversification that is intent importance-aware and type-aware. Our experiments using the NTCIR-9 INTENT Japanese Subtopic Mining and Document Ranking test collections show that: (a) our intent type estimation method for Japanese achieves 64.4% accuracy; and (b) our proposed diversification method achieves 0.6373 in D#-nDCG and 0.5898 in DIN#-nDCG over 56 topics, which are statistically significant gains over the top performers of the NTCIR-9 INTENT Japanese Document Ranking runs. Moreover, our relevance oriented model significantly outperforms our diversity oriented model and the original model by Dou et al..
KW - Intent Type
KW - Search Result Diversity
KW - Subtopic
UR - http://www.scopus.com/inward/record.url?scp=84893314872&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893314872&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-45068-6_3
DO - 10.1007/978-3-642-45068-6_3
M3 - Conference contribution
AN - SCOPUS:84893314872
SN - 9783642450679
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 25
EP - 37
BT - Information Retrieval Technology - 9th Asia Information Retrieval Societies Conference, AIRS 2013, Proceedings
T2 - 9th Asia Information Retrieval Societies Conference on Information Retrieval Technology, AIRS 2013
Y2 - 9 December 2013 through 11 December 2013
ER -