Mining for personal name aliases on the web

Danushka Bollegala*, Taiki Honma, Yutaka Matsuo, Mitsuru Ishizuka

*この研究の対応する著者

研究成果: Conference contribution

22 被引用数 (Scopus)

抄録

We propose a novel approach to find aliases of a given name from the web. We exploit a set of known names and their aliases as training data and extract lexical patterns that convey information related to aliases of names from text snippets returned by a web search engine. The patterns are then used to find candidate aliases of a given name. We use anchor texts and hyperlinks to design a word co-occurrence model and define numerous ranking scores to evaluate the association between a name and its candidate aliases. The proposed method outperforms numerous baselines and previous work on alias extraction on a dataset of personal names, achieving a statistically significant mean reciprocal rank of 0.6718. Moreover, the aliases extracted using the proposed method improve recall by 20% in a relation-detection task.

本文言語English
ホスト出版物のタイトルProceeding of the 17th International Conference on World Wide Web 2008, WWW'08
ページ1107-1108
ページ数2
DOI
出版ステータスPublished - 2008
外部発表はい
イベント17th International Conference on World Wide Web 2008, WWW'08 - Beijing
継続期間: 2008 4月 212008 4月 25

Other

Other17th International Conference on World Wide Web 2008, WWW'08
CityBeijing
Period08/4/2108/4/25

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信

フィンガープリント

「Mining for personal name aliases on the web」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル