Abstract
We propose a novel approach to find aliases of a given name from the web. We exploit a set of known names and their aliases as training data and extract lexical patterns that convey information related to aliases of names from text snippets returned by a web search engine. The patterns are then used to find candidate aliases of a given name. We use anchor texts and hyperlinks to design a word co-occurrence model and define numerous ranking scores to evaluate the association between a name and its candidate aliases. The proposed method outperforms numerous baselines and previous work on alias extraction on a dataset of personal names, achieving a statistically significant mean reciprocal rank of 0.6718. Moreover, the aliases extracted using the proposed method improve recall by 20% in a relation-detection task.
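
The abstract reports its headline result as a mean reciprocal rank (MRR). As a minimal sketch of how that metric is conventionally computed (this is not the authors' evaluation code), the Python below scores ranked candidate lists against sets of correct aliases. The function names and the placeholder data are assumptions introduced purely for illustration.

```python
from typing import Sequence, Set


def reciprocal_rank(ranked: Sequence[str], correct: Set[str]) -> float:
    """Return 1/position of the first correct candidate, or 0.0 if none appears."""
    for position, candidate in enumerate(ranked, start=1):
        if candidate in correct:
            return 1.0 / position
    return 0.0


def mean_reciprocal_rank(
    rankings: Sequence[Sequence[str]], gold: Sequence[Set[str]]
) -> float:
    """Average the reciprocal rank over all query names."""
    if len(rankings) != len(gold):
        raise ValueError("rankings and gold must have the same length")
    if not rankings:
        return 0.0
    total = sum(reciprocal_rank(r, g) for r, g in zip(rankings, gold))
    return total / len(rankings)


# Hypothetical placeholder data: one ranked candidate list per query name.
example_rankings = [["x", "a"], ["b"], ["c", "d", "e"]]
example_gold = [{"a"}, {"b"}, {"z"}]
example_mrr = mean_reciprocal_rank(example_rankings, example_gold)
```

Under this definition, a score of 1.0 would mean that a correct alias is ranked first for every name, and a name with no correct alias among its candidates contributes 0.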
Original language | English |
---|---|
Title of host publication | Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08 |
Pages | 1107-1108 |
Number of pages | 2 |
DOI | |
Publication status | Published - 2008 |
Externally published | Yes |
Event | 17th International Conference on World Wide Web 2008, WWW'08 - Beijing. Duration: 21 Apr 2008 → 25 Apr 2008 |
Other
Other | 17th International Conference on World Wide Web 2008, WWW'08 |
---|---|
City | Beijing |
Period | 21 Apr 2008 → 25 Apr 2008 |
ASJC Scopus subject areas
- Computer Networks and Communications