Computing cross-lingual synonym set similarity by using princeton annotated gloss corpus

Yoshihiko Hayashi*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

This paper proposes a method to compute cross-lingual semantic similarity between synonym sets. By making use of Princeton Annotated Gloss Corpus as the source of target language statistics, the proposed method exhibited promising results in the experiments: More than 73% of the Princeton WordNet synsets were successfully recovered within the top-5 candidates, given a corresponding set of Japanese WordNet synsets. As the proposed method minimally requires that the input to be seen as an apparently synonymous word set, the method could be extended and the performance would be further improved by incorporating richer information such as textual glosses and/or structural constraints posed by the lexical resources at hand.

Original languageEnglish
Title of host publicationGWC 2012
Subtitle of host publication6th International Global Wordnet Conference, Proceedings
PublisherTribun EU s. r. o.
Pages134-141
Number of pages8
ISBN (Print)9788026302445
Publication statusPublished - 2012
Externally publishedYes
Event6th International Global Wordnet Conference, GWC 2012 - Matsue, Japan
Duration: 2012 Jan 92012 Jan 13

Publication series

NameGWC 2012: 6th International Global Wordnet Conference, Proceedings

Other

Other6th International Global Wordnet Conference, GWC 2012
Country/TerritoryJapan
CityMatsue
Period12/1/912/1/13

ASJC Scopus subject areas

  • Language and Linguistics
  • Literature and Literary Theory

Fingerprint

Dive into the research topics of 'Computing cross-lingual synonym set similarity by using princeton annotated gloss corpus'. Together they form a unique fingerprint.

Cite this