PAPER DIGEST
Most Influential ACL 2017 Paper · 2026-03 edition

Cross-lingual Name Tagging And Linking For 282 Languages

Xiaoman Pan, Boliang Zhang, Jonathan May, Joel Nothman, Kevin Knight, Heng Ji

Venue: Annual Meeting of the Association for Computational Linguistics (ACL) 2017
Recognition: Most Influential ACL 2017 Paper (Rank No. 15)
Edition: 2026-03
Impact factor: 7
Certificate ID: b5f468f72ae3adc7

Abstract

The ambitious goal of this work is to develop a cross-lingual name tagging and linking framework for 282 languages that exist in Wikipedia. Given a document in any of these languages, our framework is able to identify name mentions, assign a coarse-grained or fine-grained type to each mention, and link it to an English Knowledge Base (KB) if it is linkable. We achieve this goal by performing a series of new KB mining methods: generating "silver-standard" annotations by transferring annotations from English to other languages through cross-lingual links and KB properties, refining annotations through self-training and topic selection, deriving language-specific morphology features from anchor links, and mining word translation pairs from cross-lingual links. Both name tagging and linking results for 282 languages are promising on Wikipedia data and on non-Wikipedia data.
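
The annotation-transfer idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes each anchor link in a non-English sentence is given as a token span plus the title of the page it points to, that a cross-lingual link table maps that title to an English title, and that an English KB assigns each English title an entity type. The function name `project_silver_tags`, the toy tables `LANGLINKS` and `EN_TYPES`, and the `LOC` type label are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical toy tables. In practice these would be mined from
# Wikipedia cross-lingual links and KB properties.
LANGLINKS: Dict[str, str] = {"Ankara": "Ankara", "Türkiye": "Turkey"}
EN_TYPES: Dict[str, str] = {"Ankara": "LOC", "Turkey": "LOC"}

# An anchor is (start_token, end_token_exclusive, linked_page_title).
Anchor = Tuple[int, int, str]


def project_silver_tags(
    tokens: List[str],
    anchors: List[Anchor],
    langlinks: Dict[str, str],
    en_types: Dict[str, str],
) -> List[str]:
    """Turn anchor links into BIO tags using types projected from English."""
    tags = ["O"] * len(tokens)
    for start, end, title in anchors:
        en_title = langlinks.get(title)
        entity_type = en_types.get(en_title) if en_title is not None else None
        if entity_type is None:
            # No cross-lingual link or no known type: leave the span untagged.
            continue
        tags[start] = "B-" + entity_type
        for i in range(start + 1, end):
            tags[i] = "I-" + entity_type
    return tags


if __name__ == "__main__":
    tokens = ["Ankara", "Türkiye'nin", "başkentidir", "."]
    anchors = [(0, 1, "Ankara"), (1, 2, "Türkiye")]
    print(project_silver_tags(tokens, anchors, LANGLINKS, EN_TYPES))
```

Spans whose page has no cross-lingual link or no known type stay untagged in this sketch, which is one reason such projected labels are only "silver-standard" and need later refinement such as the self-training and topic selection the abstract mentions.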
