BIP! Finder - Unsupervised adaptation of supervised part-of-speech taggers for closely related languages

2014 • Unsupervised adaptation of supervised part-of-speech taggers for closely related languages

Authors: Scherrer, Yves

Venue: Proceedings of the First Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects

Type: Publication

Abstract: When developing NLP tools for low-resource languages, one is often confronted with the lack of annotated data. We propose to circumvent this bottleneck by training a supervised HMM tagger on a closely related language for which annotated data are available, and translating the words in the tagger parameter files into the low-resource language. The translation dictionaries are created with unsupervised lexicon induction techniques that rely only on raw textual data. We obtain a tagging accuracy of up to 89.08% using a Spanish tagger adapted to C... (read more)

Impact:

2.085865E-9 3.3915712E-9 6 2

/ Attention: 0 19

Topics: Natural language processing Artificial intelligence Speech recognition

DOI: 10.3115/v1/w14-5304

External links: Crossref OpenAIRE

BibTex PDF

Topic-specific impact indicators

Popularity:
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence:
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Citation Count:
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse:
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.