A Hybrid Approach for Transliterated Word-Level Language Identification: CRF with Post Processing Heuristics

Autores UPV
Año
CONGRESO A Hybrid Approach for Transliterated Word-Level Language Identification: CRF with Post Processing Heuristics

Abstract

In this paper, we describe a hybrid approach for word-level language identification of Bangla words written in Roman script and mixed with English words as part of our participation in the shared task on transliterated search at Forum for Information Retrieval Evaluation in 2014.