Volver atrás Publicación

An Empirical Analysis of Data Selection Techniques in Statistical Machine Translation

Imprimir

¿Quieres contarnos tu reto? Pincha aquí y te ayudamos a encontrar una solución

Autores UPV

Chinea Ríos Mara, Sanchis Trilles Germán, Casacuberta Nolla Francisco

Año

2015

Revista

Procesamiento del Lenguaje Natural

Abstract

Domain adaptation has recently gained interest in statistical machine translation. One of the adaptation techniques is based in the selection data. Data selection aims to select the best subset of the bilingual sentences from an available pool of sentences, with which to train a SMT system. In this paper, we study how aect the bilingual corpora used for the data selection methods in the translation quality.

Más Información