Volver atrás Publicación

Language Identification for Interactive Handwriting Transcription of Multilingual Documents

Imprimir

¿Quieres contarnos tu reto? Pincha aquí y te ayudamos a encontrar una solución

Autores UPV

Agua Teba Miguel Ángel del, Serrano Martínez-Santos Nicolás, Juan Císcar Alfonso

Año

2011

Revista

Lecture Notes in Computer Science

Abstract

An effective approach to handwriting transcription of (old) documents is to follow a sequential, line-by-line transcription of the whole document, in which a continuously retrained system interacts with the user. In the case of multilingual documents, however, a minor yet important issue for this interactive approach is to first identify the language of the current text line image to be transcribed. In this paper, we propose a probabilistic framework and three techniques for this purpose. Empirical results are reported on an entire 764-page multilingual document for which previous empirical tests were limited to its first 180 pages, written only in Spanish. © 2011 Springer-Verlag.

Más Información