[EN] High-quality translation between any pair of languages can be achieved by human post-editing of the outputs of an MT system or by following the Interactive Machine Translation (IMT) approach. In the interactive pattern ...
[EN] High-quality translation between any pair of languages is not possible with current Machine Translation (MT) technology; human post-editing of the MT system's outputs is necessary. Therefore, ...
This paper shows how the technology currently prevalent in HTR borrows concepts and methods from the field of ASR, namely those based on Hidden Markov Models (HMMs). Additionally, it describes an HTR approach based ...
[EN] Document Layout Analysis (DLA) is a process that must be performed before attempting to recognize the content of handwritten musical scores by a modern automatic or semiautomatic system. DLA should provide the ...
A multimodal interactive approach for the transcription of ancient documents is proposed. In this approach, user feedback directly improves system accuracy, while multimodality increases system ergonomics and ...
Currently, automatic handwriting recognition systems are ineffective on unconstrained handwritten documents. Therefore, to obtain perfect transcriptions, heavy human intervention is required to validate and correct the ...
The number of digitized legacy documents has risen dramatically in recent years, mainly due to the increasing number of on-line digital libraries publishing this kind of document. On the one hand, the vast majority ...
Romero Gómez, Verónica (Universitat Politècnica de València, 2010-09-20)
This thesis presents a new interactive, multimodal framework for the transcription of
handwritten documents. Rather than delivering a complete transcription, this approach
aims to assist the expert in the arduous ...
Noya García, Ernesto (Universitat Politècnica de València, 2015-10-06)
[ES] The growing number of digitized handwritten books, together with the lack of accurate tools for
classifying them, has motivated the creation of a consortium of European universities under
which the Polytechnic University of ...
Puigcerver I Pérez, Joan (Universitat Politècnica de València, 2015-07-17)
[EN] In this master's thesis, several approaches are presented to support out-of-vocabulary queries in a Word
Graph (WG)-based Keyword Spotting (KWS) application for handwritten text lines. Generally, KWS
assigns a score ...
[EN] Keyword spotting techniques are becoming cost-effective solutions for information retrieval in handwritten documents. We explore the extension of the single-word, line-level probabilistic indexing approach described ...
[EN] Lexicon-based handwritten text keyword spotting (KWS) has proven to be a faster and more accurate alternative to lexicon-free methods. Nevertheless, since lexicon-based KWS relies on a predefined vocabulary, fixed in ...
Bueno Hurtado, Carmen (Universitat Politècnica de València, 2024-01-03)
[ES] The search for musical information in large series of images of old handwritten
musical scores is a problem of great interest to historians, musicologists, archive managers, and the public in ...
[EN] Historical records of daily activities provide intriguing insights into the life of our ancestors, useful for demography studies and genealogical research. Automatic processing of historical documents, however, has ...
Toselli, Alejandro Héctor; Leiva, Luis A.; Bordes-Cabrera, Isabel; Hernández-Tornero, Celio; Bosch Campos, Vicente; Vidal, Enrique (Oxford University Press, 2018-04)
[EN] We present a process for cost-effective transcription of cursive handwritten text
images that has been tested on a 1,000-page 17th-century book about botanical
species. The process comprised two main tasks, namely: ...
[EN] Two methods are presented to improve word confidence scores for Line-Level Query-by-String Lexicon-Free Keyword Spotting (KWS) in handwritten text images. The first one approaches true relevance probabilities by means ...
[EN] Two document processing applications are considered:
computer-assisted transcription of text images
(CATTI) and Keyword Spotting (KWS), for transcribing
and indexing handwritten documents, respectively. Instead
of ...
Computer Assisted Transcription of Text Images (CATTI)
and Key-Word Spotting (KWS) applications aim at transcribing and
indexing handwritten documents, respectively. Both are approached
by means of Word Graphs (WG) ...