- -

End-to-End Neural Optical Music Recognition of Monophonic Scores

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

End-to-End Neural Optical Music Recognition of Monophonic Scores

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Calvo-Zaragoza, Jorge es_ES
dc.contributor.author Rizo, D. es_ES
dc.date.accessioned 2020-05-20T03:01:59Z
dc.date.available 2020-05-20T03:01:59Z
dc.date.issued 2018-04 es_ES
dc.identifier.uri http://hdl.handle.net/10251/143793
dc.description.abstract [EN] Optical Music Recognition is a field of research that investigates how to computationally decode music notation from images. Despite the efforts made so far, there are hardly any complete solutions to the problem. In this work, we study the use of neural networks that work in an end-to-end manner. This is achieved by using a neural model that combines the capabilities of convolutional neural networks, which work on the input image, and recurrent neural networks, which deal with the sequential nature of the problem. Thanks to the use of the the so-called Connectionist Temporal Classification loss function, these models can be directly trained from input images accompanied by their corresponding transcripts into music symbol sequences. We also present the Printed Images of Music Staves (PrIMuS) dataset, containing more than 80,000 monodic single-staff real scores in common western notation, that is used to train and evaluate the neural approach. In our experiments, it is demonstrated that this formulation can be carried out successfully. Additionally, we study several considerations about the codification of the output musical sequences, the convergence and scalability of the neural models, as well as the ability of this approach to locate symbols in the input score. es_ES
dc.description.sponsorship This work was supported by the Social Sciences and Humanities Research Council of Canada, and the Spanish Ministerio de Economia y Competitividad through Project HISPAMUS Ref. No. TIN2017-86576-R (supported by UE FEDER funds). es_ES
dc.language Inglés es_ES
dc.publisher MDPI AG es_ES
dc.relation.ispartof Applied Sciences es_ES
dc.rights Reconocimiento (by) es_ES
dc.subject Optical Music Recognition es_ES
dc.subject End-to-end recognition es_ES
dc.subject Deep Learning es_ES
dc.subject Music score images es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title End-to-End Neural Optical Music Recognition of Monophonic Scores es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.3390/app8040606 es_ES
dc.relation.projectID info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2017-86576-R/ES/PRESERVACION DEL PATRIMONIO DE MUSICA ESPAÑOLA MANUSCRITA MEDIANTE TRANSCRIPCION AUTOMATICA/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Calvo-Zaragoza, J.; Rizo, D. (2018). End-to-End Neural Optical Music Recognition of Monophonic Scores. Applied Sciences. 8(4). https://doi.org/10.3390/app8040606 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.3390/app8040606 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 8 es_ES
dc.description.issue 4 es_ES
dc.identifier.eissn 2076-3417 es_ES
dc.relation.pasarela S\361821 es_ES
dc.contributor.funder European Regional Development Fund es_ES
dc.contributor.funder Agencia Estatal de Investigación es_ES
dc.contributor.funder Social Sciences and Humanities Research Council of Canada es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem