Jorge-Cano, J.; Giménez Pastor, A.; Silvestre Cerdà, J. A.; Civera Saiz, J.; Sanchis Navarro, J. A.; Juan, A. (2022). Live Streaming Speech Recognition Using Deep Bidirectional LSTM Acoustic Models and Interpolated Language Models. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 30:148-161. https://doi.org/10.1109/TASLP.2021.3133216
Please use this identifier to cite or link to this item: http://hdl.handle.net/10251/182807
Title:
|
Live Streaming Speech Recognition Using Deep Bidirectional LSTM Acoustic Models and Interpolated Language Models
|
Author:
|
Jorge-Cano, Javier
Giménez Pastor, Adrián
Silvestre Cerdà, Joan Albert
Civera Saiz, Jorge
Sanchis Navarro, José Alberto
Juan, Alfons
|
UPV Unit:
|
Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
|
Issued date:
|
|
Abstract:
|
[EN] Although Long Short-Term Memory (LSTM) networks and deep Transformers are now extensively used in offline ASR, it is unclear how offline systems can best be adapted to work with them under the streaming setup. After gaining considerable experience in this regard in recent years, in this paper we show how an optimized, low-latency streaming decoder can be built in which bidirectional LSTM acoustic models, together with general interpolated language models, can be nicely integrated with minimal performance degradation. In brief, our streaming decoder consists of a one-pass, real-time search engine relying on a limited-duration window sliding over time and a number of ad hoc acoustic and language model pruning techniques. Extensive empirical assessment is provided on truly streaming tasks derived from the well-known LibriSpeech and TED talks datasets, as well as from TV shows on a main Spanish broadcasting station.
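The limited-duration sliding window mentioned in the abstract can be illustrated with a minimal sketch. The window and stride sizes below are hypothetical, and the bidirectional LSTM scoring and interpolated-LM pruning that the paper actually integrates are omitted; this only shows how a stream is chunked so a bidirectional model sees bounded right context instead of the full utterance.

```python
import numpy as np

def sliding_windows(features, window, stride):
    """Yield (start, chunk) pairs over a stream of acoustic feature frames.

    `window` and `stride` are hypothetical illustration values; the paper
    tunes the actual limited-duration window for latency vs. accuracy.
    """
    last_start = max(len(features) - window, 0)
    for start in range(0, last_start + 1, stride):
        yield start, features[start:start + window]

# Toy stream: 100 frames of 40-dimensional acoustic features.
stream = np.zeros((100, 40), dtype=np.float32)
chunks = list(sliding_windows(stream, window=30, stride=10))
# 8 windows with starts 0, 10, ..., 70; each window spans 30 frames,
# bounding the right context available to a bidirectional acoustic model.
```

In a streaming decoder, each such window would be scored by the acoustic model and only a central portion of its outputs retained, so that every emitted frame has both left and right context within its window.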
|
Subjects:
|
Automatic speech recognition
,
Streaming
,
Decoding
,
Acoustic modeling
,
Language modeling
,
Neural networks
|
Copyrights:
|
Attribution (by)
|
Source:
|
IEEE/ACM Transactions on Audio, Speech, and Language Processing. (issn: 2329-9290)
|
DOI:
|
10.1109/TASLP.2021.3133216
|
Publisher:
|
Institute of Electrical and Electronics Engineers
|
Publisher version:
|
https://doi.org/10.1109/TASLP.2021.3133216
|
APC cost:
|
2500 €
|
Project ID:
|
info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-094879-B-I00/ES/SUBTITULACION MULTILINGUE DE CLASES DE AULA Y SESIONES PLENARIAS/
info:eu-repo/grantAgreement/GENERALITAT VALENCIANA//PROMETEO%2F2019%2F111//CLASSROOM ACTIVITY RECOGNITION/
info:eu-repo/grantAgreement/EC/H2020/761758/EU
info:eu-repo/grantAgreement/COMISION DE LAS COMUNIDADES EUROPEA//2020-1-SI01-KA226-SCH-093604//EDUCATIONAL EXPLANATIONS AND PRACTICES IN EMERGENCY REMOTE TEACHING/
info:eu-repo/grantAgreement/EC/H2020/952215/EU
|
Thanks:
|
This work was supported in part by the European Union's Horizon 2020 Research and Innovation Programme under Grants 761758 (X5gon) and 952215 (TAILOR), in part by the Erasmus+ Education Program under Grant Agreement 20-226-093604-SCH, in part by MCIN/AEI/10.13039/501100011033 and "ERDF A way of making Europe" under Grant RTI2018-094879-B-I00, and in part by Generalitat Valenciana's research project Classroom Activity Recognition under Grant PROMETEO/2019/111. Funding for open access charge: CRUE-Universitat Politècnica de València. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Lei Xie.
|
Type:
|
Article
|