A phonetic-based approach to query-by-example spoken term detection

Hurtado Oliver, Lluis Felip; Calvo Lance, Marcos; Gómez Adrian, Jon Ander; García Granada, Fernando; Sanchís Arnal, Emilio

doi:10.1007/978-3-642-41822-8_63


Files in this item

Name: articuloCIARP13-C ...

Size: 306.2Kb

Format: PDF

Description: Author's version.

Name: 82580504.pdf

Size: 178.1Kb

Format: PDF

Description: Publisher's version

dc.contributor.author	Hurtado Oliver, Lluis Felip	es_ES
dc.contributor.author	Calvo Lance, Marcos	es_ES
dc.contributor.author	Gómez Adrian, Jon Ander	es_ES
dc.contributor.author	García Granada, Fernando	es_ES
dc.contributor.author	Sanchís Arnal, Emilio	es_ES
dc.date.accessioned	2014-07-03T17:37:24Z
dc.date.issued	2013
dc.identifier.isbn	978-3-642-41821-1
dc.identifier.issn	0302-9743
dc.identifier.uri	http://hdl.handle.net/10251/38560
dc.description	The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-41822-8_63	es_ES
dc.description.abstract	Query-by-Example Spoken Term Detection (QbE-STD) tasks are usually addressed by representing speech signals as a sequence of feature vectors by means of a parametrization step, and then using a pattern matching technique to find the candidate detections. In this paper, we propose a phoneme-based approach in which the acoustic frames are first converted into vectors representing the a posteriori probabilities for every phoneme. This strategy is especially useful when the language of the task is known a priori. Then, we show how this representation can be used for QbE-STD using both a Segmental Dynamic Time Warping algorithm and a graph-based method. The proposed approach has been evaluated on a QbE-STD task in Spanish, and the results show that it can be an adequate strategy for tackling this kind of problem.	es_ES
dc.description.sponsorship	Work partially supported by the Spanish Ministerio de Economía y Competitividad under contract TIN2011-28169-C05-01 and FPU Grant AP2010-4193, and by the Vic. d’Investigació of the UPV (PAID-06-10)
dc.format.extent	8	es_ES
dc.language	English	es_ES
dc.publisher	Springer Verlag (Germany)	es_ES
dc.relation.ispartof	Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications	es_ES
dc.relation.ispartofseries	Lecture Notes in Computer Science;
dc.rights	All rights reserved	es_ES
dc.subject	Spoken Term Detection	es_ES
dc.subject	Query-by-Example	es_ES
dc.subject	Automatic Speech Recognition	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	A phonetic-based approach to query-by-example spoken term detection	es_ES
dc.type	Book chapter	es_ES
dc.identifier.doi	10.1007/978-3-642-41822-8_63
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//TIN2011-28169-C05-01/ES/TIMPANO-UPV: TECNOLOGIAS PARA LA INTERACCION CONVERSACIONAL COMPLAJE PERSONA-MAQUINA CON APRENDIZAJE DINAMICO/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MECD//AP2010-4193/ES/AP2010-4193/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/UPV//PAID-06-10/	es_ES
dc.rights.accessRights	Open	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Hurtado Oliver, LF.; Calvo Lance, M.; Gómez Adrian, JA.; García Granada, F.; Sanchís Arnal, E. (2013). A phonetic-based approach to query-by-example spoken term detection. In Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Springer Verlag (Germany). 8529:504-511. https://doi.org/10.1007/978-3-642-41822-8_63	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.conferencename	18th Iberoamerican Congress, CIARP 2013	es_ES
dc.relation.conferencedate	November 20-23, 2013,	es_ES
dc.relation.conferenceplace	Havana, Cuba	es_ES
dc.relation.publisherversion	http://link.springer.com/chapter/10.1007/978-3-642-41822-8_63	es_ES
dc.description.upvformatpinicio	504	es_ES
dc.description.upvformatpfin	511	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	8529	es_ES
dc.relation.senia	255946
dc.contributor.funder	Universitat Politècnica de València
dc.contributor.funder	Ministerio de Ciencia e Innovación	es_ES
dc.contributor.funder	Ministerio de Educación, Cultura y Deporte	es_ES
dc.description.references	Anguera, X., Macrae, R., Oliver, N.: Partial sequence matching using an unbounded dynamic time warping algorithm. In: ICASSP, pp. 3582–3585 (2010)	es_ES
dc.description.references	Hazen, T., Shen, W., White, C.: Query-by-example spoken term detection using phonetic posteriorgram templates. In: ASRU, pp. 421–426 (2009)	es_ES
dc.description.references	Zhang, Y., Glass, J.: Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams. In: ASRU, pp. 398–403 (2009)	es_ES
dc.description.references	Akbacak, M., Vergyri, D., Stolcke, A.: Open-vocabulary spoken term detection using graphone-based hybrid recognition systems. In: ICASSP, pp. 5240–5243 (2008)	es_ES
dc.description.references	Fiscus, J.G., Ajot, J., Garofolo, J.S., Doddington, G.: Results of the 2006 spoken term detection evaluation. In: Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational Speech, pp. 51–55 (2007)	es_ES
dc.description.references	Metze, F., Barnard, E., Davel, M., Van Heerden, C., Anguera, X., Gravier, G., Rajput, N., et al.: The spoken web search task. In: Working Notes Proceedings of the MediaEval 2012 Workshop (2012)	es_ES
dc.description.references	Gómez, J.A., Castro, M.J.: Automatic segmentation of speech at the phonetic level. In: Caelli, T.M., Amin, A., Duin, R.P.W., Kamel, M.S., de Ridder, D. (eds.) SSPR & SPR 2002. LNCS, vol. 2396, pp. 672–680. Springer, Heidelberg (2002)	es_ES
dc.description.references	Gómez, J.A., Sanchis, E., Castro-Bleda, M.J.: Automatic speech segmentation based on acoustical clustering. In: Hancock, E.R., Wilson, R.C., Windeatt, T., Ulusoy, I., Escolano, F. (eds.) SSPR & SPR 2010. LNCS, vol. 6218, pp. 540–548. Springer, Heidelberg (2010)	es_ES
dc.description.references	Moreno, A., Poch, D., Bonafonte, A., Lleida, E., Llisterri, J., Marino, J., Nadeu, C.: Albayzin speech database: Design of the phonetic corpus. In: Third European Conference on Speech Communication and Technology (1993)	es_ES
dc.description.references	Park, A., Glass, J.: Towards unsupervised pattern discovery in speech. In: ASRU, pp. 53–58 (2005)	es_ES
dc.description.references	Kullback, S.: Information theory and statistics. Courier Dover Publications (1997)	es_ES
dc.description.references	MAVIR corpus, http://www.lllf.uam.es/ESP/CorpusMavir.html	es_ES
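
The abstract describes matching a spoken query against an utterance after both have been converted into sequences of phoneme posterior vectors. The record does not give the paper's distance measure or its exact Segmental DTW variant, so the following is only a minimal sketch under stated assumptions: it uses a plain subsequence DTW (a simpler relative of Segmental DTW), the negative log of the inner product as the frame distance, and synthetic posteriorgrams. The names `toy_posteriorgram`, `frame_distance`, and `subsequence_dtw` are hypothetical and are not taken from the paper.

```python
import numpy as np


def toy_posteriorgram(phones, n_phonemes=4, peak=0.9):
    """Build a synthetic posteriorgram: one row per frame, one column per phoneme.

    Assumption for illustration only: each frame puts `peak` probability on its
    phoneme and spreads the rest evenly, so every row sums to 1.
    """
    post = np.full((len(phones), n_phonemes), (1.0 - peak) / (n_phonemes - 1))
    post[np.arange(len(phones)), phones] = peak
    return post


def frame_distance(query, utterance, eps=1e-10):
    """Pairwise -log(q . u) between posterior vectors (an assumed distance)."""
    return -np.log(np.clip(query @ utterance.T, eps, None))


def subsequence_dtw(query, utterance):
    """Align the whole query to the best contiguous region of the utterance.

    Returns (cost, start, end), where cost is the accumulated distance divided
    by the query length and utterance[start:end + 1] is the matched region.
    """
    dist = frame_distance(query, utterance)
    n, m = dist.shape
    acc = np.full((n, m), np.inf)      # accumulated cost
    start = np.zeros((n, m), dtype=int)  # utterance frame where each path began

    # The first query frame may align with any utterance frame at no extra cost.
    acc[0, :] = dist[0, :]
    start[0, :] = np.arange(m)

    for i in range(1, n):
        acc[i, 0] = acc[i - 1, 0] + dist[i, 0]
        start[i, 0] = 0
        for j in range(1, m):
            # Allowed predecessors: diagonal, vertical, horizontal.
            candidates = [
                (acc[i - 1, j - 1], start[i - 1, j - 1]),
                (acc[i - 1, j], start[i - 1, j]),
                (acc[i, j - 1], start[i, j - 1]),
            ]
            best_cost, best_start = min(candidates, key=lambda c: c[0])
            acc[i, j] = best_cost + dist[i, j]
            start[i, j] = best_start

    end = int(np.argmin(acc[-1]))
    return float(acc[-1, end]) / n, int(start[-1, end]), end


if __name__ == "__main__":
    utterance = toy_posteriorgram([0, 0, 1, 2, 3, 0, 0])
    query = toy_posteriorgram([1, 2, 3])
    cost, first, last = subsequence_dtw(query, utterance)
    print(f"match at frames {first}..{last}, normalized cost {cost:.3f}")
```

In this toy case the query phonemes occur once in the utterance, at frames 2 to 4, and the alignment recovers that region. A real system would derive the posteriorgrams from a phonetic classifier and would typically apply a detection threshold to the normalized cost.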

This item appears in the following collection(s)

Articles, conferences, monographs [48344]

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia