- -

A phonetic-based approach to query-by-example spoken term detection

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

A phonetic-based approach to query-by-example spoken term detection

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Hurtado Oliver, Lluis Felip es_ES
dc.contributor.author Calvo Lance, Marcos es_ES
dc.contributor.author Gómez Adrian, Jon Ander es_ES
dc.contributor.author García Granada, Fernando es_ES
dc.contributor.author Sanchís Arnal, Emilio es_ES
dc.date.accessioned 2014-07-03T17:37:24Z
dc.date.issued 2013
dc.identifier.isbn 978-3-642-41821-1
dc.identifier.issn 0302-9743
dc.identifier.uri http://hdl.handle.net/10251/38560
dc.description The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-41822-8_63 es_ES
dc.description.abstract Query-by-Example Spoken Term Detection (QbE-STD) tasks are usually addressed by representing speech signals as a sequence of feature vectors by means of a parametrization step, and then using a pattern matching technique to find the candidate detections. In this paper, we propose a phoneme-based approach in which the acoustic frames are first converted into vectors representing the a posteriori probabilities for every phoneme. This strategy is specially useful when the language of the task is a priori known. Then, we show how this representation can be used for QbE-STD using both a Segmental Dynamic Time Warping algorithm and a graph-based method. The proposed approach has been evaluated with a QbE-STD task in Spanish, and the results show that it can be an adequate strategy for tackling this kind of problems es_ES
dc.description.sponsorship Work partially supported by the Spanish Ministerio de Economía y Competitividad under contract TIN2011-28169-C05-01 and FPU Grant AP2010-4193, and by the Vic. d’Investigació of the UPV (PAID-06-10)
dc.format.extent 8 es_ES
dc.language Inglés es_ES
dc.publisher Springer Verlag (Germany) es_ES
dc.relation.ispartof Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications es_ES
dc.relation.ispartofseries Lecture Notes in Computer Science;
dc.rights Reserva de todos los derechos es_ES
dc.subject Spoken Term Detection es_ES
dc.subject Query-by-Example es_ES
dc.subject Automatic Speech Recognition es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title A phonetic-based approach to query-by-example spoken term detection es_ES
dc.type Capítulo de libro es_ES
dc.identifier.doi 10.1007/978-3-642-41822-8_63
dc.relation.projectID info:eu-repo/grantAgreement/MICINN//TIN2011-28169-C05-01/ES/TIMPANO-UPV: TECNOLOGIAS PARA LA INTERACCION CONVERSACIONAL COMPLAJE PERSONA-MAQUINA CON APRENDIZAJE DINAMICO/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MECD//AP2010-4193/ES/AP2010-4193/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/UPV//PAID-06-10/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Hurtado Oliver, LF.; Calvo Lance, M.; Gómez Adrian, JA.; García Granada, F.; Sanchís Arnal, E. (2013). A phonetic-based approach to query-by-example spoken term detection. En Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Springer Verlag (Germany). 8529:504-511. https://doi.org/10.1007/978-3-642-41822-8_63 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename 18th Iberoamerican Congress, CIARP 2013 es_ES
dc.relation.conferencedate November 20-23, 2013, es_ES
dc.relation.conferenceplace Havana, Cuba es_ES
dc.relation.publisherversion http://link.springer.com/chapter/10.1007/978-3-642-41822-8_63 es_ES
dc.description.upvformatpinicio 504 es_ES
dc.description.upvformatpfin 511 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 8529 es_ES
dc.relation.senia 255946
dc.contributor.funder Universitat Politècnica de València
dc.contributor.funder Ministerio de Ciencia e Innovación es_ES
dc.contributor.funder Ministerio de Educación, Cultura y Deporte es_ES
dc.description.references Anguera, X., Macrae, R., Oliver, N.: Partial sequence matching using an unbounded dynamic time warping algorithm. In: ICASSP, pp. 3582–3585 (2010) es_ES
dc.description.references Hazen, T., Shen, W., White, C.: Query-by-example spoken term detection using phonetic posteriorgram templates. In: ASRU, pp. 421–426 (2009) es_ES
dc.description.references Zhang, Y., Glass, J.: Unsupervised spoken keyword spotting via segmental DTW on gaussian posteriorgrams. In: ASRU, pp. 398–403 (2009) es_ES
dc.description.references Akbacak, M., Vergyri, D., Stolcke, A.: Open-vocabulary spoken term detection using graphone-based hybrid recognition systems. In: ICASSP, pp. 5240–5243 (2008) es_ES
dc.description.references Fiscus, J.G., Ajot, J., Garofolo, J.S., Doddingtion, G.: Results of the 2006 spoken term detection evaluation. In: Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational, pp. 51–55 (2007) es_ES
dc.description.references Metze, F., Barnard, E., Davel, M., Van Heerden, C., Anguera, X., Gravier, G., Rajput, N., et al.: The spoken web search task. In: Working Notes Proceedings of the MediaEval 2012 Workshop (2012) es_ES
dc.description.references Gómez, J.A., Castro, M.J.: Automatic segmentation of speech at the phonetic level. In: Caelli, T.M., Amin, A., Duin, R.P.W., Kamel, M.S., de Ridder, D. (eds.) SSPR & SPR 2002. LNCS, vol. 2396, pp. 672–680. Springer, Heidelberg (2002) es_ES
dc.description.references Gómez, J.A., Sanchis, E., Castro-Bleda, M.J.: Automatic speech segmentation based on acoustical clustering. In: Hancock, E.R., Wilson, R.C., Windeatt, T., Ulusoy, I., Escolano, F. (eds.) SSPR & SPR 2010. LNCS, vol. 6218, pp. 540–548. Springer, Heidelberg (2010) es_ES
dc.description.references Moreno, A., Poch, D., Bonafonte, A., Lleida, E., Llisterri, J., Marino, J., Nadeu, C.: Albayzin speech database: Design of the phonetic corpus. In: Third European Conference on Speech Communication and Technology (1993) es_ES
dc.description.references Park, A., Glass, J.: Towards unsupervised pattern discovery in speech. In: ASRU, pp. 53–58 (2005) es_ES
dc.description.references Kullback, S.: Information theory and statistics. Courier Dover Publications (1997) es_ES
dc.description.references MAVIR corpus, http://www.lllf.uam.es/ESP/CorpusMavir.html es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem