Mostrar el registro sencillo del ítem
dc.contributor.author | Hurtado Oliver, Lluis Felip | es_ES |
dc.contributor.author | Calvo Lance, Marcos | es_ES |
dc.contributor.author | Gómez Adrian, Jon Ander | es_ES |
dc.contributor.author | García Granada, Fernando | es_ES |
dc.contributor.author | Sanchís Arnal, Emilio | es_ES |
dc.date.accessioned | 2014-07-03T17:37:24Z | |
dc.date.issued | 2013 | |
dc.identifier.isbn | 978-3-642-41821-1 | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.uri | http://hdl.handle.net/10251/38560 | |
dc.description | The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-41822-8_63 | es_ES |
dc.description.abstract | Query-by-Example Spoken Term Detection (QbE-STD) tasks are usually addressed by representing speech signals as a sequence of feature vectors by means of a parametrization step, and then using a pattern matching technique to find the candidate detections. In this paper, we propose a phoneme-based approach in which the acoustic frames are first converted into vectors representing the a posteriori probabilities for every phoneme. This strategy is specially useful when the language of the task is a priori known. Then, we show how this representation can be used for QbE-STD using both a Segmental Dynamic Time Warping algorithm and a graph-based method. The proposed approach has been evaluated with a QbE-STD task in Spanish, and the results show that it can be an adequate strategy for tackling this kind of problems | es_ES |
dc.description.sponsorship | Work partially supported by the Spanish Ministerio de Economía y Competitividad under contract TIN2011-28169-C05-01 and FPU Grant AP2010-4193, and by the Vic. d’Investigació of the UPV (PAID-06-10) | |
dc.format.extent | 8 | es_ES |
dc.language | Inglés | es_ES |
dc.publisher | Springer Verlag (Germany) | es_ES |
dc.relation.ispartof | Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications | es_ES |
dc.relation.ispartofseries | Lecture Notes in Computer Science; | |
dc.rights | Reserva de todos los derechos | es_ES |
dc.subject | Spoken Term Detection | es_ES |
dc.subject | Query-by-Example | es_ES |
dc.subject | Automatic Speech Recognition | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | A phonetic-based approach to query-by-example spoken term detection | es_ES |
dc.type | Capítulo de libro | es_ES |
dc.identifier.doi | 10.1007/978-3-642-41822-8_63 | |
dc.relation.projectID | info:eu-repo/grantAgreement/MICINN//TIN2011-28169-C05-01/ES/TIMPANO-UPV: TECNOLOGIAS PARA LA INTERACCION CONVERSACIONAL COMPLAJE PERSONA-MAQUINA CON APRENDIZAJE DINAMICO/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MECD//AP2010-4193/ES/AP2010-4193/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/UPV//PAID-06-10/ | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.description.bibliographicCitation | Hurtado Oliver, LF.; Calvo Lance, M.; Gómez Adrian, JA.; García Granada, F.; Sanchís Arnal, E. (2013). A phonetic-based approach to query-by-example spoken term detection. En Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Springer Verlag (Germany). 8529:504-511. https://doi.org/10.1007/978-3-642-41822-8_63 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.conferencename | 18th Iberoamerican Congress, CIARP 2013 | es_ES |
dc.relation.conferencedate | November 20-23, 2013, | es_ES |
dc.relation.conferenceplace | Havana, Cuba | es_ES |
dc.relation.publisherversion | http://link.springer.com/chapter/10.1007/978-3-642-41822-8_63 | es_ES |
dc.description.upvformatpinicio | 504 | es_ES |
dc.description.upvformatpfin | 511 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 8529 | es_ES |
dc.relation.senia | 255946 | |
dc.contributor.funder | Universitat Politècnica de València | |
dc.contributor.funder | Ministerio de Ciencia e Innovación | es_ES |
dc.contributor.funder | Ministerio de Educación, Cultura y Deporte | es_ES |
dc.description.references | Anguera, X., Macrae, R., Oliver, N.: Partial sequence matching using an unbounded dynamic time warping algorithm. In: ICASSP, pp. 3582–3585 (2010) | es_ES |
dc.description.references | Hazen, T., Shen, W., White, C.: Query-by-example spoken term detection using phonetic posteriorgram templates. In: ASRU, pp. 421–426 (2009) | es_ES |
dc.description.references | Zhang, Y., Glass, J.: Unsupervised spoken keyword spotting via segmental DTW on gaussian posteriorgrams. In: ASRU, pp. 398–403 (2009) | es_ES |
dc.description.references | Akbacak, M., Vergyri, D., Stolcke, A.: Open-vocabulary spoken term detection using graphone-based hybrid recognition systems. In: ICASSP, pp. 5240–5243 (2008) | es_ES |
dc.description.references | Fiscus, J.G., Ajot, J., Garofolo, J.S., Doddingtion, G.: Results of the 2006 spoken term detection evaluation. In: Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational, pp. 51–55 (2007) | es_ES |
dc.description.references | Metze, F., Barnard, E., Davel, M., Van Heerden, C., Anguera, X., Gravier, G., Rajput, N., et al.: The spoken web search task. In: Working Notes Proceedings of the MediaEval 2012 Workshop (2012) | es_ES |
dc.description.references | Gómez, J.A., Castro, M.J.: Automatic segmentation of speech at the phonetic level. In: Caelli, T.M., Amin, A., Duin, R.P.W., Kamel, M.S., de Ridder, D. (eds.) SSPR & SPR 2002. LNCS, vol. 2396, pp. 672–680. Springer, Heidelberg (2002) | es_ES |
dc.description.references | Gómez, J.A., Sanchis, E., Castro-Bleda, M.J.: Automatic speech segmentation based on acoustical clustering. In: Hancock, E.R., Wilson, R.C., Windeatt, T., Ulusoy, I., Escolano, F. (eds.) SSPR & SPR 2010. LNCS, vol. 6218, pp. 540–548. Springer, Heidelberg (2010) | es_ES |
dc.description.references | Moreno, A., Poch, D., Bonafonte, A., Lleida, E., Llisterri, J., Marino, J., Nadeu, C.: Albayzin speech database: Design of the phonetic corpus. In: Third European Conference on Speech Communication and Technology (1993) | es_ES |
dc.description.references | Park, A., Glass, J.: Towards unsupervised pattern discovery in speech. In: ASRU, pp. 53–58 (2005) | es_ES |
dc.description.references | Kullback, S.: Information theory and statistics. Courier Dover Publications (1997) | es_ES |
dc.description.references | MAVIR corpus, http://www.lllf.uam.es/ESP/CorpusMavir.html | es_ES |