- -

A phonetic-based approach to query-by-example spoken term detection

RiuNet: Institutional repository of the Polithecnic University of Valencia

Share/Send to

Cited by

Statistics

A phonetic-based approach to query-by-example spoken term detection

Show full item record

Hurtado Oliver, LF.; Calvo Lance, M.; Gómez Adrian, JA.; García Granada, F.; Sanchís Arnal, E. (2013). A phonetic-based approach to query-by-example spoken term detection. En Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Springer Verlag (Germany). 8529:504-511. doi:10.1007/978-3-642-41822-8_63

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10251/38560

Files in this item

Item Metadata

Title: A phonetic-based approach to query-by-example spoken term detection
Author: Hurtado Oliver, Lluis Felip Calvo Lance, Marcos Gómez Adrian, Jon Ander García Granada, Fernando Sanchís Arnal, Emilio
UPV Unit: Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
Issued date:
Abstract:
Query-by-Example Spoken Term Detection (QbE-STD) tasks are usually addressed by representing speech signals as a sequence of feature vectors by means of a parametrization step, and then using a pattern matching technique ...[+]
Subjects: Spoken Term Detection , Query-by-Example , Automatic Speech Recognition
Copyrigths: Reserva de todos los derechos
ISBN: 978-3-642-41821-1
Source:
Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. (issn: 0302-9743 )
DOI: 10.1007/978-3-642-41822-8_63
Publisher:
Springer Verlag (Germany)
Publisher version: http://link.springer.com/chapter/10.1007/978-3-642-41822-8_63
Conference name: 18th Iberoamerican Congress, CIARP 2013
Conference place: Havana, Cuba
Conference date: November 20-23, 2013,
Series: Lecture Notes in Computer Science;
Project ID:
MINECO/TIN2011-28169-C05-01
MINECO/FPU-AP2010-4193
UPV/PAID-06-10
Description: The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-41822-8_63
Thanks:
Work partially supported by the Spanish Ministerio de Economía y Competitividad under contract TIN2011-28169-C05-01 and FPU Grant AP2010-4193, and by the Vic. d’Investigació of the UPV (PAID-06-10)
Type: Capítulo de libro

References

Anguera, X., Macrae, R., Oliver, N.: Partial sequence matching using an unbounded dynamic time warping algorithm. In: ICASSP, pp. 3582–3585 (2010)

Hazen, T., Shen, W., White, C.: Query-by-example spoken term detection using phonetic posteriorgram templates. In: ASRU, pp. 421–426 (2009)

Zhang, Y., Glass, J.: Unsupervised spoken keyword spotting via segmental DTW on gaussian posteriorgrams. In: ASRU, pp. 398–403 (2009) [+]
Anguera, X., Macrae, R., Oliver, N.: Partial sequence matching using an unbounded dynamic time warping algorithm. In: ICASSP, pp. 3582–3585 (2010)

Hazen, T., Shen, W., White, C.: Query-by-example spoken term detection using phonetic posteriorgram templates. In: ASRU, pp. 421–426 (2009)

Zhang, Y., Glass, J.: Unsupervised spoken keyword spotting via segmental DTW on gaussian posteriorgrams. In: ASRU, pp. 398–403 (2009)

Akbacak, M., Vergyri, D., Stolcke, A.: Open-vocabulary spoken term detection using graphone-based hybrid recognition systems. In: ICASSP, pp. 5240–5243 (2008)

Fiscus, J.G., Ajot, J., Garofolo, J.S., Doddingtion, G.: Results of the 2006 spoken term detection evaluation. In: Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational, pp. 51–55 (2007)

Metze, F., Barnard, E., Davel, M., Van Heerden, C., Anguera, X., Gravier, G., Rajput, N., et al.: The spoken web search task. In: Working Notes Proceedings of the MediaEval 2012 Workshop (2012)

Gómez, J.A., Castro, M.J.: Automatic segmentation of speech at the phonetic level. In: Caelli, T.M., Amin, A., Duin, R.P.W., Kamel, M.S., de Ridder, D. (eds.) SSPR & SPR 2002. LNCS, vol. 2396, pp. 672–680. Springer, Heidelberg (2002)

Gómez, J.A., Sanchis, E., Castro-Bleda, M.J.: Automatic speech segmentation based on acoustical clustering. In: Hancock, E.R., Wilson, R.C., Windeatt, T., Ulusoy, I., Escolano, F. (eds.) SSPR & SPR 2010. LNCS, vol. 6218, pp. 540–548. Springer, Heidelberg (2010)

Moreno, A., Poch, D., Bonafonte, A., Lleida, E., Llisterri, J., Marino, J., Nadeu, C.: Albayzin speech database: Design of the phonetic corpus. In: Third European Conference on Speech Communication and Technology (1993)

Park, A., Glass, J.: Towards unsupervised pattern discovery in speech. In: ASRU, pp. 53–58 (2005)

Kullback, S.: Information theory and statistics. Courier Dover Publications (1997)

MAVIR corpus, http://www.lllf.uam.es/ESP/CorpusMavir.html

[-]

This item appears in the following Collection(s)

Show full item record