- -

Speaker Localization and Detection in Videoconferencing Environments Using a Modified SRP-PHAT Algorithm

RiuNet: Institutional repository of the Polithecnic University of Valencia

Share/Send to

Cited by

Statistics

Speaker Localization and Detection in Videoconferencing Environments Using a Modified SRP-PHAT Algorithm

Show simple item record

Files in this item

dc.contributor.author Martí Guerola, Amparo es_ES
dc.contributor.author Cobos Serrano, Máximo es_ES
dc.contributor.author Aguilera Martí, Emanuel es_ES
dc.contributor.author López Monfort, José Javier es_ES
dc.date.accessioned 2015-11-17T16:15:54Z
dc.date.available 2015-11-17T16:15:54Z
dc.date.issued 2011
dc.identifier.issn 1889-8297
dc.identifier.uri http://hdl.handle.net/10251/57648
dc.description.abstract [EN] The Steered Response Power - Phase Transform (SRP-PHAT) algorithm has been shown to be one of the most robust sound source localization approaches operating in noisy and reverberant environments. However, its practical implementation is usually based on a costly fine grid-search procedure, making the computational cost of the method a real issue. In this paper, we introduce an effective strategy which performs a full exploration of the sampled space rather than computing the SRP at discrete spatial positions, increasing its robustness and allowing for a coarser spatial grid that reduces the computational cost required in a practical implementation. The modified SRP-PHAT functional has been successfully implemented in a real time speaker localization system for multiparticipant videoconferencing environments. Moreover, a localization-based speech-non speech frame discriminator is presented. es_ES
dc.description.sponsorship This work was supported by the Ministry of Education and Science under the project TEC2009-14414-C03-01.
dc.language Inglés es_ES
dc.publisher Instituto de Telecomunicaciones y Aplicaciones Multimedia (ITEAM)" es_ES
dc.relation info:eu-repo/grantAgreement/MICINN//TEC2009-14414-C03-01/ES/Procesado De Sonido Para Entornos Emergentes De Comunicacion/ es_ES
dc.relation.ispartof Waves es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Sound source localization es_ES
dc.subject SRP-PHAT es_ES
dc.subject Microphone array es_ES
dc.subject Speaker detection es_ES
dc.subject Speech enhancement es_ES
dc.subject.classification TEORIA DE LA SEÑAL Y COMUNICACIONES es_ES
dc.title Speaker Localization and Detection in Videoconferencing Environments Using a Modified SRP-PHAT Algorithm es_ES
dc.type Artículo es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Comunicaciones - Departament de Comunicacions es_ES
dc.contributor.affiliation Universitat Politècnica de València. Instituto Universitario de Telecomunicación y Aplicaciones Multimedia - Institut Universitari de Telecomunicacions i Aplicacions Multimèdia es_ES
dc.description.bibliographicCitation Martí Guerola, A.; Cobos Serrano, M.; Aguilera Martí, E.; López Monfort, JJ. (2011). Speaker Localization and Detection in Videoconferencing Environments Using a Modified SRP-PHAT Algorithm. Waves. 3:40-47. http://hdl.handle.net/10251/57648 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://www.iteam.upv.es/waves.php es_ES
dc.description.upvformatpinicio 40 es_ES
dc.description.upvformatpfin 47 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 3 es_ES
dc.relation.senia 212003 es_ES
dc.contributor.funder Ministerio de Educación y Ciencia
dc.contributor.funder Ministerio de Ciencia e Innovación es_ES


This item appears in the following Collection(s)

Show simple item record