Transferencia de timbre acústico en tiempo real mediante técnicas de aprendizaje profundo

Kramer Savelev, Andrey

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Transferencia de timbre acústico en tiempo real mediante técnicas de aprendizaje profundo

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: Kramer - Transferencia ...

Tamaño: 1.735Mb

Formato: PDF

Solicitar una copia al autor

dc.contributor.advisor	Silvestre Cerdà, Joan Albert	es_ES
dc.contributor.advisor	López Esquibel, Federico José	es_ES
dc.contributor.author	Kramer Savelev, Andrey	es_ES
dc.date.accessioned	2020-07-15T08:54:18Z
dc.date.available	2020-07-15T08:54:18Z
dc.date.created	2020-07-14
dc.date.issued	2020-07-15	es_ES
dc.identifier.uri	http://hdl.handle.net/10251/148037
dc.description.abstract	[ES] La transferencia de timbre acústico es una aplicación de la teoría del procesamiento de la señal, que tiene como objeto transformar el timbre de una señal de audio en otro timbre completamente distinto. Un ejemplo representativo sería transformar la voz de un hombre cantando en música de violín, manteniendo el tono y la expresividad originales de la voz humana. Esta aplicación viene experimentando, en los últimos años, un creciente interés por parte de la industria de los videojuegos, entre otras, ya que permite a los jugadores dotar de voces personalizadas a sus identidades digitales (avatares), enriqueciendo significativamente su experiencia «gaming». Prueba de este interés es el reciente desarrollo por parte de Google de la librería de código abierto Differentiable Digital Signal Processing (DDSP), que permite solucionar el problema de la transferencia de timbre acústico mediante técnicas de aprendizaje profundo. No obstante, esta librería solo permite trabajar en contextos «off-line», esto es, requiere procesar toda la señal del audio para poder generar la transformación acústica (generación en diferido). El objetivo principal de este trabajo es adaptar esta tecnología a entornos de «streaming», lo cual permitiría realizar la transformación de timbre al mismo tiempo que se genera la señal acústica original (generación en tiempo real). Para ello, se propone y se evalúa una modificación de la arquitectura interna de la librería, basada en redes neuronales, que satisfaga las restricciones temporales inherentes del contexto de «streaming».	es_ES
dc.description.abstract	[EN] Acoustic timbre transfer is an application of the theory of signal processing, the aim of which is to transform the timbre of an audio signal into a completely different timbre. A representative example would be to transform the voice of a man singing into violin music, keeping the original tone and expressiveness of the human voice. This application has experienced a growing interest among the video game and other industries over the last years, as it allows gamers to provide personalized voices to their digital identities (avatars), significantly enriching their gaming experience. Proof of this interest is the recent development by Google of the open source library Differentiable Digital Signal Processing (DDSP), which allows the problem of acoustic timbre transfer to be solved using deep learning techniques. Nevertheless, this library is only capable of working in off-line contexts, in other words, it requires processing the entire audio signal in order to generate the acoustic transformation (non-real-time generation). The main objective of this work is to adapt this technology to streaming environments, which would allow the timbre to be transformed at the same time as the original acoustic signal is generated (real-time generation). To this end, a modification of the internal architecture of the library, based in neural networks, that would satisfy the temporal constraints inherent to the streaming context is proposed and evaluated.	es_ES
dc.format.extent	46	es_ES
dc.language	Español	es_ES
dc.publisher	Universitat Politècnica de València	es_ES
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Procesamiento digital de señales	es_ES
dc.subject	Aprendizaje profundo	es_ES
dc.subject	Tiempo real	es_ES
dc.subject	Transferencia de timbre	es_ES
dc.subject	Transformación de voz	es_ES
dc.subject	Digital signal processing	es_ES
dc.subject	Deep learning	es_ES
dc.subject	Real-time	es_ES
dc.subject	Timbre transfer	es_ES
dc.subject	Voice transformation	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.subject.other	Grado en Ingeniería Informática-Grau en Enginyeria Informàtica	es_ES
dc.title	Transferencia de timbre acústico en tiempo real mediante técnicas de aprendizaje profundo	es_ES
dc.type	Proyecto/Trabajo fin de carrera/grado	es_ES
dc.rights.accessRights	Cerrado	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica	es_ES
dc.description.bibliographicCitation	Kramer Savelev, A. (2020). Transferencia de timbre acústico en tiempo real mediante técnicas de aprendizaje profundo. http://hdl.handle.net/10251/148037	es_ES
dc.description.accrualMethod	TFGM	es_ES
dc.relation.pasarela	TFGM\129822	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

ETSINF - Trabajos académicos [5160]
Escola Tècnica Superior d'Enginyeria Informàtica

Mostrar el registro sencillo del ítem

Transferencia de timbre acústico en tiempo real mediante técnicas de aprendizaje profundo

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Transferencia de timbre acústico en tiempo real mediante técnicas de aprendizaje profundo

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)