Wav2Pix Enhancement and evaluation of a speech-conditioned image generator

Consultar RECERCAT

Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/2117/131759

Título:	Wav2Pix Enhancement and evaluation of a speech-conditioned image generator; Video sign language generation conditioned by language
Autor/a:	Tubau Pires, Miquel
Otros autores:	Universitat Politècnica de Catalunya. Departament de Ciències de la Computació; Giró Nieto, Xavier; Belanche Muñoz, Luis Antonio; Cardoso Duarte, Amanda
Abstract:	We propose the enhancement and evaluation of a deep neural network that is trained from scratch in an end-to-end fashion, generating a face directly from the raw speech waveform without any additional identity information (e.g reference image or one-hot encoding).
Materia(s):	-Àrees temàtiques de la UPC::Informàtica -Machine learning -Computer vision -deep learning -adversarial learning -face synthesis -Aprenentatge automàtic -Visió per ordinador
Derechos:
Tipo de documento:	Trabajo fin de máster
Editor:	Universitat Politècnica de Catalunya
Compartir:

Tubau Pires, Miquel

Coordinación

Patrocinio