Title:
|
Video object linguistic grounding
|
Author:
|
Herrera-Palacio, Alba; Ventura, Carles; Giró Nieto, Xavier
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo |
Abstract:
|
The goal of this work is to segment, in a video sequence, the objects mentioned in a linguistic description of the scene. We adapted an existing deep neural network that achieves state-of-the-art performance in semi-supervised video object segmentation by adding a linguistic branch that generates an attention map over the video frames, making the segmentation of the objects temporally consistent along the sequence. |
Abstract:
|
Peer Reviewed |
Subject(s):
|
-UPC subject areas::Computer science::Artificial intelligence -Neural networks (Computer science) -Linguistics -Image processing -- Digital techniques -Video object grounding -Neural networks -Neural networks (Computer science) -- Applications |
Rights:
|
|
Document type:
|
Article - Published version; Conference Object |
Published by:
|
Association for Computing Machinery (ACM)
|