Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
2019-04-15
Semantic segmentation and depth estimation are two important tasks in computer vision, and many methods have been developed to tackle them. The two tasks are commonly addressed independently, but recent work has studied merging them into a single framework, under the assumption that jointly learning two highly correlated tasks lets each benefit the other and improves estimation accuracy. In this paper, depth estimation and semantic segmentation are jointly addressed from a single RGB input image with a unified convolutional neural network. We analyze two different architectures to evaluate which features are more relevant when shared by the two tasks and which features should be kept separate to achieve a mutual improvement. Our approaches are evaluated under two different scenarios designed to compare our results against single-task and multi-task methods. Qualitative and quantitative experiments demonstrate that our methodology outperforms state-of-the-art single-task approaches while obtaining competitive results compared with other multi-task methods.
Peer Reviewed
Postprint (author's final draft)
Article
English
UPC subject areas::Telecommunications engineering; Neural networks (Computer science); Image processing; depth estimation; semantic segmentation; convolutional neural networks; hybrid architecture
Multidisciplinary Digital Publishing Institute (MDPI)
https://www.mdpi.com/1424-8220/19/8/1795
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Open Access
Attribution-NonCommercial-NoDerivs 3.0 Spain