Visual semantic re-ranker for text spotting

Inicio | ¿Qué es? | Contacto

English | Català

Consultar RECERCAT

Por comunidades y
colecciones Por fecha Por autores Por títulos Por temas (CDU)

Consultar departamento

Por fecha Por autores Por títulos Por temas (CDU)

Estadisticas

Del documento Todo RECERCAT

Mi RECERCAT

Entrar Alertas por correo-e

Directorio de otros repositorios

RECERCAT Principal > Universitat Politècnica de Catalunya > Documents de recerca > Visualizar documento

Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/2117/132950

Título:	Visual semantic re-ranker for text spotting
Autor/a:	Sabir, Ahmed; Moreno-Noguer, Francesc; Padró, Lluís
Otros autores:	Institut de Robòtica i Informàtica Industrial; Universitat Politècnica de Catalunya. Departament de Ciències de la Computació; Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI; Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
Abstract:	The final publication is available at link.springer.com
Abstract:	Many current state-of-the-art methods for text recognition are based on purely local information and ignore the semantic corre- lation between text and its surrounding visual context. In this paper, we propose a post-processing approach to improve the accuracy of text spotting by using the semantic relation between the text and the scene. We initially rely on an off-the-shelf deep neural network that provides a series of text hypotheses for each input image. These text hypotheses are then re-ranked using the semantic relatedness with the object in the image. As a result of this combination, the performance of the original network is boosted with a very low computational cost. The proposed framework can be used as a drop-in complement for any text-spotting algorithm that outputs a ranking of word hypotheses. We validate our approach on ICDAR’17 shared task dataset.
Abstract:	Peer Reviewed
Materia(s):	-Àrees temàtiques de la UPC::Informàtica::Automàtica i control -Computer vision -Text spotting -Deep learning -Semantic visual context -Classificació INSPEC::Pattern recognition::Computer vision
Derechos:
Tipo de documento:	Artículo - Versión presentada Objeto de conferencia
Compartir:

Mostrar el registro completo del ítem

Documentos relacionados

Otros documentos del mismo autor/a

XLike project language analysis services

Carreras, Xavier; Padró, Lluís; Zhang, Lei; Rettinger, Achim; Li, Zhixing; García Cuesta, Esteban; Agic, Zeljko; Bekavac, Bozo; Fortuna, Blaz; Stajner, Tadej

Language processing infrastructure in the XLike project

Padró, Lluís; Agic, Zeljko; Carreras, Xavier; Fortuna, Blaz; García Cuesta, Esteban; Li, Zhixing; Stajner, Tadej; Tadic, Marko

NLP4BPM : Natural language processing tools for business process management

Delicado Alcántara, Luis; Sánchez Ferreres, Josep; Carmona Vargas, Josep; Padró, Lluís

CARPANTA eats words you don't need from e-mail

Alonso, Laura; Casas Fernández, Bernardino; Castellón Masalles, Irene; Climent, Salvador (Climent Roca); Padró, Lluís

Lexicón computacional de marcadores del discurso

Alonso, Laura; Castellón Masalles, Irene; Padró, Lluís

Accesibilidad | Aviso legal | Política de Cookies | Documentos de uso interno

Coordinación

Patrocinio