Knowledge-based and data-driven approaches for georeferencing of informal documents

Inicio | ¿Qué es? | Contacto

English | Català

Consultar RECERCAT

Por comunidades y
colecciones Por fecha Por autores Por títulos Por temas (CDU)

Consultar departamento

Por fecha Por autores Por títulos Por temas (CDU)

Estadisticas

Del documento Todo RECERCAT

Mi RECERCAT

Entrar Alertas por correo-e

Directorio de otros repositorios

RECERCAT Principal > Universitat Politècnica de Catalunya > Documents de recerca > Visualizar documento

Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/2117/86563

Título:	Knowledge-based and data-driven approaches for georeferencing of informal documents
Autor/a:	Ferrés Domènech, Daniel; Rodríguez Hontoria, Horacio
Otros autores:	Universitat Politècnica de Catalunya. Departament de Ciències de la Computació; Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
Abstract:	This paper describes Knowledge-Based and Data-Driven approaches we have followed for generic Textual Georeferencing of Informal Documents. Textual georeferencing consists in assigning a set of geographical coordinates to formal (news, reports,..) or informal (blogs, social networks, chats, tagsets,...) texts and documents. The system presented in this paper has been designed to deal with informal documents from social sites. The paper describes four Georeferencing approaches, experiments, and results at the MediaEval 2014 Placing Task (ME2014PT) evaluation, and posterior experiments. The task consisted of predicting the most probable geographical coordinates of Flickr images and videos using its visual, audio and metadata associated features. Our approaches used only Flickr users textual metadata annotations and tagsets. The four approaches used for this task were: 1) a Geographical Knowledge-Based (GeoKB) approach that uses Toponym Disambiguation heuristics, 2) the Hiemstra Language Model (HLM), TFIDF and BM25 Information Retrieval (IR) approaches with Re-Ranking, 3) a combination of the GeoKB and the IR models with Re-Ranking (GeoFusion). 4) a combination of the GeoFusion with a HLM model derived from the English Wikipedia georeferenced pages. The HLM approach with Re-Ranking showed the best performance in accuracy within a margin of distance errors ranging from 10m to 1km. The GeoFusion approaches achieved the best results in accuracies from 10km to 5,000km. Both approaches achieved state-of-the-art results at ME2014PT evaluation and posterior experiments, including the best results for distance accuracies of 1000km and 5,000km in the task where only the official training dataset can be used to predict the coordinates.
Abstract:	Peer Reviewed
Materia(s):	-Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural -Georeference -Textual georeferencing -Toponym disambiguation -Language models -Information retrieval -Geographical gazetteers -Georeferenciació
Derechos:
Tipo de documento:	Artículo - Versión publicada Objeto de conferencia
Editor:	Springer
Compartir:

Mostrar el registro completo del ítem

Documentos relacionados

Otros documentos del mismo autor/a

TALP at MediaEval 2010 placing task: geographical focus detection of Flickr textual annotations

Ferrés Domènech, Daniel; Rodríguez Hontoria, Horacio

TALP at MediaEval 2011 Placing Task: georeferencing Flickr videos with geographical knowledge and information retrieval

Ferrés Domènech, Daniel; Rodríguez Hontoria, Horacio

TALP at WePS-3 2010

Ferrés Domènech, Daniel; Rodríguez Hontoria, Horacio

Georeferencing textual annotations and tagsets with geographical knowledge and language models

Ferrés Domènech, Daniel; Rodríguez Hontoria, Horacio

TALP-UPC at TREC 2005: Experiments using voting scheme among three heterogeneous QA systems

Ferrés Domènech, Daniel; Kanaan Izquierdo, Samir; González Pellicer, Edgar; Ageno Pulido, Alicia; Fuentes Fort, Maria; Rodríguez Hontoria, Horacio; Surdeanu, Mihai; Turmo Borras, Jorge

Accesibilidad | Aviso legal | Política de Cookies | Documentos de uso interno

Coordinación

Patrocinio