Iarg-AnCora: Spanish corpus annotated with implicit arguments

Taulé Delor, Mariona; Peris Morant, Aina; Rodríguez Hontoria, Horacio; Taulé Delor, Mariona; Peris Morant, Aina; Rodríguez Hontoria, Horacio

Iarg-AnCora: Spanish corpus annotated with implicit arguments

Autor/a

Taulé Delor, Mariona

Peris Morant, Aina

Rodríguez Hontoria, Horacio

Fecha de publicación

2020-10-21T13:29:46Z

2016-09-01

2020-10-21T13:29:47Z

Resumen

This article presents the Spanish Iarg-AnCora corpus (400 k-words, 13,883 sentences) annotated with the implicit arguments of deverbal nominalizations (18,397 occurrences). We describe the methodology used to create it, focusing on the annotation scheme and criteria adopted. The corpus was manually annotated and an interannotator agreement test was conducted (81 % observed agreement) in order to ensure the reliability of the final resource. The annotation of implicit arguments results in an important gain in argument and thematic role coverage (128 % on average). It is the first corpus annotated with implicit arguments for the Spanish language with a wide coverage that is freely available. This corpus can subsequently be used by machine learning-based semantic role labeling systems, and for the linguistic analysis of implicit arguments grounded on real data. Semantic analyzers are essential components of current language technology applications, which need to obtain a deeper understanding of the text in order to make inferences at the highest level to obtain qualitative improvements in the results.

Tipo de documento

Artículo

Versión aceptada

Lengua

Inglés

Materias y palabras clave

Corpus (Lingüística); Semàntica; Castellà (Llengua); Corpora (Linguistics); Semantics; Spanish language

Publicado por

Springer Verlag

Documentos relacionados

Versió postprint del document publicat a: https://doi.org/10.1007/s10579-015-9334-3

Language Resources And Evaluation, 2016, vol. 50, num. 3, p. 549-584

https://doi.org/10.1007/s10579-015-9334-3

Citación recomendada

Esta citación se ha generado automáticamente.

Exportar

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Derechos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Filologia Catalana i Lingüística General [953]

ISGlobal - Institut de Salut Global de Barcelona [61457]

Iarg-AnCora: Spanish corpus annotated with implicit arguments

Autor/a

Fecha de publicación

Compartir

Resumen

Tipo de documento

Lengua

Materias y palabras clave

Publicado por

Documentos relacionados

Citación recomendada

Exportar

Derechos

Este ítem aparece en la(s) siguiente(s) colección(ones)