Title:
|
An evaluation framework based on gold standard models for definition question answering
|
Author:
|
Kanaan Izquierdo, Samir; Turmo Borras, Jorge
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics; Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural; Universitat Politècnica de Catalunya. SISBIO - Senyals i Sistemes Biomèdics |
Abstract:
|
This paper presents a weak supervised evaluation framework for definition question answering (DefQA) called Solon. It automatically evaluates a set of DefQA systems using existing human definitions as gold standard models. This way it is able to overcome known limitations of the evaluation methods in the state of the art. In addition, Solon assumes that each DefQA task may require a different evaluation configuration, and it is able to automatically find the best one. The results obtained in our experiments show that Solon performs well with respect to the evaluation methods in the state of the art with the advantage that it is less supervised. |
Subject(s):
|
-Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial -Definition question answering -Evaluation |
Rights:
|
|
Document type:
|
Article - Published version Report |
Share:
|
|