Spanish morphological generation with wide-coverage lexicons and decision trees

Publication date

2017-03-10T12:49:21Z

2017-03-10T12:49:21Z

2017

Abstract

Morphological Generation is the task of producing the appropiate in- flected form of a lemma in a given textual context and according to some morphological features. This paper describes and evaluates wide-coverage morphological lexicons and a Decision Tree algorithm that perform Morphological Generation in Spanish at state-of-the art level. The Freeling, Leffe and Apertium Spanish lexicons, the J48 Decision Tree algorithm and the combination of J48 with Freeling and Leffe lexicons have been evaluated with the following datasets for Spanish: i) CoNLL2009 Shared Task dataset, ii) Durrett and DeNero dataset of Spanish Verbs (DDN), and iii) SIGMORPHON 2016 Shared Task (task-1) dataset. The results show that: i) the Freeling and Leffe lexicons achieve high coverage and precision over the DDN and SIGMORPHON 2016 datasets, ii) the J48 algorithm achieves state-of-the-art results in all of the three datasets, and iii) the combination of Freeling, Leffe and the J48 algorithm outperformed the results of our other approaches in the three evaluation datasets, improved slightly the results of the CoNLL2009 and SIGMORPHON 2016 reported in the state-of-the-art literature, and achieved results comparable to the ones reported in the state-of-the-art literature on the DDN dataset evaluation.


This work was partly funded by the ABLETO-INCLUDE project (European Commission CIP Grant No. 621055), the TUNER project (TIN2015-65308-C5-5-R, MINECO/FEDER, UE), and the Spanish MINECO Ministry (MDM-2015-0502).

Document Type

Article


Published version

Language

English

Publisher

Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)

Related items

Procesamiento del Lenguaje Natural. 2017;58:109-116

info:eu-repo/grantAgreement/EC/FP7/621055

info:eu-repo/grantAgreement/Es/1PE/TIN2015-65308-C5-5-R

Recommended citation

This citation was generated automatically.

Rights

© Sociedad Española para el Procesamiento de Lenguaje Natural

This item appears in the following Collection(s)