ADN-classifier: automatically assigning denotation types to nominalizations

dc.contributor
Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics
dc.contributor
Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
dc.contributor.author
Peris, Aina
dc.contributor.author
Taulé, Mariona
dc.contributor.author
Boleda Torrent, Gemma
dc.contributor.author
Rodríguez Hontoria, Horacio
dc.date.issued
2010
dc.identifier
Peris, A. [et al.]. ADN-classifier: automatically assigning denotation types to nominalizations. In: International Conference on Language Resources and Evaluation. "International Conference on Language Resources and Evaluation". Valletta: 2010.
dc.identifier
2-9517408-6-7
dc.identifier
https://hdl.handle.net/2117/10374
dc.description.abstract
This paper presents the ADN-Classifier, an automatic classification system for Spanish deverbal nominalizations that aims to identify their semantic denotation (i.e. event, result, underspecified, or lexicalized). The classifier can be used for NLP tasks such as coreference resolution or paraphrase detection. To our knowledge, the ADN-Classifier is the first effort in the acquisition of denotations for nominalizations using Machine Learning. We compare the results of the classifier when using a decreasing number of Knowledge Sources, namely (1) the complete nominal lexicon (AnCora-Nom), which includes sense distinctions, (2) the nominal lexicon (AnCora-Nom) with the sense-specific information removed, (3) the nominalizations' context information obtained from a treebank corpus (AnCora-Es) and (4) the combination of the previous linguistic resources. In a realistic scenario, that is, without sense distinctions, the best results are achieved when taking into account the information declared in the lexicon (89.40% accuracy). This shows that the lexicon contains crucial information (such as argument structure) that corpus-derived features cannot substitute for.
dc.description.abstract
Peer Reviewed
dc.description.abstract
Postprint (published version)
dc.format
1 p.
dc.format
application/pdf
dc.language
eng
dc.rights
Open Access
dc.subject
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural
dc.subject
Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subject
Spanish Deverbal Nominalizations (Classification system)
dc.subject
ADN-Classifier (Automatic classification system)
dc.subject
Natural language processing (Computer science)
dc.subject
Computational linguistics -- Research
dc.subject
Lingüística computacional
dc.subject
Corpus (Lingüística)
dc.subject
Castellà -- Lexicografia
dc.title
ADN-classifier: automatically assigning denotation types to nominalizations
dc.type
Conference report

