To access the full-text document, please follow this link: http://hdl.handle.net/10230/35242

TALN at SemEval-2016 Task 11: modelling complex words by contextual, lexical and semantic features
Ronzano, Francesco; AbuRa’ed, Ahmed; Espinosa-Anke, Luis; Saggion, Horacio
Paper presented at the 10th International Workshop on Semantic Evaluation (SemEval 2016), held on 16 and 17 June 2016 in San Diego, USA.
This paper presents the participation of the TALN team in the Complex Word Identification Task of SemEval-2016 (Task 11). The purpose of the task was to determine whether a word in a given sentence would be judged as complex by a certain target audience. To support experiments with word complexity identification approaches, the task organizers provided a training set of 2,237 words, each judged as complex or not by 20 human evaluators, together with the sentence in which each word occurs. In our contribution, we modelled each word to be evaluated as a numeric vector populated with a set of lexical, semantic and contextual features that may help assess its complexity. We trained a Random Forest classifier to automatically decide whether each word is complex. We submitted two runs, trained respectively on unweighted and weighted instances of complex words, where the weight of each instance is proportional to the number of evaluators that judged the word as complex. Our system was the third best performing one.
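The difference between the two submitted runs can be illustrated with a minimal sketch of how per-instance weights might be derived from the evaluators' judgements. This is not the authors' implementation: the `Instance` type, the toy words and vote counts, the rule that a word is labelled complex when at least one evaluator flags it, and the default weight of 1.0 for non-complex words are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

# From the abstract: each training word was judged by 20 human evaluators.
NUM_EVALUATORS = 20


@dataclass
class Instance:
    """Hypothetical container for one training word (not from the paper)."""
    word: str
    complex_votes: int  # number of evaluators who judged the word complex


def label_and_weight(inst: Instance, weighted: bool) -> Tuple[int, float]:
    """Return (label, weight) for one training instance."""
    if not 0 <= inst.complex_votes <= NUM_EVALUATORS:
        raise ValueError("vote count must lie between 0 and NUM_EVALUATORS")
    # Assumption: a word is labelled complex if at least one evaluator flagged it.
    is_complex = inst.complex_votes > 0
    if weighted and is_complex:
        # Weighted run: weight proportional to the number of evaluators
        # who judged the word complex (proportionality constant 1 here).
        weight = float(inst.complex_votes)
    else:
        # Unweighted run, or a non-complex word: assumed default weight.
        weight = 1.0
    return int(is_complex), weight


def build_training_set(instances: List[Instance],
                       weighted: bool) -> Tuple[List[int], List[float]]:
    """Collect parallel lists of labels and instance weights."""
    pairs = [label_and_weight(inst, weighted) for inst in instances]
    labels = [label for label, _ in pairs]
    weights = [weight for _, weight in pairs]
    return labels, weights


# Toy data, invented for illustration only.
toy = [Instance("cat", 0), Instance("ubiquitous", 12), Instance("ledger", 1)]
unweighted_run = build_training_set(toy, weighted=False)
weighted_run = build_training_set(toy, weighted=True)
```

Under these assumptions, the resulting weights would be handed to the classifier as per-instance weights during training, so that a word flagged by many evaluators influences the learned trees more than a word flagged by only one.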
This work is partly supported by the Spanish Ministry of Economy and Competitiveness under the Maria de Maeztu Units of Excellence Programme (MDM-2015-0502) and the ABLE-TO-INCLUDE Project (Competitivity and Innovation Programme of the European Commission, CIP-ICT-PSP-2013-7/621055).
-Natural language processing (Computer science)
© ACL, Creative Commons Attribution 4.0 License
http://creativecommons.org/licenses/by/4.0/
Conference Object
Article - Published version
ACL (Association for Computational Linguistics)
         
