Title:
|
MultiScien: a bi-lingual natural language processing system for mining and enrichment of scientific collections
|
Author:
|
Saggion, Horacio; Ronzano, Francesco; Accuosto, Pablo; Ferrés, Daniel
|
Abstract:
|
Paper presented at: 2nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL 2017), held in Tokyo on 11 August 2017. |
Abstract:
|
In the current online Open Science context, scientific datasets
and tools for deep text analysis, visualization and exploitation play a major
role. We present a system for deep analysis and annotation of scientific
text collections. We also introduce the first version of the SEPLN Anthology,
a bi-lingual (Spanish and English) fully annotated text resource
in the field of natural language processing that we created with our system.
Moreover, a faceted-search and visualization system to explore the
created resource is introduced. All resources created for this paper will
be available to the research community. |
Abstract:
|
This work is (partly) supported by the Spanish Ministry of Economy and Competitiveness
under the Maria de Maeztu Units of Excellence Programme (MDM-2015-0502)
and by the TUNER project (TIN2015-65308-C5-5-R, MINECO/FEDER,
UE). |
Subject(s):
|
Language Resources; Scientific Text Corpora; Information Extraction; Data Visualization; Semantic Analysis; PDF Conversion |
Rights:
|
© 2017 for the individual papers by the papers' authors. Copying permitted for private and academic purposes. This volume is published and copyrighted by its editors.
|
Document type:
|
Conference Object Article - Accepted version |
Published by:
|
CEUR Workshop Proceedings
|