Automatic selection of HPSG-parsed sentences for treebank construction

Publication date

2019-02-25T09:40:00Z

2019-02-25T09:40:00Z

2014

Abstract

This article presents an ensemble parse approach to detecting and selecting high-quality linguistic analyses output by a hand-crafted HPSG grammar of Spanish implemented in the LKB system. The approach uses full agreement (i.e., exact syntactic match) along with a MaxEnt parse selection model and a statistical dependency parser trained on the same data. The ultimate goal is to develop a hybrid corpus annotation methodology that combines fully automatic annotation and manual parse selection, in order to make the annotation task more efficient while maintaining high accuracy and the high degree of consistency necessary for any foreseen uses of a treebank.


This work was supported by grant Ramón y Cajal from Spanish MICINN and the project METANET4U. We thank the reviewers for their comments and Carlos Morell for his support.

Document Type

Article


Published version

Language

English

Publisher

ACL (Association for Computational Linguistics)

Related items

Computational Linguistics. 2014 Sep;40(3):523-31.

info:eu-repo/grantAgreement/EC/FP7/270893

Recommended citation

This citation was generated automatically.

Rights

© ACL, Creative Commons Attribution 4.0 License

https://creativecommons.org/licenses/by/4.0/

This item appears in the following Collection(s)