You are here

Elliphant: Improved Automatic Detection of Zero Subjects and Impersonal Constructions in Spanish

TitleElliphant: Improved Automatic Detection of Zero Subjects and Impersonal Constructions in Spanish
Publication TypeConference Paper
Year of Publication2012
AuthorsRello, L, Baeza-Yates, R, Mitkov, R
Conference Name13th Conference of the European chapter of the Association for Computational Linguistics (EACL 2012)
PublisherAssociation for Computational Linguistics
Conference LocationAvignon, France
Abstract

In pro-drop languages, the detection of explicit subjects, zero subjects and non-referential impersonal constructions is crucial for anaphora and co-reference resolution. While the identification of explicit and zero subjects has attracted the attention of researchers in the past, the automatic identification of impersonal constructions in Spanish has not been addressed yet and this work is the first such study. In this paper we present a corpus to underpin research on the automatic detection of these linguistic phenomena in Spanish and a novel machine learning-based methodology for their computational treatment. This study also provides an analysis of the features, discusses performance across two different genres and offers error analysis.
The evaluation results show that our system performs better in detecting explicit subjects than alternative systems.

URLhttp://www.aclweb.org/anthology-new/E/E12/E12-1072.pdf