|Title||Evaluating automatic annotation: automatically detecting and enriching instances of the dative alternation|
|Publication Type||Journal Article|
|Year of Publication||2011|
|Authors||Theijssen, D, Boves, L, van Halteren, H, Oostdijk, N|
|Journal||Language Resources and Evaluation|
|Keywords||Automatic annotation, Dative alternation, Intrinsic and extrinsic evaluation, Logistic regression, Syntactic alternation|
In this article, we automatically create two large and richly annotated data sets for studying the English dative alternation. With an intrinsic and an extrinsic evaluation, we address the question of whether such data sets that are obtained and enriched automatically are suitable for linguistic research, even if they contain errors. The extrinsic evaluation consists of building logistic regression models with these data sets. We conclude that the automatic approach for detecting instances of the dative alternation still needs human intervention, but that it is indeed possible to annotate the instances with features that are syntactic, semantic and discourse-related in nature. Only the automatic classification of the concreteness of nouns is problematic.