|Title||Integrating a Natural Language Message Pre-Processor with UIMA|
|Publication Type||Journal Article|
|Year of Publication||2008|
|Authors||Nyberg, E, Riebling, E, Wang, RC, Frederking, R|
This paper describes the use of the Unstructured Information Management Architecture (UIMA) to integrate a set of natural language processing (NLP) tools in the RADAR system. The challenge was to define a common data model and a set of component interfaces for these tools, so that they could be integrated into a single system. The integrated system is used to pre-process each email arriving in the RADAR user’s IMAP store. We present a UIMA collection processing engine for RADAR, including a common type system for text analysis results and annotators for each of the NLP tools. The paper also includes an analysis of system performance and a discussion of the lessons learned through use the of UIMA for this integration task. 1.