Description:
This document focuses on the development and integration of the first version of the Corpus Acquisition and Annotation (CAA)/nsubsystem in the PANACEA platform. This version incorporates a Corpus Acquisition Component (CAC) and a Cleanup and Normalization Component (CNC) as planned in Section 7 of D4.1 Technologies and tools for corpus creation, normalization and annotation. The present deliverable, together with D4.3, constitutes the second milestone of WP4.