English-French and English-Greek parallel corpus for the Environment and Labour Legislation domains

Citació

  • Pecina P, Toral A, Papavassiliou V, Prokopidis P, Arranz V. English-French and English-Greek parallel corpus for the Environment and Labour Legislation domains [Internet]. Final report 13 Oct 2010. 15 p. (Panacea Project. Work Package Reports, no. D5.3). Available from:

Enllaç permanent

Descripció

  • Descripció

    This report describes the deliverable D5.3 of the PANACEA project: parallel, sententially aligned texts, cleaned and prepared for training-building translation models. This deliverable is/nan outcome of the work packages WP4.1, WP4.2, and WP5.1. The resulting domain-specific parallel corpus includes a total of 4 million words for two language pairs: English–French and/nEnglish–Greek and two domains: Environment and Labour Legislation. The data for each language pair and domain is split into three sets: training data for building translation models, development data for development purposes, and test data for testing and evaluation.
  • Mostra el registre complet