Incorporating subject areas into the Apertium machine translation system

dc.contributor.authorDuran Cals, Jordi
dc.contributor.authorVillarejo Muñoz, Luis
dc.contributor.authorFarrús, Mireia
dc.contributor.authorOrtiz, Sergio
dc.contributor.authorRamírez, Gemma
dc.date.accessioned2020-10-30T10:37:31Z
dc.date.available2020-10-30T10:37:31Z
dc.date.issued2013
dc.description.abstractThe Universitat Oberta de Catalunya (Open University of Catalonia, UOC), is a public university based in Barcelona. The UOC is characterised by three main factors: (a) it is a virtual university based in an e-Learning model, (b) it is based in a strongly Spanish-Catalan bilingual region, and (c) students come from around the world, so that linguistic and cultural diversity is a crucial factor. Within this context, it becomes essential to meet the UOC’s linguistic needs taking into account its particular characteristics. One of the tools created to this end is the adaptation of Apertium, a free/open-source rule-based machine translation platform, which can be found under http://apertium.uoc.edu/, customised to the translation needs of the institution in order to offer the best possible service to their user community. In order to continue adapting and adding value to the existing tool for generalisable large-scale applications, the UOC’s translation system has recently implemented a semantic filter based on subject fields aimed at improving the translation quality and at better fitting the university needs. The paper will explain all the steps of this adaptive process, as well as a demonstration of the resulting tool: (a) the choice of the subject fields according to the university studies, (b) the design and implementation of the dictionaries used to extract the required information to filter and disambiguate homonym and polysemous terms, including source code in the dictionaries, and (c) the design and implementation of the corresponding web interface.
dc.format.mimetypeapplication/pdf
dc.identifier.citationDuran J, Villarejo L, Farrús M, Ortiz S, Ramírez G. Incorporating subject areas into the Apertium machine translation system. In: Przepiórkowski A, Piasecki M, Jassem K, Fuglewicz P, editors. Computational linguistics: Applications. Berlin: Springer; 2013. p. 281-92. DOI: 10.1007/978-3-642-34399-5_15
dc.identifier.doihttp://dx.doi.org/10.1007/978-3-642-34399-5_15
dc.identifier.isbn978-3-642-34399-5
dc.identifier.urihttp://hdl.handle.net/10230/45627
dc.language.isoeng
dc.publisherSpringer
dc.relation.ispartofPrzepiórkowski A, Piasecki M, Jassem K, Fuglewicz P, editors. Computational linguistics: Applications. Berlin: Springer; 2013.
dc.rights© Springer This is a author's accepted manuscript of: Duran J, Villarejo L, Farrús M, Ortiz S, Ramírez G. Incorporating subject areas into the Apertium machine translation system. In: Przepiórkowski A, Piasecki M, Jassem K, Fuglewicz P, editors. Computational linguistics: Applications. Cham: Springer; 2013. p. 281-92.. The final version is available online at: http://dx.doi.org/10.1007/978-3-642-34399-5_15
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.subject.keywordSubject area
dc.subject.keywordMachine translation
dc.subject.keywordLearn technology
dc.subject.keywordStatistical machine translation
dc.subject.keywordSubject field
dc.titleIncorporating subject areas into the Apertium machine translation system
dc.typeinfo:eu-repo/semantics/bookPart
dc.type.versioninfo:eu-repo/semantics/acceptedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
duran_complingu_incor.pdf
Size:
195.99 KB
Format:
Adobe Portable Document Format