<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
<channel rdf:about="http://hdl.handle.net/10230/6161">
<title>Departament de Traducció i Ciències del Llenguatge</title>
<link>http://hdl.handle.net/10230/6161</link>
<description/>
<items>
<rdf:Seq>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20422"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20421"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20420"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20419"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20418"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20417"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20416"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20384"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20383"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20382"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20381"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20380"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20329"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20328"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20327"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20326"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/20325"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/16994"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/16982"/>
<rdf:li rdf:resource="http://hdl.handle.net/10230/16967"/>
</rdf:Seq>
</items>
<dc:date>2013-05-22T07:56:27Z</dc:date>
</channel>
<item rdf:about="http://hdl.handle.net/10230/20422">
<title>Automatic Extraction of Polar Adjectives for the Creation of Polarity Lexicons</title>
<link>http://hdl.handle.net/10230/20422</link>
<description>Automatic Extraction of Polar Adjectives for the Creation of Polarity Lexicons
Vázquez, Silvia; Padró, Muntsa; Bel, Núria; Gonzalo, Julio
Automatic creation of polarity lexicons is a crucial issue to be solved in order to reduce the time and effort spent in the first steps of Sentiment Analysis. In this paper we present a methodology based on linguistic cues that allows us to automatically discover, extract and label subjective adjectives that should be collected in a domain-based polarity lexicon. For this purpose, we designed a bootstrapping algorithm that, from a small set of seed polar adjectives, is capable of iteratively identifying, extracting and annotating positive and negative adjectives. Additionally, the method automatically creates lists of highly subjective elements that change their prior polarity even within the same domain. The proposed algorithm reached a precision of 97.5% for positive adjectives and 71.4% for negative ones in the semantic orientation identification task.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20421">
<title>Tradução online</title>
<link>http://hdl.handle.net/10230/20421</link>
<description>Tradução online
Poch, Marc; Bel, Núria; Toral, Antonio; Hamon, Olivier; Quochi, Valeria
A strategic challenge in today's globalized world is overcoming the language barrier through technology. In Europe, a project is evaluating and developing a platform that will be able to translate the enormous amount of data (spoken and written) produced by people, in a process called "online translation" ("tradução online"). The details of the study follow.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20420">
<title>Automatic Lexical Semantic Classification of Nouns</title>
<link>http://hdl.handle.net/10230/20420</link>
<description>Automatic Lexical Semantic Classification of Nouns
Bel, Núria; Romeo, Lauren; Padró, Muntsa
The work we present here addresses cue-based noun classification in English and Spanish. Its main objective is to automatically acquire lexical semantic information by classifying nouns into previously known noun lexical classes. This is achieved by using particular aspects of linguistic contexts as cues that identify a specific lexical class. Here we concentrate on the task of identifying such cues and the theoretical background that allows for an assessment of the complexity of the task. The results show that, despite the a priori complexity of the task, cue-based classification is a useful tool in the automatic acquisition of lexical semantic classes.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20419">
<title>A Classification of Adjectives for Polarity Lexicons Enhancement</title>
<link>http://hdl.handle.net/10230/20419</link>
<description>A Classification of Adjectives for Polarity Lexicons Enhancement
Vázquez, Silvia; Bel, Núria
Subjective language detection is one of the most important challenges in Sentiment Analysis. Because of their weight and frequency in opinionated texts, adjectives are considered a key piece in the opinion extraction process. These subjective units are more and more frequently collected in polarity lexicons in which they appear annotated with their prior polarity. However, at the moment, no polarity lexicon takes into account prior polarity variations across domains. This paper proves that a majority of adjectives change their prior polarity value depending on the domain. We propose a distinction between domain-dependent and domain-independent adjectives. Moreover, our analysis led us to propose a further classification related to subjectivity degree: constant, mixed and highly subjective adjectives. Following this classification, polarity values will be a better support for Sentiment Analysis.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20418">
<title>Towards a User-Friendly Platform for Building Language Resources based on Web Services</title>
<link>http://hdl.handle.net/10230/20418</link>
<description>Towards a User-Friendly Platform for Building Language Resources based on Web Services
Poch, Marc; Toral, Antonio; Hamon, Olivier; Quochi, Valeria; Bel, Núria
This paper presents the platform developed in the PANACEA project, a distributed factory that automates the stages involved in the acquisition, production, updating and maintenance of Language Resources required by Machine Translation and other Language Technologies. We adopt a set of tools that have been successfully used in the Bioinformatics field; they are adapted to the needs of our field and used to deploy web services, which can be combined to build more complex processing chains (workflows). This paper describes the platform and its different components (web services, registry, workflows, social network and interoperability). We demonstrate the scalability of the platform by carrying out a set of massive data experiments. Finally, a validation of the platform across a set of required criteria proves its usability for different types of users (non-technical users and providers).
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20417">
<title>Towards the Fully Automatic Merging of Lexical Resources: A Step Forward</title>
<link>http://hdl.handle.net/10230/20417</link>
<description>Towards the Fully Automatic Merging of Lexical Resources: A Step Forward
Padró, Muntsa; Bel, Núria; Necsulescu, Silvia
This article reports on the results of research towards the fully automatic merging of lexical resources. Our main goal is to show the generality of the proposed approach, which has previously been applied to merge Spanish Subcategorization Frames lexica. In this work we extend and apply the same technique to perform the merging of morphosyntactic lexica encoded in LMF. The experiments showed that the technique is general enough to obtain good results in these two different tasks, which is an important step towards performing the merging of lexical resources fully automatically.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20416">
<title>Mining and exploiting domain-specific corpora in the PANACEA platform</title>
<link>http://hdl.handle.net/10230/20416</link>
<description>Mining and exploiting domain-specific corpora in the PANACEA platform
Bel, Núria; Papavasiliou, Vassilis; Prokopidis, Prokopis; Toral, Antonio; Arranz, Victoria
The objective of the PANACEA ICT-2007.2.2 EU project is to build a platform that automates the stages involved in the acquisition, production, updating and maintenance of the large language resources required by, among others, MT systems. The development of a Corpus Acquisition Component (CAC) for extracting monolingual and bilingual data from the web is one of the most innovative building blocks of PANACEA. The CAC, which is the first stage in the PANACEA pipeline for building Language Resources, adopts an efficient and distributed methodology to crawl for web documents with rich textual content in specific languages and predefined domains. The CAC includes modules that can acquire parallel data from sites with in-domain content available in more than one language. In order to extrinsically evaluate the CAC methodology, we have conducted several experiments that used crawled parallel corpora for the identification and extraction of parallel sentences using sentence alignment. The corpora were then successfully used for domain adaptation of Machine Translation Systems.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20384">
<title>Towards the automatic merging of language resources</title>
<link>http://hdl.handle.net/10230/20384</link>
<description>Towards the automatic merging of language resources
Necsulescu, Silvia; Bel, Núria; Padró, Muntsa; Marimón, Montserrat; Revilla, Eva
Language Resources are a critical component for Natural Language Processing applications. Throughout the years many resources were manually created for the same task, but with different granularity and coverage information. To create richer resources for a broad range of potential reuses, information from all resources has to be joined into one. The high cost of comparing and merging different resources by hand has been a bottleneck for merging existing resources. With the objective of reducing human intervention, we present a new method for automating the merging of resources. We have addressed the merging of two verb subcategorization frame (SCF) lexica for Spanish. The results achieved, a new lexicon with enriched information and conflicting information signalled, reinforce our idea that this approach can be applied to other NLP tasks.
</description>
<dc:date>2011-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20383">
<title>Using qualia information to identify lexical semantic classes in an unsupervised clustering task</title>
<link>http://hdl.handle.net/10230/20383</link>
<description>Using qualia information to identify lexical semantic classes in an unsupervised clustering task
Romeo, Lauren; Mendes, Sara; Bel, Núria
Acquiring lexical information is a complex problem, typically approached by relying on a number of contexts to contribute information for classification. One of the first issues to address in this domain is the determination of such contexts. The work presented here proposes the use of automatically obtained FORMAL role descriptors as features used to draw nouns from the same lexical semantic class together in an unsupervised clustering task. We have dealt with three lexical semantic classes (HUMAN, LOCATION and EVENT) in English. The results obtained show that it is possible to discriminate between elements from different lexical semantic classes using only FORMAL role information, hence validating our initial hypothesis. Also, iterating our method accurately accounts for fine-grained distinctions within lexical classes, namely distinctions involving ambiguous expressions. Moreover, a filtering and bootstrapping strategy employed in extracting FORMAL role descriptors proved to minimize effects of sparse data and noise in our task.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20382">
<title>PANACEA (Platform for Automatic, Normalised Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies)</title>
<link>http://hdl.handle.net/10230/20382</link>
<description>PANACEA (Platform for Automatic, Normalised Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies)
Bel, Núria; Poch, Marc; Toral, Antonio
The objective of PANACEA is to build a factory of LRs that automates the stages involved in the acquisition, production, updating and maintenance of LRs required by MT systems and by other applications based on language technologies, and simplifies potential issues regarding intellectual property rights. This automation will significantly cut down the cost, time and human effort involved. These reductions in cost and time are the only way to guarantee the continuous supply of LRs that MT and other language technologies will demand in a multilingual Europe.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20381">
<title>Efficiency-based evaluation of aligners for industrial applications</title>
<link>http://hdl.handle.net/10230/20381</link>
<description>Efficiency-based evaluation of aligners for industrial applications
Toral, Antonio; Poch, Marc; Pecina, Pavel; Thurmair, Gregor
This paper presents a novel efficiency-based evaluation of sentence and word aligners. This assessment is critical in order to make reliable use of aligners in industrial scenarios. The evaluation shows that the resources required by aligners differ rather broadly. Subsequently, we establish limitation mechanisms on a set of aligners deployed as web services. These results, paired with the quality expected from the aligners, allow providers to choose the most appropriate aligner according to the task at hand.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20380">
<title>Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies: PANACEA</title>
<link>http://hdl.handle.net/10230/20380</link>
<description>Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies: PANACEA
Bel, Núria
The objective of PANACEA is to bring together different advanced tools to build a Language Resources (LR) factory, a production line that automates the steps involved in the acquisition, production, updating and maintenance of the LRs that Machine Translation and other language technologies need.
</description>
<dc:date>2010-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20329">
<title>Web Services for Bayesian Learning</title>
<link>http://hdl.handle.net/10230/20329</link>
<description>Web Services for Bayesian Learning
Padró, Muntsa; Bel, Núria
In this demonstration we present our web services to perform Bayesian learning for classification tasks.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20328">
<title>Language Resources Factory: case study on the acquisition of Translation Memories</title>
<link>http://hdl.handle.net/10230/20328</link>
<description>Language Resources Factory: case study on the acquisition of Translation Memories
Poch, Marc; Toral, Antonio; Bel, Núria
This paper demonstrates a novel distributed architecture to facilitate the acquisition of Language Resources. We build a factory that automates the stages involved in the acquisition, production, updating and maintenance of these resources. The factory is designed as a platform where functionalities are deployed as web services, which can be combined in complex acquisition chains using workflows. We show a case study, which acquires a Translation Memory for a given pair of languages and a domain using web services for crawling, sentence alignment and conversion to TMX.
</description>
<dc:date>2012-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20327">
<title>Interoperability and Technology for a Language Resources Factory</title>
<link>http://hdl.handle.net/10230/20327</link>
<description>Interoperability and Technology for a Language Resources Factory
Poch, Marc; Bel, Núria
This document describes some of the technological aspects of a project devoted to the creation of a factory for language resources. The project’s objectives are explained, as well as the idea to create a distributed infrastructure of web services. This document focuses on two main topics of the factory: (1) the technological approaches chosen to develop the factory, i.e. software, protocols, servers, etc., and (2) interoperability, since the main challenge is to allow different NLP tools to work together in the factory. This document explains why XCES and GrAF are chosen as the main formats used for the linguistic data exchange.
</description>
<dc:date>2011-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20326">
<title>A Method Towards the Fully Automatic Merging of Lexical Resources</title>
<link>http://hdl.handle.net/10230/20326</link>
<description>A Method Towards the Fully Automatic Merging of Lexical Resources
Bel, Núria; Padró, Muntsa; Necsulescu, Silvia
Lexical Resources are a critical component for Natural Language Processing applications. However, the high cost of comparing and merging different resources has been a bottleneck to obtaining richer resources and a broader range of potential uses for a significant number of languages. With the objective of reducing cost by eliminating human intervention, we present a new method towards the automatic merging of resources. This method includes both the automatic mapping of the resources involved to a common format and their merging once in this format. This paper presents how we have addressed the merging of two verb subcategorization frame lexica for Spanish, but our method will be extended to cover other types of Lexical Resources. The results achieved, which almost replicate human work, demonstrate the feasibility of the approach.
</description>
<dc:date>2011-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/20325">
<title>Automatic Detection of Non-deverbal Event Nouns for Quick Lexicon Production</title>
<link>http://hdl.handle.net/10230/20325</link>
<description>Automatic Detection of Non-deverbal Event Nouns for Quick Lexicon Production
Bel, Núria; Coll, Maria; Resnik, Gabriela
In this work we present the results of experimental work on the development of lexical class-based lexica by automatic means. Our purpose is to assess the use of linguistic lexical-class based information as a feature selection methodology for the use of classifiers in quick lexical development. The results show that the approach can help reduce the human effort required in the development of language resources significantly.
</description>
<dc:date>2010-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/16994">
<title>Towards a user-friendly webservice architecture for statistical machine translation in the PANACEA project</title>
<link>http://hdl.handle.net/10230/16994</link>
<description>Towards a user-friendly webservice architecture for statistical machine translation in the PANACEA project
Toral, Antonio; Pecina, Pavel; Way, Andy; Poch, Marc
This paper presents a webservice architecture for Statistical Machine Translation aimed at non-technical users. A workflow editor allows a user to combine different webservices using a graphical user interface. In the current state of this project, the webservices have been implemented for a range of sentential and sub-sentential aligners. A common interface and a common data format allow the user to build workflows in which different aligners can be exchanged.
Paper presented at the 15th Conference of the European Association for Machine Translation, held in Leuven, Belgium, on 30 and 31 May 2011.
</description>
<dc:date>2011-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/16982">
<title>Semántica de los adverbios de modo celerativos</title>
<link>http://hdl.handle.net/10230/16982</link>
<description>Semántica de los adverbios de modo celerativos
Torner Castells, Sergi
This article analyses the semantics of Spanish manner adverbs that indicate speed (deprisa, despacio, rápidamente, lentamente, etc.). The modification carried out by these adverbs, which we call celerative (celerativos), can bear on different aspects of sentence meaning. The main objective of this work is therefore to identify their possible readings in order to propose a formal representation of the meaning of each of the units analysed; a general semantic characterization of this set of modifiers is also offered. Three subclasses of celerative adverbs are distinguished: those that modify an event argument, those that modify a temporal argument, and those that establish a relation between two events.
</description>
<dc:date>2003-01-01T00:00:00Z</dc:date>
</item>
<item rdf:about="http://hdl.handle.net/10230/16967">
<title>La duración consonántica en castellano</title>
<link>http://hdl.handle.net/10230/16967</link>
<description>La duración consonántica en castellano
Barrio Estévez, Laura del; Torner Castells, Sergi
This paper presents the results of an experimental analysis of the duration of Spanish consonants. The influence of three different factors (stress, location in the syllable, and pause vicinity) on the duration of consonants in reading text has been studied. Phonetic context and speech rate have also been controlled. The obtained results seem to indicate that the vicinity of a pause and the location of the consonant in the syllable have a systematic effect on all the consonants, while the stress only affects the duration of a reduced set of voiced allophones ([l], [r] and [n]). Some general conclusions about the mean duration of each consonant, and about the relationship between duration and voicing, manner and place of articulation, have also been established.
</description>
<dc:date>1999-01-01T00:00:00Z</dc:date>
</item>
</rdf:RDF>
