|
dc.contributor.author
|
Castellano Hereza, Sergi |
|
dc.contributor.other
|
Guigó Serra, Roderic |
|
dc.contributor.other
|
Universitat Pompeu Fabra. Departament de Ciències Experimentals i de la Salut |
|
dc.date.accessioned
|
2011-06-29T09:40:53Z |
|
dc.date.available
|
2011-06-29T09:40:53Z |
|
dc.date.issued
|
2004-07-23 |
|
dc.identifier.uri
|
http://hdl.handle.net/10230/11803 |
|
dc.description.abstract
|
Although the genome sequence and gene content are available for an increasing number of organisms, eukaryotic selenoproteins remain poorly characterized. In these proteins, selenium (Se) is incorporated in the form of selenocysteine(Sec), the 21st amino acid. Selenocysteine is cotranslationally inserted in response to UGA codons (a stop signal in the canonical genetic code). The alternative decoding is mediated by a stem-loop structure in the 3'UTR of selenoprotein mRNAs (the SECIS element). Selenium is implicated in male infertility, cancer and heart diseases, viral expression and ageing. In addition, most selenoproteins have homologues in which Sec is replaced by cysteine (Cys).Genome biologists rely on the high-quality annotation of genomes to bridge the gap from the sequence to the biology of the organism. However, for selenoproteins, which mediate the biological functions of selenium, the dual role of the UGA codon confounds both the automatic annotation pipelines and the human curators. In consequence, selenoproteins are misannotated in the majority of genome projects. Furthermore, the finding of novel selenoprotein families remains a difficult task in the newly released genome sequences.In the last few years, we have contributed to the exhaustive description of the eukaryotic selenoproteome (set of eukaryotic selenoproteins) through the development of a number of ad hoc computational tools. Our approach is based on the capacity of predicting SECIS elements, standard genes and genes with a UGA codon in-frame in one or multiple genomes. Indeed, the comparative analysis plays an essential role because 1) SECIS sequences are conserved between close species (eg. human-mouse); and 2) sequence conservation across a UGA codon between genomes at further phylogenetic distance strongly suggests a coding function (eg. human-fugu). Our analysis of the fly, human and Takifugu and Tetraodon genomes have resulted in 9 novel selenoprotein families. Therefore, 20 distinct selenoprotein families have been described in eukaryotes to date. Most of these families are widely (but not uniformly) distributed across eukaryotes, either as true selenoproteins or Cys-homologues.The correct annotation of selenoproteins is thus providing insight into the evolution of the usage of Sec. Our data indicate a discrete evolutionary distribution of selenoprotein in eukaryotes and suggest that, contrary to the prevalent thinking of an increase in the number of selenoproteins from less to more complex genomes, Sec-containing proteins scatter all along the complexity scale. We believe that the particular distribution of each family is mediated by an ongoing process of Sec/Cys interconversion, in which contingent events could play a role as important as functional constraints. The characterization of eukaryotic selenoproteins illustrates some of the most important challenges involved in the completion of the gene annotation of genomes. Notably among them, the increasing number of exceptions to our standard theory of the eukaryotic gene and the necessity of sequencing genomes at different evolutionary distances towards such a complete annotation. |
|
dc.format.mimetype
|
application/pdf |
|
dc.language.iso
|
eng |
|
dc.publisher
|
Universitat Pompeu Fabra |
|
dc.rights
|
info:eu-repo/semantics/openAccess |
|
dc.rights
|
ADVERTIMENT. La consulta d'aquesta tesi queda condicionada a l'acceptació de les següents condicions d'ús. La difusió d'aquesta tesi per mitjà del servei TDX ha estat autoritzada pels titulars dels drets de propietat intel·lectual únicament per a usos privats emmarcats en activitats d'investigació i docència. No s'autoritza la seva reproducció amb finalitats de lucre ni la seva difusió i posada a disposició des d'un lloc aliè al servei TDX. No s'autoritza la presentació del seu contingut en una finestra o marc aliè a TDX (framing). Aquesta reserva de drets afecta tant al resum de presentació de la tesi com als seus continguts. En la utilització o cita de parts de la tesi és obligat indicar el nom de la persona autora. |
|
dc.title
|
Towards the characterization of the eukaryotic selenoproteome: a computational approach |
|
dc.date.modified
|
2011-04-13T03:58:55Z |
|
dc.subject.keyword
|
aspectes genètics |
|
dc.subject.keyword
|
selenocisteïna |
|
dc.subject.keyword
|
processament de dades |
|
dc.subject.keyword
|
data procesing |
|
dc.subject.keyword
|
seqüències dels aminoàcids |
|
dc.subject.keyword
|
genetic aspects |
|
dc.subject.keyword
|
amino acid sequence |
|
dc.subject.keyword
|
selenocisteine |
|
dc.subject.keyword
|
575 - Genètica general. Citogenètica general. Immunogenètica. Evolució. Filogènia |
Show simple document record