Harrow, JenniferNagy, AlindaReymond, AlexandreAlioto, TylerPatthy, LaszloAntonarakis, Stylianos E.Guigó Serra, Roderic2011-11-282011-11-282009Harrow J, Nagy A, Reymond A, Alioto T, Patthy L, Antonarakis S E, Guigó R. Identifying protein-coding genes in genomic sequences. Genome Biology. 2009;10:201. DOI: 10.1186/gb-2009-10-1-2011465-6906http://hdl.handle.net/10230/13151The vast majority of the biology of a newly sequenced genome is inferred from the set of encoded proteins. Predicting this set is therefore invariably the first step after the completion of the genome DNA sequence. Here we review the main computational pipelines used to generate the human reference protein-coding gene sets.application/pdfeng© 2009 BioMed Central Ltd./nThe electronic version of this article is the complete one and can be found online at http://genomebiology.com/2009/10/1/201Biologia computacional -- MètodesSeqüència de nucleòtidsIdentifying protein-coding genes in genomic sequencesinfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1186/gb-2009-10-1-201AnimalsHumansProteinsGenesGenomeinfo:eu-repo/semantics/openAccess