SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences

Citation

Agostini F, Cirillo D, Ponti RD, Tartaglia GG. SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences. BMC Genomics. 2014 Oct 23;15:925. DOI: 10.1186/1471-2164-15-925

Description

Abstract
BACKGROUND: The large amount of data produced by high-throughput sequencing poses new computational challenges. In the last decade, several tools have been developed for the identification of transcription and splicing factor binding sites. RESULTS: Here, we introduce the SeAMotE (Sequence Analysis of Motifs Enrichment) algorithm for the discovery of regulatory regions in nucleic acid sequences. SeAMotE provides (i) a robust analysis of high-throughput sequence sets, (ii) a motif search based on pattern occurrences and (iii) an easy-to-use web-server interface. We applied our method to recently published data, including 351 chromatin immunoprecipitation (ChIP) and 13 crosslinking immunoprecipitation (CLIP) experiments, and compared our results with those of other well-established motif discovery tools. SeAMotE shows an average accuracy of 80% in finding discriminative motifs and outperforms other methods available in the literature. CONCLUSIONS: SeAMotE is a fast, accurate and flexible algorithm for the identification of sequence patterns involved in protein-DNA and protein-RNA recognition. The server can be freely accessed at http://s.tartaglialab.com/new_submission/seamote.
DOI
http://dx.doi.org/10.1186/1471-2164-15-925
Collections
Articles (Departament de Medicina i Ciències de la Vida)
Articles (Center for Genomic Regulation (CRG))
Documents OpenAIRE (Open Access Infrastructure for Research in Europe)
