MasterOfPores: A workflow for the analysis of Oxford nanopore direct RNA sequencing datasets

dc.contributor.authorCozzuto, Luca
dc.contributor.authorLiu, Huanle
dc.contributor.authorPryszcz, Leszek Piotr, 1985-
dc.contributor.authorHermoso Pulido, Antonio
dc.contributor.authorDelgado-Tejedor, Anna
dc.contributor.authorPonomarenko, Julia
dc.contributor.authorNovoa, Eva Maria
dc.date.accessioned2020-05-07T11:02:55Z
dc.date.available2020-05-07T11:02:55Z
dc.date.issued2020
dc.description.abstractThe direct RNA sequencing platform offered by Oxford Nanopore Technologies allows for direct measurement of RNA molecules without the need of conversion to complementary DNA, fragmentation or amplification. As such, it is virtually capable of detecting any given RNA modification present in the molecule that is being sequenced, as well as provide polyA tail length estimations at the level of individual RNA molecules. Although this technology has been publicly available since 2017, the complexity of the raw Nanopore data, together with the lack of systematic and reproducible pipelines, have greatly hindered the access of this technology to the general user. Here we address this problem by providing a fully benchmarked workflow for the analysis of direct RNA sequencing reads, termed MasterOfPores. The pipeline starts with a pre-processing module, which converts raw current intensities into multiple types of processed data including FASTQ and BAM, providing metrics of the quality of the run, quality-filtering, demultiplexing, base-calling and mapping. In a second step, the pipeline performs downstream analyses of the mapped reads, including prediction of RNA modifications and estimation of polyA tail lengths. Four direct RNA MinION sequencing runs can be fully processed and analyzed in 10 h on 100 CPUs. The pipeline can also be executed in GPU locally or in the cloud, decreasing the run time fourfold. The software is written using the NextFlow framework for parallelization and portability, and relies on Linux containers such as Docker and Singularity for achieving better reproducibility. The MasterOfPores workflow can be executed on any Unix-compatible OS on a computer, cluster or cloud without the need of installing any additional software or dependencies, and is freely available in Github (https://github.com/biocorecrg/master_of_pores). This workflow simplifies direct RNA sequencing data analyses, facilitating the study of the (epi)transcriptome at single molecule resolution.
dc.description.sponsorshipThis work was partly supported by the Spanish Ministry of Economy, Industry and Competitiveness (MEIC) (PGC2018-098152-A-100 to EN) and by the Australian Research Council (DP180103571 to EN). LP was supported by funding from the European Union’s H2020 Research and Innovation Programme under Marie Skłodowska-Curie grant agreement no. 754422
dc.format.mimetypeapplication/pdf
dc.identifier.citationCozzuto L, Liu H, Pryszcz LP, Pulido TH, Delgado-Tejedor A, Ponomarenko J et al. MasterOfPores: A workflow for the analysis of Oxford nanopore direct RNA sequencing datasets. Front Genet. 2020 Mar 17; 11:211. DOI: 10.3389/fgene.2020.00211
dc.identifier.doihttp://dx.doi.org/10.3389/fgene.2020.00211
dc.identifier.issn1664-8021
dc.identifier.urihttp://hdl.handle.net/10230/44450
dc.language.isoeng
dc.publisherFrontiers Media
dc.relation.ispartofFrontiers in Genetics. 2020 Mar 17; 11:211
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/754422
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/2PE/PGC2018-098152-A-100
dc.rights© 2020 by Luca Cozzuto et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.otherGenètica
dc.subject.otherSeqüència de nucleòtids
dc.subject.otherNanotecnologia
dc.titleMasterOfPores: A workflow for the analysis of Oxford nanopore direct RNA sequencing datasets
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
Coz_FG_Mas.pdf
Mida:
5.31 MB
Format:
Adobe Portable Document Format

Llicència

Drets