MultiDataSet: an R package for encapsulating multiple data sets with application to omic data integration

dc.contributor.authorHernandez-Ferrer, Carles, 1987-ca
dc.contributor.authorRuiz-Arenas, Carlosca
dc.contributor.authorBeltran-Gomila, Albaca
dc.contributor.authorGonzález Ruiz, Juan Ramónca
dc.date.accessioned2018-06-27T08:08:08Z
dc.date.available2018-06-27T08:08:08Z
dc.date.issued2017
dc.description.abstractBACKGROUND: Reduction in the cost of genomic assays has generated large amounts of biomedical-related data. As a result, current studies perform multiple experiments in the same subjects. While Bioconductor's methods and classes implemented in different packages manage individual experiments, there is not a standard class to properly manage different omic datasets from the same subjects. In addition, most R/Bioconductor packages that have been designed to integrate and visualize biological data often use basic data structures with no clear general methods, such as subsetting or selecting samples. RESULTS: To cover this need, we have developed MultiDataSet, a new R class based on Bioconductor standards, designed to encapsulate multiple data sets. MultiDataSet deals with the usual difficulties of managing multiple and non-complete data sets while offering a simple and general way of subsetting features and selecting samples. We illustrate the use of MultiDataSet in three common situations: 1) performing integration analysis with third party packages; 2) creating new methods and functions for omic data integration; 3) encapsulating new unimplemented data from any biological experiment.CONCLUSIONS: MultiDataSet is a suitable class for data integration under R and Bioconductor framework.
dc.description.sponsorshipThis work has been partly funded by the Spanish Ministry of Economy and Competitiveness (MTM2015-68140-R). CH-F was supported by a grant from European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no 308333 – the HELIX project. CR-A was supported by a FI fellowship from Catalan Government (#016FI_B 00272)
dc.format.mimetypeapplication/pdf
dc.identifier.citationHernandez-Ferrer C, Ruiz-Arenas C, Beltran-Gomila A, González JR. MultiDataSet: an R package for encapsulating multiple data sets with application to omic data integration. BMC Bioinformatics. 2017 Jan 17; 18(1): 36. DOI: 10.1186/s12859-016-1455-1
dc.identifier.doihttp://dx.doi.org/10.1186/s12859-016-1455-1
dc.identifier.issn1471-2105
dc.identifier.urihttp://hdl.handle.net/10230/34978
dc.language.isoeng
dc.publisherBioMed Centralca
dc.relation.ispartofBMC Bioinformatics. 2017 Jan 17;18(1):36
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/FP7/308333
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/1PE/MTM2015-68140-R
dc.rights© Carles Hernandez-Ferrer, Carlos Ruiz-Arenas, Alba Beltran-Gomila, Juan R. González. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/)
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.otherGenòmica
dc.subject.otherProgramari
dc.titleMultiDataSet: an R package for encapsulating multiple data sets with application to omic data integrationca
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Hern_BMC_Multi.pdf
Size:
613.52 KB
Format:
Adobe Portable Document Format

License

Rights