Cloud computing for genomic data analysis and collaboration

B Langmead, A Nellore - Nature Reviews Genetics, 2018 - nature.com
B Langmead, A Nellore
Nature Reviews Genetics, 2018nature.com
Next-generation sequencing has made major strides in the past decade. Studies based on
large sequencing data sets are growing in number, and public archives for raw sequencing
data have been doubling in size every 18 months. Leveraging these data requires
researchers to use large-scale computational resources. Cloud computing, a model
whereby users rent computers and storage from large data centres, is a solution that is
gaining traction in genomics research. Here, we describe how cloud computing is used in …
Abstract
Next-generation sequencing has made major strides in the past decade. Studies based on large sequencing data sets are growing in number, and public archives for raw sequencing data have been doubling in size every 18 months. Leveraging these data requires researchers to use large-scale computational resources. Cloud computing, a model whereby users rent computers and storage from large data centres, is a solution that is gaining traction in genomics research. Here, we describe how cloud computing is used in genomics for research and large-scale collaborations, and argue that its elasticity, reproducibility and privacy features make it ideally suited for the large-scale reanalysis of publicly available archived data, including privacy-protected data.
nature.com