학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

A scalable data analysis platform for metagenomics

Resource Type: Conference
Authors: Tang, Wei; Wilkening, Jared; Desai, Narayan; Gerlach, Wolfgang; Wilke, Andreas; Meyer, Folker
Source: 2013 IEEE International Conference on Big Data Big Data, 2013 IEEE International Conference on. :21-26 Oct, 2013
Subject: Computing and Processing
Electric shock
Servers
Data analysis
Pipelines
Bioinformatics
Throughput
data management system
workflow
metagenomics
bioinformatics
data analysis platform
cloud computing
Language

Online Access

Full Text (IEEE)

초록

With the advent of high-throughput DNA sequencing technology, the analysis and management of the increasing amount of biological sequence data has become a bottleneck for scientific progress. For example, MG-RAST, a metagenome annotation system serving a large scientific community worldwide, has experienced a sustained, exponential growth in data submissions for several years; and this trend is expected to continue. To address the computational challenges posed by this workload, we developed a new data analysis platform, including a data management system (Shock) for biological sequence data and a workflow management system (AWE) supporting scalable, fault-tolerant task and resource management. Shock and AWE can be used to build a scalable and reproducible data analysis infrastructure for upper-level biological data analysis services.

공지

DAU Library

학술논문

요약정보

A scalable data analysis platform for metagenomics

Online Access

초록