High throughput computing in bioinformatics: workflows, containers and emerging paradigms

Loading...
Thumbnail Image

Date

2018

Journal Title

Journal ISSN

Volume Title

Publisher

University of the Western Cape

Abstract

Next Generation Sequencing has brought genomic analysis within the range of a great number of laboratories, while increasing the demand for bioinformatic analysis. These typically comprise workflows composed out of chains of analyses with data flowing between workflow steps. Such analysis is amenable to High Throughput Computing, a form of high performance computing characterised by a focus on overall analysis throughput rather than optimisation of a single application. In recent years workflow languages and container technologies have become a key part in composing efficient, reproducible and re-usable bionformatic workflows. These technologies, however, pose a challenge for High Performance Computing providers as they require different characteristics from an execution environment to that provided by traditional HPC clusters. These challenges will be discussed and some approaches to solving them will be discussed.

Description

Keywords

High Throughput Computing, Bioinformatics, Infrastructure Engineering, Asset Management: Applied Computer Science

Citation

Van Heusden, Peter (2018). High Throughput Computing in bioinformatics: workflows, containers and emerging paradigms. University of Western Cape. Presentation. https://doi.org/10.25379/uwc.7438616.v1

Collections