High throughput computing in bioinformatics: workflows, containers and emerging paradigms
Loading...
Date
2018
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
University of the Western Cape
Abstract
Next Generation Sequencing has brought genomic analysis within the range of a great number of laboratories, while increasing the demand for bioinformatic analysis. These typically comprise workflows composed out of chains of analyses with data flowing between workflow steps. Such analysis is amenable to High Throughput Computing, a form of high performance computing characterised by a focus on overall analysis throughput rather than optimisation of a single application. In recent years workflow languages and container technologies have become a key part in composing efficient, reproducible and re-usable bionformatic workflows. These technologies, however, pose a challenge for High Performance Computing providers as they require different characteristics from an execution environment to that provided by traditional HPC clusters. These challenges will be discussed and some approaches to solving them will be discussed.
Description
Keywords
High Throughput Computing, Bioinformatics, Infrastructure Engineering, Asset Management: Applied Computer Science
Citation
Van Heusden, Peter (2018). High Throughput Computing in bioinformatics: workflows, containers and emerging paradigms. University of Western Cape. Presentation. https://doi.org/10.25379/uwc.7438616.v1