Enabling the processing of bioinformatics workflows where data is located through the use of cloud and container technologies
Loading...
Date
2019
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
University of the Western Cape
Abstract
The growing size of raw data and the lack of internet communication technology to
keep up with that growth is introducing unique challenges to academic researchers.
This is especially true for those residing in rural areas or countries with sub-par
telecommunication infrastructure. In this project I investigate the usefulness of cloud
computing technology, data analysis workflow languages and portable computation
for institutions that generate data. I introduce the concept of a software solution
that could be used to simplify the way that researchers execute their analysis on
data sets at remote sources, rather than having to move the data. The scope of this
project involved conceptualising and designing a software system to simplify the
use of a cloud environment as well as implementing a working prototype of said
software for the OpenStack cloud computing platform. I conclude that it is possible
to improve the performance of research pipelines by removing the need for
researchers to have operating system or cloud computing knowledge and that utilising
technologies such as this can ease the burden of moving data.
Description
>Magister Scientiae - MSc
Keywords
Raw data, Internet communication, Rural areas, Telecommunication infrastructure, Software system