Browsing by Author "Christoffels, Alan"
Now showing 1 - 20 of 90
Results Per Page
Sort Options
Item The African Coelecanth genome provides insights into tetrapod evolution(Macmillan Publishers, 2013) Christoffels, Alan; Hesse, Uljana; Gamieldien, Junaid; Panji, Sumir; Picone, Barbara; Van Heusden, PeterThe discovery of a living coelacanth specimen in 1938 was remarkable, as this lineage of lobe-finned fish was thought to have become extinct 70 million years ago. The modern coelacanth looks remarkably similar to many of its ancient relatives, and its evolutionary proximity to our own fish ancestors provides a glimpse of the fish that first walked on land. Here we report the genome sequence of the African coelacanth, Latimeria chalumnae. Through a phylogenomic analysis, we conclude that the lungfish, and not the coelacanth, is the closest living relative of tetrapods. Coelacanth protein-coding genes are significantly more slowly evolving than those of tetrapods, unlike other genomic features. Analyses of changes in genes and regulatory elements during the vertebrate adaptation to land highlight genes involved in immunity, nitrogen excretion and the development of fins, tail, ear, eye, brain and olfaction. Functional assays of enhancers involved in the fin-to-limb transition and in the emergence of extra-embryonic tissues show the importance of the coelacanth genome as a blueprint for understanding tetrapod evolution.Item Baobab LIMS: An open source biobank laboratory information management system for resource-limited settings(University of the Western Cape, 2019) Bendou, Hocine; Christoffels, AlanA laboratory information management system (LIMS) is central to the informatics infrastructure that underlies biobanking activities. To date, a wide range of commercial and open source LIMS are available. The decision to opt for one LIMS over another is often influenced by the needs of the biobank clients and researchers, as well as available financial resources. However, to find a LIMS that incorporates all possible requirements of a biobank may often be a complicated endeavour. The need to implement biobank standard operation procedures as well as stimulate the use of standards for biobank data representation motivated the development of Baobab LIMS, an open source LIMS for Biobanking. Baobab LIMS comprises modules for biospecimen kit assembly, shipping of biospecimen kits, storage management, analysis requests, reporting, and invoicing. Baobab LIMS is based on the Plone web-content management framework, a server-client-based system, whereby the end user is able to access the system securely through the internet on a standard web browser, thereby eliminating the need for standalone installations on all machines. The Baobab LIMS components were tested and evaluated in three human biobanks. The testing of the LIMS modules aided in the mapping of the biobanks requirements to the LIMS functionalities, and furthermore, it helped to reveal new user suggestions, such as the enhancement of the online documentation. The user suggestions are demonstrated to be important for both LIMS strengthen and biobank sustainability. Ultimately, the practical LIMS evaluations showed the ability of Boabab LIMS to be used in the management of human biobanks operations of relatively different biobanking workflows.Item Cardiovascular effects, molecular docking and chemo informatics analysis of compounds isolated from leonotis leonurus(University of the Western Cape, 2015) Sasi, Abd-Alkarim Nour-Addin; Kenechukwu, Obikeze; Christoffels, AlanLeonotis leonurus (L. Leonurus) has relatively abundant diterpenes and has been used as a traditional herbal medicine for treating several ailments including influenza, muscular cramps, skin related diseases, menstrual, antilipidemic, hyperglycaemia and hypertension. In this study, diterpenoid compounds such as; Dubiin, SaponifiedDubiin, Hispanol, Marrubiin and DC9 were isolated from L. Leonurus plant. The cardiovascular effects of these isolated compounds were investigated in order to determine the response of anaesthetised normotensive Wistar rats (in-vivo) to the compounds. Also, the druglikeness of the isolated diterpenoid compounds and their binding interaction with β1 adrenoceptor (PDB: 2Y04), angiotensin II receptor (Ang II) (PDB: 3R8A), Angiotensin converting enzyme (ACE) (PDB: 4XX3), and renin receptor (PDB: 2X8Z) by using molecular docking methods and Chemoinformatics analysis was performed (in-silico). Important molecular descriptors and molecular docking were used in our Chemoinformatics (in-silico) analysis to study the druglikeness and the binding affinity for of each molecule (Dubiin, SaponifiedDubiin, Hispanol, Marrubiin and DC9). The molecular descriptors and the binding energy were calculated by using the molecular operating environment software (MOE 2013). The lowest energy and highest cluster conformations of the molecules were further analysed. All the five (5) diterpenoids were predicted to have good oral bioavailability after oral administration and passed the BloodBrain Barrier (BBB) rules. Also, the compounds were predicted to have high probability of being good Druglike candidates, except for DC9, which is predicted to have lower possibilities of being Druglike candidate than the other diterpenoids. Furthermore, these compounds (Dubiin, SaponifiedDubiin, Hispanol, Marrubiin and DC9) were shown to interact with β1 adrenoceptors in-silico, an interaction that was confirmed in-vivo by increases in Blood pressure (SP, DP and MAP) and Heart rate (HR). In anaesthetized normotensive male Wistar rats (in-vivo), Dubiin (0.5 40mg/kg; IV), SaponifiedDubiin (0.5 60mg/kg; IV) Hispanol (0.5 40mg/kg; IV), DC9 (0.5 40mg/kg; IV) and Marrubiin (0.5 40mg/kg; IV) produced dose dependent increase in Systolic pressure (SP), Diastolic pressure (DP), and Mean arterial pressure (MAP) at all doses. Also, the compounds produced dose dependent increase in Heart rate (HR). From the in-vivo and in-silico studies it can be concluded that all the five (5) isolated diterpenoid compounds showed cardiovascular effects on Blood pressure (BP) and Heart rate (HR) by acting as β1 adrenoceptor agonists. Also, these diterpenoids compounds could be responsible for the cardiovascular effect observed in the methanol extracts from previous studies. These cardioactive compounds are prototype or ''lead compounds'' for designing and developing new nontoxic and effective drugs for cardiovascular disease (CVD) treatment.Item Challenges of biobanking in South Africa to facilitate indigenous research in an environment burdened with human immunodeficiency virus, tuberculosis, and emerging non-communicable diseases(Mary Ann Liebert, Inc., 2013) Abayomi, Akin; Christoffels, Alan; Grewal, Ravnit; Karam, Locunda A.; Rossouw, Catherine; Staunton, Ciara; Swanepoel, Carmen; van Rooyen, BeverleyThe high burden of infectious diseases and the growing problem of noncommunicable and metabolic disease syndromes in South Africa (SA) forces a more focused research approach to facilitate cutting-edge scientific growth and public health development. Increased SA research on these diseases and syndromes and the collection of associated biospecimens has ensured a plethora of biobanks created by individuals, albeit without the foresight of prospective and collective use by other local and international researchers. As the need for access to high-quality specimens in statistically relevant numbers has increased, so has the necessity for the development of national human biobanks in SA and across the Continent. The prospects of achieving sustainable centralized biobanks are still an emerging and evolving concept, primarily and recently driven by the launch of the H3Africa consortium, which includes the development of harmonized and standardized biobanking operating procedures. This process is hindered by a myriad of complex societal considerations and ethico-legal challenges. Efforts to consolidate and standardize biological sample collections are further compromised by the lack of full appreciation by national stakeholders of the biological value inherent in these collections, and the availability of high quality human samples with well-annotated data for future scientific research and development. Inadequate or nonexistent legislative structures that specifically regulate the storage, use, dispersal, and disposal of human biological samples are common phenomena and pose further challenges. Furthermore, concerns relating to consent for unspecific future uses, as well as access to information and data protection, are all new paradigms that require further consideration and public engagement. This article reviews important fundamental issues such as governance, ethics, infrastructure, and bioinformatics that are important foundational prerequisites for the establishment and evolution of successful human biobanking in South Africa.Item Cheminformatic approaches to hit-prioritization and target prediction of potential anti-mrsa natural products(University of Western Cape, 2020) Oselusi, Samson Olaitan; Egieyeh, Samuel; Christoffels, AlanThe growing resistance of Methicillin-Resistant Staphylococcus aureus (MRSA) to currently prescribed drugs has resulted in the failure of prevention and treatment of different infections caused by the superbug. Therefore, to keep pace with the resistance, there is a pressing need for novel antimicrobial agents, especially from non-conventional sources. Several natural products (NPs) have displayed varying in vitro activities against the pathogen but few of these natural compounds have been studied for their prospects to be potential antimicrobial drug candidates. This may be due to the high cost, tedious, and time-consuming process of conducting the important preclinical tests on these compounds. Hence, there is a need for cost-effective strategies for mining the available data on these natural compounds. This would help to get the knowledge that may guide rational prioritization of “likely to succeed” natural compounds to be developed into potential antimicrobial drug candidates.Item Cheminformatic Characterization of Natural Antimicrob al Products for the Development of New Lead Compounds(MDPI, 2021) Egieyeh, Samuel Ayodele; Oselusi, Samson Olaitan; Christoffels, AlanThe growing antimicrobial resistance (AMR) of pathogenic organisms to currently pre- scribed drugs has resulted in the failure to treat various infections caused by these superbugs. Therefore, to keep pace with the increasing drug resistance, there is a pressing need for novel antimicrobial agents, especially from non-conventional sources. Several natural products (NPs) have been shown to display promising in vitro activities against multidrug-resistant pathogens. Still, only a few of these compounds have been studied as prospective drug candidates. This may be due to the expensive and time-consuming process of conducting important studies on these compounds. The present review focuses on applying cheminformatics strategies to characterize, prioritize, and optimize NPs to develop new lead compounds against antimicrobial resistance pathogens. Moreover, case studies where these strategies have been used to identify potential drug candidates, including a few selected open-access tools commonly used for these studies, are briefly outlined.Item Cheminformatic Profiling and Hit Prioritization of Natural Products with Activities against Methicillin-Resistant Staphylococcus aureus (MRSA)(Molecules, 2021-06) Oselusi, Samson, O; Egieyeh, Samuel, A; Christoffels, AlanSeveral natural products (NPs) have displayed varying in vitro activities against methicillin-resistant Staphylococcus aureus (MRSA). However, few of these compounds have not been developed into potential antimicrobial drug candidates. This may be due to the high cost and tedious and time-consuming process of conducting the necessary preclinical tests on these compounds. In this study, cheminformatic profiling was performed on 111 anti-MRSA NPs (AMNPs), using a few orally administered conventional drugs for MRSA (CDs) as reference, to identify compounds with prospects to become drug candidates. This was followed by prioritizing these hits and identifying the liabilities among the AMNPs for possible optimization. Cheminformatic profiling revealed that most of the AMNPs were within the required drug-like region of the investigated properties. For example, more than 76% of the AMNPs showed compliance with the Lipinski, Veber, and Egan predictive rules for oral absorption and permeability. About 34% of the AMNPs showed the prospect to penetrate the blood–brain barrier (BBB), an advantage over the CDs, which are generally non-permeant of BBB. The analysis of toxicity revealed that 59% of the AMNPs might have negligible or no toxicity risks. Structure–activity relationship (SAR) analysis revealed chemical groups that may be determinants of the reported bioactivity of the compounds. A hit prioritization strategy using a novel “desirability scoring function” was able to identify AMNPs with the desired drug-likeness. Hit optimization strategies implemented on AMNPs with poor desirability scores led to the design of two compounds with improved desirability scores.Item Cheminformatic profiling and hit prioritization of natural products with activities against Methicillin-Resistant Staphylococcus Aureus (MRSA)(MPDI, 2021) Oselusi, Samson O.; Egieyeh, Samuel A.; Christoffels, AlanSeveral natural products (NPs) have displayed varying in vitro activities against methicillinresistant Staphylococcus aureus (MRSA). However, few of these compounds have not been developed into potential antimicrobial drug candidates. This may be due to the high cost and tedious and time-consuming process of conducting the necessary preclinical tests on these compounds. In this study, cheminformatic profiling was performed on 111 anti-MRSA NPs (AMNPs), using a few orally administered conventional drugs for MRSA (CDs) as reference, to identify compounds with prospects to become drug candidates. This was followed by prioritizing these hits and identifying the liabilities among the AMNPs for possible optimization. Cheminformatic profiling revealed that most of the AMNPs were within the required drug-like region of the investigated properties. For example, more than 76% of the AMNPs showed compliance with the Lipinski, Veber, and Egan predictive rules for oral absorption and permeability. About 34% of the AMNPs showed the prospect to penetrate the blood–brain barrier (BBB), an advantage over the CDs, which are generally non-permeant of BBB. The analysis of toxicity revealed that 59% of the AMNPs might have negligible or no toxicity risks. Structure–activity relationship (SAR) analysis revealed chemical groups that may be determinants of the reported bioactivity of the compounds. A hit prioritization strategy using a novel “desirability scoring function” was able to identify AMNPs with the desired drug-likeness.Item Chromosomal-level assembly of the Asian seabass genome using long sequence reads and multi-layered scaffolding(Public Library of Science, 2016) Vij, Shubha; Van Heusden, Peter; Christoffels, Alan; Mbandi, Stanley K.; Mwangi, SarahWe report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species’ native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics.Item Coding of tsetse repellents by olfactory sensory neurons: towards the improvement and the development of novel(University of the Western Cape, 2020) Souleymane, Diallo; Christoffels, AlanTsetse flies are the biological vectors of human and animal trypanosomiasis and hence representant medical and veterinary importance. The sense of smell plays a significant role in tsetse and its ecological interaction, such as finding blood meal source, resting, and larvicidal sites and for mating. Tsetse olfactory behaviour can be exploited for their management; however, olfactory studies in tsetse flies are still fragmentary. Here in my PhD thesis, using scanning electron microscopy, electrophysiology, behaviour, bioinformatics and molecular biology techniques, I have investigated tsetse flies (Glossina fuscipes fuscipes) olfaction using behaviourally well studied odorants, tsetse repellent by comparing with attractant odour. Insect olfaction is mediated by olfactory sensory neurons (OSNs), located in olfactory sensilla, which are cuticular structures exposed to the environment through pore and create a platform for chemical communication. In the sensilla shaft the dendrite of OSNs are housed, which are protected by called the sensillum lymph produced by support cells and contains a variety of olfactory proteins, including the odorant binding protein (OBP) and chemosensory proteins (CSP). While on the dendrite of OSNs are expressed olfactory receptors. In my PhD, studies I tried to decipher the sense of smell in tsetse fly. In the second chapter, I demonstrated that G. f. fuscipes is equipped with diverse olfactory sensilla, that various from basiconic, trichoid and coeloconic. I also demonstrated, there is shape, length, number difference between sensilla types and sexual dimorphism. There is a major difference between male and female, while male has the unique basiconic sensilla, club shaped found in the pits, which is absent from female pits. In my third chapter, I investigated the odorant receptors which are expressed on the dendrite of the olfactory sensory neurons (OSNs). G. f. fuscipes has 42 ORs, which were not functionally characterised. I used behaviourally well studied odorants, tsetse repellents, composed of four components blend. I demonstrated that tsetse repellent is also a strong antifeedant for both G. pallidipes and G. f. fuscipes using feeding bioassays as compared to the attractant odour, adding the value of tsetse repellent. However, the attractant odour enhanced the feeding index. Using DREAM (deorphanization of receptors based on expression alterations of mRNA levels). I found that in G. f. fuscipes, following a short in vivo exposure to the individual tsetse repellent component as well as an attractant volatile chemical, OSNs that respond to these compounds altered their mRNA expression in two opposite direction, significant downregulation and upregulation in their number of transcripts corresponding to the OR that they expressed and interacted with odorant. Also, I found that the odorants with opposite valence already segregate distinctly at the cellular and molecular target at the periphery, which is the reception of odorants by OSNs, which is the basis of sophisticated olfactory behaviour. Deorphanization of ORs in none model insect is a challenge, here by combining DREAM with molecular dynamics, as docking score, physiology and homology modelling with Drosophila a well-studied model insects, I was able to predict putative receptors of the tsetse repellent components and an attractant odour. However, many ORs were neutral, showing they were not activated by the odorants, demonstrating the selectivity of the technique as well as the receptors. In my fourth chapter, I investigated the OBPs structures and their interaction with odorants molecules. I demonstrated that OBPs are expressed both in the antenna, as well as in other tissues, such as legs. I also demonstrated that there are variations in the expression of OBPs between tissues as well as sexes. I also demonstrated that odorants induced a fast alteration in OBP mRNA expression, some odorants induced a decrease in the transcription of genes corresponding to the activated OBP and others increased the expression by many fold in OBPs in live insect, others were neutral after 5 hours of exposure. Moreover, with subsequent behavioural data showed that the behavioural response of G. f. fuscipes toward 1-octen-3-ol decreased significantly when 1-octen-3-ol putative OBPs were silenced with feeding of double-stranded RNA (dsRNA). In summary, our finding whereby odorant exposure affects the OBPs mRNA, their physiochemical properties and the silencing of these OBPs affected the behavioural response demonstrate that the OBPs are involved in odour detection that affect the percept of the given odorant. The expression of OBPs in olfactory tissues, antenna and their interaction with odorant and their effect on behavioural response when silenced shows their direct involvement in odour detection and reception. Furthermore, their expression in other tissues such as legs indicates they might also have role in other physiological functions, such as taste.Item Coding of tsetse repellents by olfactory sensory neurons: towards the improvement and the development of novel tsetse repellents(University of the Western Cape, 2020) Souleymane, Diallo; Christoffels, AlanTsetse flies are the biological vectors of human and animal trypanosomiasis and hence representant medical and veterinary importance. The sense of smell plays a significant role in tsetse and its ecological interaction, such as finding blood meal source, resting, and larvicidal sites and for mating. Tsetse olfactory behaviour can be exploited for their management; however, olfactory studies in tsetse flies are still fragmentary. Here in my PhD thesis, using scanning electron microscopy, electrophysiology, behaviour, bioinformatics and molecular biology techniques, I have investigated tsetse flies (Glossina fuscipes fuscipes) olfaction using behaviourally well studied odorants, tsetse repellent by comparing with attractant odour. Insect olfaction is mediated by olfactory sensory neurons (OSNs), located in olfactory sensilla, which are cuticular structures exposed to the environment through pore and create a platform for chemical communication. In the sensilla shaft the dendrite of OSNs are housed, which are protected by called the sensillum lymph produced by support cells and contains a variety of olfactory proteins, including the odorant binding protein (OBP) and chemosensory proteins (CSP). While on the dendrite of OSNs are expressed olfactory receptors. In my PhD, studies I tried to decipher the sense of smell in tsetse fly. In the second chapter, I demonstrated that G. f. fuscipes is equipped with diverse olfactory sensilla, that various from basiconic, trichoid and coeloconic. I also demonstrated, there is shape, length, number difference between sensilla types and sexual dimorphism. There is a major difference between male and female, while male has the unique basiconic sensilla, club shaped found in the pits, which is absent from female pits. In my third chapter, I investigated the odorant receptors which are expressed on the dendrite of the olfactory sensory neurons (OSNs). G. f. fuscipes has 42 ORs, which were not functionally characterised. I used behaviourally well studied odorants, tsetse repellents, composed of four components blend. I demonstrated that tsetse repellent is also a strong antifeedant for both G. pallidipes and G. f. fuscipes using feeding bioassays as compared to the attractant odour, adding the value of tsetse repellent. However, the attractant odour enhanced the feeding index. Using DREAM (deorphanization of receptors based on expression alterations of mRNA levels). I found that in G. f. fuscipes, following a short in vivo exposure to the individual tsetse repellent component as well as an attractant volatile chemical, OSNs that respond to these compounds altered their mRNA expression in two opposite direction, significant downregulation and upregulation in their number of transcripts corresponding to the OR that they expressed and interacted with odorant. Also, I found that the odorants with opposite valence already segregate distinctly at the cellular and molecular target at the periphery, which is the reception of odorants by OSNs, which is the basis of sophisticated olfactory behaviour. Deorphanization of ORs in none model insect is a challenge, here by combining DREAM with molecular dynamics, as docking score, physiology and homology modelling with Drosophila a well-studied model insects, I was able to predict putative receptors of the tsetse repellent components and an attractant odour. However, many ORs were neutral, showing they were not activated by the odorants, demonstrating the selectivity of the technique as well as the receptors. In my fourth chapter, I investigated the OBPs structures and their interaction with odorants molecules. I demonstrated that OBPs are expressed both in the antenna, as well as in other tissues, such as legs. I also demonstrated that there are variations in the expression of OBPs between tissues as well as sexes. I also demonstrated that odorants induced a fast alteration in OBP mRNA expression, some odorants induced a decrease in the transcription of genes corresponding to the activated OBP and others increased the expression by many fold in OBPs in live insect, others were neutral after 5 hours of exposure. Moreover, with subsequent behavioural data showed that the behavioural response of G. f. fuscipes toward 1-octen-3-ol decreased significantly when 1-octen-3-ol putative OBPs were silenced with feeding of double-stranded RNA (dsRNA). In summary, our finding whereby odorant exposure affects the OBPs mRNA, their physiochemical properties and the silencing of these OBPs affected the behavioural response demonstrate that the OBPs are involved in odour detection that affect the percept of the given odorant. The expression of OBPs in olfactory tissues, antenna and their interaction with odorant and their effect on behavioural response when silenced shows their direct involvement in odour detection and reception. Furthermore, their expression in other tissues such as legs indicates they might also have role in other physiological functions, such as taste.Item Coding of tsetse repellents by olfactory sensory neurons: towards the improvement and the development of novel tsetse repellents(University of Western Cape, 2021) Souleymane, Diallo; Christoffels, AlanTsetse flies are the biological vectors of human and animal trypanosomiasis and hence representant medical and veterinary importance. The sense of smell plays a significant role in tsetse and its ecological interaction, such as finding blood meal source, resting, and larvicidal sites and for mating. Tsetse olfactory behaviour can be exploited for their management; however, olfactory studies in tsetse flies are still fragmentary. Here in my PhD thesis, using scanning electron microscopy, electrophysiology, behaviour, bioinformatics and molecular biology techniques, I have investigated tsetse flies (Glossina fuscipes fuscipes) olfaction using behaviourally well studied odorants, tsetse repellent by comparing with attractant odour. Insect olfaction is mediated by olfactory sensory neurons (OSNs), located in olfactory sensilla, which are cuticular structures exposed to the environment through pore and create a platform for chemical communication.Item Comparative analysis of testis and ovary transcriptomes in zebrafish by combining experimental and computational tools(Wiley, 2004) Li, Yang; Chia, Jer, M; Bartfai, Richard; Christoffels, Alan; Yue, Gen, H; Ding, Ke; Ho, Mei, Y; Hill, James, A; Stupka, Elia; Orban, LaszloStudies on the zebrafish model have contributed to our understanding of several important developmental processes, especially those that can be easily studied in the embryo. However, knowledge on late events such as gonad differentiation in the zebrafish is still limited. Here an analysis on the gene sets is expressed in the adult zebrafish testis and ovary in an attempt to identify genes with potential role in (zebra)fish gonad development and function. We produced 10 533 expressed sequence tags (ESTs) from zebrafish testis or ovary and downloaded an additional 23 642 gonad-derived sequences from the zebrafish EST database. We clustered these sequences together with over 13 000 kidney-derived zebrafish ESTs to study partial transcriptomes for these three organs. We searched for genes with gonad-specific expression by screening macroarrays containing at least 2600 unique cDNA inserts with testis-, ovary- and kidney-derived cDNA probes. Clones hybridizing to only one of the two gonad probes were selected, and subsequently screened with computational tools to identify 72 genes with potentially testis-specific and 97 genes with potentially ovary-specific expression, respectively. PCR-amplification confirmed gonad-specificity for 21 of the 45 clones tested (all without known function). Our study, which involves over 47 000 EST sequences and specialized cDNA arrays, is the first analysis of adult organ transcriptomes of zebrafish at such a scale. The study of genes expressed in adult zebrafish testis and ovary will provide useful information on regulation of gene expression in teleost gonads and might also contribute to our understanding of the development and differentiation of reproductive organs in vertebrates.Item A comparative genomics approach towards classifying immunity-related proteins in the tsetse fly(2009) Mpondo, Feziwe; Hide, Winston; Christoffels, AlanTsetse flies (Glossina spp) are vectors of African trypanosome (Trypanosoma spp) parasites, causative agents of Human African trypanosomiasis (sleeping sickness) and Nagana in livestock. Research suggests that tsetse fly immunity factors are key determinants in the success and failure of infection and the maturation process of parasites. An analysis of tsetse fly immunity factors is limited by the paucity of genomic data for Glossina spp. Nevertheless, completely sequenced and assembled genomes of Drosophila melanogaster, Anopheles gambiae and Aedes aegypti provide an opportunity to characterize protein families in species such as Glossina by using a comparative genomics approach. In this study we characterize thioester-containing proteins (TEPs), a sub-family of immunity-related proteins, in Glossina by leveraging the EST data for G.morsitans and the genomic resources of D. melanogaster, A. gambiae as well as A.aegypti.A total of 17 TEPs corresponding to Drosophila (four TEPs), Anopheles (eleven TEPs) and Aedes aegypti (two TEPs) were collected from published data supplemented with Genbank searches. In the absence of genome data for G. morsitans, 124 000 G.morsitans ESTs were clustered and assembled into 18 413 transcripts (contigs and singletons). Five Glossina contigs (Gmcn1115, Gmcn1116, Gmcn2398, Gmcn2281 and Gmcn4297) were identified as putative TEPs by BLAST searches. Phylogenetic analyses were conducted to determine the relationship of collected TEP proteins.Gmcn1115 clustered with DmtepI and DmtepII while Gmcn2398 is placed in a separate branch, suggesting that it is specific to G. morsitans.The TEPs are highly conserved within D. melanogaster as reflected in the conservation of the thioester domain, while only two and one TEPs in A. gambiae and A. aegypti thioester domain show conservation of the thioester domain suggesting that these proteins are subjected to high levels of selection. Despite the absence of a sequenced genome for G. morsitans, at least two putative TEPs where identified from EST data.Item A comparative genomics approach towards classifying immunity-related proteins in the tsetse fly(University of the western cape, 2009) Mpondo, Feziwe; Hide, Winston; Christoffels, AlanTsetse flies (Glossina spp) are vectors of African trypanosome (Trypanosoma spp) parasites, causative agents of Human African trypanosomiasis (sleeping sickness) and Nagana in livestock. Research suggests that tsetse fly immunity factors are key determinants in the success and failure of infection and the maturation process of parasites. An analysis of tsetse fly immunity factors is limited by the paucity of genomic data for Glossina spp. Nevertheless, completely sequenced and assembled genomes Drosophila melanogaster, Anopheles gambiae and Aedes aegypti provide an opportunity to characterize protein families in species such as G/ossiza by using a comparative genomics approach. In this study, we characterize thioester-containing proteins (TEPs), a sub-family of immunity-related proteins, in Glossinaby leveraging the EST data for G. morsitans and the genomic resources of D. melanogaster, A. gambiae as well as A. aegyptiItem Computational characterisation of DNA methylomes in mycobacterium tuberculosis Beijing hyper- and hypo-virulent strains(University of the Western Cape, 2014) Naidu, Alecia Geraldine; Christoffels, Alan; Gey van Pittius, NicoMycobacterium tuberculosis, the causative agent of tuberculosis, is estimated to infect approximately one-third of the world’s population and is responsible for around 2 million deaths per year. The disease is endemic in South Africa which has one of the world’s highest tuberculosis incidence and death rates. The M. tuberculosis Beijing genotype are characterised by having an enhanced virulence capability over other M. tuberculosis strains and are the predominant strain observed in the Western Cape of South Africa. DNA methylation is a largely untapped area of research in M.tuberculosis and has been poorly described in the literature especially given its connection to virulence despite it being well characterised along with its role in virulence in other pathogenic bacteria such as E.coli. The overall aim was to characterise a global DNA methylation profile for two M. tuberculosis Beijing strains, hyper-virulent and hypo-virulent, using single molecule real time sequencing data technology. Moreover, to determine if adenine methylation in promoter regions has a possible functional role. This study identified and characterised the DNA methylation profile at the single nucleotide resolution in these strains using Pacific Biosciences single molecule real time sequencing data. A computational approach was used to discern DNA methylation patterns between the hyper and hypo-virulent strains with a view of understanding virulence in the hyper-virulent strain. Methylated motifs, which belong to known Restriction Modification (RM) systems of the H37Rv referencegenome were also identified. N6-methyladenine (m6A) and N4-methlycytosine (m4C) loci were identified in both strains. m6A were idenitified in both strains occuring within the following sequence motifs CACGCAG (Type II RM system), GATNNNNRTAC/GTAYNNNNATC (Type I RM system), while the CTGGAGGA motif was found to be uniquley methylated in the hyper-virulentstrain.Interestingly, the CACGCAG motif was significantly methylated (p = 9.9 x10 -63) at a higher proportion in intergenic regions (~70%) as opposed to genic regions in both the hyper-virulent and hypo-virulent strains suggesting a role in gene regulation. There appeared to be a higher proportion of m6A occuring in intergenic regions compared to within genes for hyper-virulent (61%) and hypo-virulent (62%) strains. The genic proportion revealed that 35% of total m6A occurred uniquely within genes for the hyper-virulent strain while 27.9% for uniquely methylated genes in hypo-virulent strain.Item A computational characterisation of the relationship between genome structure and disease genes(University of the Western Cape, 2012) Kibler, Tracey Deborah; Tiffin, Nicki; Christoffels, AlanThis is a pilot study to investigate the relationship between disease gene status and the structure of the human genome with specific reference to regions of recombination. It compares certain characteristics of a control set of genes, with no reported association or function in any known disease, with a second set of well-curated genes with a known association to a disease. One of the benefits of recombination is the introduction of new combinations of genetic variation in the genome. Recombination hotspots are regions on the chromosome where higher than normal frequencies of breaking and rejoining between homologous chromosomes occur during meiosis. The hotspot regions exhibit both a non-random distribution across the human genome and varying frequencies of breaking and rejoining. The study analyzed a set of features that represent general properties of human genes; namely base composition (percentage GC content), genetic variation (single nucleotide polymorphisms - SNPs), gene length, and positional effect (distance from chromosome end), in both the disease-associated gene set and the control set. These features were linked to recombination hotspots in the human genome and the frequency of recombination at these hotspots. Descriptive statistics was used to determine differences between the occurrences of these features in disease-associated genes compared to the control set, as well as differences in the occurrence of these same features in subset of genes containing an internal recombination hotspot compared to the genes with no internal recombination hotspot. The study found that disease-associated genes are generally longer than those in the control set, which is consistent with previous studies. It also found that disease-associated genes are much more likely to contain a recombination hotspot than those genes with no disease association. The study did not, however, find any association between disease gene status and the other set of features; namely GC content, SNP numbers or the position of a gene on the chromosome. Further analysis of the data suggested that the increased probability of disease-associated genes containing a recombination hotspot is most likely an effect of longer gene length and that the presence of a recombination hotspot is not sufficient in its own right to cause disease gene status.Item Computational characterization of IRE-regulated genes in Glossina morsitans(University of Western Cape, 2013) Dashti, Zahra Jalali Sefid; Christoffels, AlanBlood feeding is a habit exhibited by many insects. Considering the devastating impact of these insects on human health, it is important to focus research on understanding the biology behind blood-feeding, disease transmission and host-pathogen interactions. Such knowledge would pave the way for developing efficient preventative measures. Iron an important element for species survival, is at the center of events controlling tsetse’s fitness and reproductive success. Hence, targeting genes involved in iron trafficking and sequestration would present possible means of preventing disease transmission. Considering the dynamic and multi-factorial nature of iron metabolism, a well-coordinated regulatory system is expected to be at work. Despite extensive literature on the mechanism of iron regulation and key factors responsible in maintaining its homeostasis in human, less attention has been given to understand such system in insects, especially the blood-feeding insects. The availability of the genome sequences for several insect disease vectors allows for a more detailed analysis on the identification and characterization of events controlling and preventing iron-induced toxicity following a blood-meal. The International Glossina Genome Initiative (IGGI) has coordinated the sequencing and annotation of the Glossina morsitans genome that has led to the identification of 12220 genes. This knowledge-base along with current understanding of the IRE system in regulating iron metabolism, allowed for investigating the UTRs of Glossina genes for the presence of these elements. Using a combination of motif enrichment and IRE-stem loop structure prediction, an IRE-mediated regulation was inferred for 150 genes, among which, 72 were identified with 5’-IREs and 78 with 3’-IREs. Of the identified IRE-regulated genes, the ferritin heavy chain and MRCK-alpha are the only known genes to have IREs, while the rest are novel genes for which putative roles in regulating iron levels in tsetse fly have been assigned in this study. Moreover, the functional inference of the identified genes further points to the enrichment of transcription and translation. Furthermore, several hypothetical proteins with no defined functions were identified to be IRE-regulated. These include TMP007137, TMP009128, TMP002546, TMP002921, TMP003628, TMP004581, TMP008259, TMP012389, TMP005219, TMP005827, TMP007908, TMP009332, TMP01- 3384, TMP009102, TMP010544, TMP010707, TMP004292, TMP006517, TMP014030, TMP009821 and TMP003060 for which an iron-regulatory mechanism of action may be inferred. We further report 26 IRE-regulated secreted proteins in Glossina, that present good candidates for further investigation pertaining to the development of novel vector control strategies. Using the predicted data on the identified IRE-regulated genes and their functional classification, we derived at 29 genes with putative roles in iron trafficking, where several unknown and hypothetical proteins are included. Thus a novel role is inferred for these genes in cellular binding and transport in the context of iron metabolism. It is therefore possible that these genes may have evolved in Glossina, such that they compensate for the absence of an IRE- regulated mechanism for transferrin. Additionally, we propose 14 IRE-regulated genes involved in immune and stress response, which may indeed play crucial roles at the host pathogen interface through their possible mechanisms of iron sequestration. Using the subcellular localization analysis, we further categorized the putative IRE regulated genes into several subcellular localizations, where the majority of genes were found within the nucleus and the cytosol. The detection of the conserved motifs in a set of genes, is an interesting yet sophisticated area of research, that allows for identifying either co-regulated or orthologous genes, while further providing support for the putative function of a set of genes that would otherwise remain uncharacterized. This is based on the notion that co-regulated genes are often coexpressed to carry out a specific function. As such, 14 regulatory elements were identified in the 5’- and 3’-UTRs of IRE-regulated genes, involved in embryonic development and reproduction, inflammation and immune response, signaling pathways and neurogenesis as well as DNA repair. This study further proposes several IRE-regulated genes as targets for micro-RNA regulation through identifying micro-RNA binding sites in their 3’UTRs. Using a motif clustering approach we clustered IRE-regulated genes based on the number of motifs they share. Significantly co-regulated genes sharing two or more motifs were determined as critical targets for future investigation. The expression map of IRE-regulated genes was analyzed to better understand the events taking place from 3 hours to 15 days following a blood meal. Re-analysis of Anopheles microarray chip showed the significant expression of three cell envelope and transport genes as early response and six as late response to a blood meal, which could indeed be assigned a putative role in iron trafficking. Genes identified in this study with implications in iron metabolism, whose timely expression allows for maintaining iron homeostasis, represent good targets for future work. Considering the important role of evolution in species adaptation to habits such as Hematophagy, it is of importance to identify evolutionary signatures associated with these changes. To distinguish between evolutionary forces that are specific to iron-metabolism in blood-feeding insects and those that are found in other insects, the IRE-regulated genes were clustered into orthologous groups using several blood feeding and non-blood feeding insect species. Assessment of different evolutionary scenarios using the Maximum Likelihood (ML) approach, points to variations in the evolution of IRE-regulated genes between the two insect groups, whereby several genes indicate an increased mutation rate in the BF-insect group relative to their non-blood feeding insect counterparts. These include TMP003602 (phosphoinositide3-kinase), TMP009157 (ubiquitin-conjugating enzyme9), TMP010317 (general transcription factor IIH subunit1), TMP011104 (serine-pyruvate mitochondrial), TMP013137 (pentatricopeptide Transcription and translation), TMP013886 (tRNA(uridine-2-o-)-methyl-transferase-trm7) and TMP014187 (mediator 100kD). Additionally, we have indicated the presence of positively selected sites within seven blood-feeding IRE-regulated genes namely TMP002520 (nucleoporin), TMP008942 (eukaryotic translation initiation factor 3), TMP009871(bruno-3 transcript) , TMP010317 (general transcription factor IIH subunit1), TMP010673 (ferritin heavy-chain protein), TMP011104 (serine-pyruvate mitochondrial) and TMP011448 (brain chitinase and chia). Thus the results of this study provides an in depth understanding of iron metabolism in Glossina morsitans and confers important targets for future validations based on which innovative control strategies may be designed.Item Computational characterization of iron metabolism in the tsetse disease vector, glossina morsitans: Ire stem-loops(BMC, 2016) Dashti, Zahra Jalali Sefid; Gamieldien, Junaid; Christoffels, AlanIron metabolism and regulation is an indispensable part of species survival, most importantly for blood feeding insects. Iron regulatory proteins are central regulators of iron homeostasis, whose binding to iron response element (IRE) stem-loop structures within the UTRs of genes regulate expression at the post-transcriptional level. Despite the extensive literature on themechanism of iron regulation in human, less attention has been given to insect and more specifically the blood feeding insects, where research has mainly focused on the characterization of ferritin and transferrin. We thus, examined the mechanism of iron homeostasis through a genome-wide computational identification of IREs and other enriched motifs in the UTRs of Glossina morsitans with the view to identify new IRE-regulated genes.Item A computational framework for transcriptome assembly and annotation in non-model organisms: the case of venturia inaequalis(University of the Western Cape, 2014) Kimbung, Stanley Mbandi; Christoffels, Alan; Jasper, D; Rees, GIn this dissertation three computational approaches are presented that enable optimization of reference-free transcriptome reconstruction. The first addresses the selection of bona fide reconstructed transcribed fragments (transfrags) from de novo transcriptome assemblies and annotation with a multiple domain co-occurrence framework. We showed that selected transfrags are functionally relevant and represented over 94% of the information derived from annotation by transference. The second approach relates to quality score based RNA-seq sub-sampling and the description of a novel sequence similarity-derived metric for quality assessment of de novo transcriptome assemblies. A detail systematic analysis of the side effects induced by quality score based trimming and or filtering on artefact removal and transcriptome quality is describe. Aggressive trimming produced incomplete reconstructed and missing transfrags. This approach was applied in generating an optimal transcriptome assembly for a South African isolate of V. inaequalis. The third approach deals with the computational partitioning of transfrags assembled from RNA-Seq of mixed host and pathogen reads. We used this strategy to correct a publicly available transcriptome assembly for V. inaequalis (Indian isolate). We binned 50% of the latter to Apple transfrags and identified putative immunity transcript models. Comparative transcriptomic analysis between fungi transfrags from the Indian and South African isolates reveal effectors or transcripts that may be expressed in planta upon morphogenic differentiation. These studies have successfully identified V. inaequalis specific transfrags that can facilitate gene discovery. The unique access to an in-house draft genome assembly allowed us to provide preliminary description of genes that are implicated in pathogenesis. Gene prediction with bona fide transfrags produced 11,692 protein-coding genes. We identified two hydrophobin-like genes and six accessory genes of the melanin biosynthetic pathway that are implicated in the invasive action of the appressorium. The cazyome reveals an impressive repertoire of carbohydrate degrading enzymes and carbohydrate-binding modules amongst which are six polysaccharide lyases, and the largest number of carbohydrate esterases (twenty-eight) known in any fungus sequenced to date