The authors of the original article [1] would like to recognize the critical contribution of core members of the FANTOM5 Consortium, who played the critical role of HeliScopeCAGE sequencing experiments, quality control of tag reads and processing of the raw sequencing data.
Publications
2018
Attempts to develop drugs that address sepsis based on leads developed in animal models have failed. We sought to identify leads based on human data by exploiting a natural experiment: the relative resistance of children to mortality from severe infections and sepsis. Using public datasets, we identified key differences in pathway activity (Pathprint) in blood transcriptome profiles of septic adults and children. To find drugs that could promote beneficial (child) pathways or inhibit harmful (adult) ones, we built an in silico pathway drug network (PDN) using expression correlation between drug, disease, and pathway gene signatures across 58,475 microarrays. Specific pathway clusters from children or adults were assessed for correlation with drug-based signatures. Validation by literature curation and by direct testing in an endotoxemia model of murine sepsis of the most correlated drug candidates demonstrated that the Pathprint-PDN methodology is more effective at generating positive drug leads than gene-level methods (e.g., CMap). Pathway-centric Pathprint-PDN is a powerful new way to identify drug candidates for intervention against sepsis and provides direct insight into pathways that may determine survival.
A goal of genomics is to understand the relationships between biological processes. Pathways contribute to functional interplay within biological processes through complex but poorly understood interactions. However, limited functional references for global pathway relationships exist. Pathways from databases such as KEGG and Reactome provide discrete annotations of biological processes. Their relationships are currently either inferred from gene set enrichment within specific experiments, or by simple overlap, linking pathway annotations that have genes in common. Here, we provide a unifying interpretation of functional interaction between pathways by systematically quantifying coexpression between 1,330 canonical pathways from the Molecular Signatures Database (MSigDB) to establish the Pathway Coexpression Network (PCxN). We estimated the correlation between canonical pathways valid in a broad context using a curated collection of 3,207 microarrays from 72 normal human tissues. PCxN accounts for shared genes between annotations to estimate significant correlations between pathways with related functions rather than with similar annotations. We demonstrate that PCxN provides novel insight into mechanisms of complex diseases using an Alzheimer's Disease (AD) case study. PCxN retrieved pathways significantly correlated with an expert curated AD gene list. These pathways have known associations with AD and were significantly enriched for genes independently associated with AD. As a further step, we show how PCxN complements the results of gene set enrichment methods by revealing relationships between enriched pathways, and by identifying additional highly correlated pathways. PCxN revealed that correlated pathways from an AD expression profiling study include functional clusters involved in cell adhesion and oxidative stress. PCxN provides expanded connections to pathways from the extracellular matrix. PCxN provides a powerful new framework for interrogation of global pathway relationships. Comprehensive exploration of PCxN can be performed at http://pcxn.org/.
2017
BACKGROUND AND AIMS: Longstanding uncertainty surrounds the selection of surgical protocols for the closure of unilateral cleft lip and palate, and randomised trials have only rarely been performed. This paper is an introduction to three randomised trials of primary surgery for children born with complete unilateral cleft lip and palate (UCLP). It presents the protocol developed for the trials in CONSORT format, and describes the management structure that was developed to achieve the long-term engagement and commitment required to complete the project.
METHOD: Ten established national or regional cleft centres participated. Lip and soft palate closure at 3-4 months, and hard palate closure at 12 months served as a common method in each trial. Trial 1 compared this with hard palate closure at 36 months. Trial 2 compared it with lip closure at 3-4 months and hard and soft palate closure at 12 months. Trial 3 compared it with lip and hard palate closure at 3-4 months and soft palate closure at 12 months. The primary outcomes were speech and dentofacial development, with a series of perioperative and longer-term secondary outcomes.
RESULTS: Recruitment of 448 infants took place over a 9-year period, with 99.8% subsequent retention at 5 years.
CONCLUSION: The series of reports that follow this introductory paper include comparisons at age 5 of surgical outcomes, speech outcomes, measures of dentofacial development and appearance, and parental satisfaction. The outcomes recorded and the numbers analysed for each outcome and time point are described in the series.
TRIAL REGISTRATION: ISRCTN29932826.
Amyotrophic lateral sclerosis (ALS) is a devastating neurodegenerative disease that lacks a predictive and broadly applicable biomarker. Continued focus on mutation-specific upstream mechanisms has yet to predict disease progression in the clinic. Utilising cellular pathology common to the majority of ALS patients, we implemented an objective transcriptome-driven approach to develop noninvasive prognostic biomarkers for disease progression. Genes expressed in laser captured motor neurons in direct correlation (Spearman rank correlation, p < 0.01) with counts of neuropathology were developed into co-expression network modules. Screening modules using three gene sets representing rate of disease progression and upstream genetic association with ALS led to the prioritisation of a single module enriched for immune response to motor neuron degeneration. Genes in the network module are important for microglial activation and predict disease progression in genetically heterogeneous ALS cohorts: Expression of three genes in peripheral lymphocytes - LILRA2, ITGB2 and CEBPD - differentiate patients with rapid and slowly progressive disease, suggesting promise as a blood-derived biomarker. TREM2 is a member of the network module and the level of soluble TREM2 protein in cerebrospinal fluid is shown to predict survival when measured in late stage disease (Spearman rank correlation, p = 0.01). Our data-driven systems approach has, for the first time, directly linked microglia to the development of motor neuron pathology. LILRA2, ITGB2 and CEBPD represent peripherally accessible candidate biomarkers and TREM2 provides a broadly applicable therapeutic target for ALS.
The use of induced pluripotent stem cells (iPSC) derived from independent patients and sources holds considerable promise to improve the understanding of development and disease. However, optimized use of iPSC depends on our ability to develop methods to efficiently qualify cell lines and protocols, monitor genetic stability, and evaluate self-renewal and differentiation potential. To accomplish these goals, 57 stem cell lines from 10 laboratories were differentiated to 7 different states, resulting in 248 analyzed samples. Cell lines were differentiated and characterized at a central laboratory using standardized cell culture methodologies, protocols, and metadata descriptors. Stem cell and derived differentiated lines were characterized using RNA-seq, miRNA-seq, copy number arrays, DNA methylation arrays, flow cytometry, and molecular histology. All materials, including raw data, metadata, analysis and processing code, and methodological and provenance documentation are publicly available for re-use and interactive exploration at https://www.synapse.org/pcbc. The goal is to provide data that can improve our ability to robustly and reproducibly use human pluripotent stem cells to understand development and disease.
SOX5 encodes a transcription factor that is expressed in multiple tissues including heart, lung and brain. Mutations in SOX5 have been previously found in patients with amyotrophic lateral sclerosis (ALS) and developmental delay, intellectual disability and dysmorphic features. To characterize the neuronal role of SOX5, we silenced the Drosophila ortholog of SOX5, Sox102F, by RNAi in various neuronal subtypes in Drosophila. Silencing of Sox102F led to misorientated and disorganized michrochaetes, neurons with shorter dendritic arborization (DA) and reduced complexity, diminished larval peristaltic contractions, loss of neuromuscular junction bouton structures, impaired olfactory perception, and severe neurodegeneration in brain. Silencing of SOX5 in human SH-SY5Y neuroblastoma cells resulted in a significant repression of WNT signaling activity and altered expression of WNT-related genes. Genetic association and meta-analyses of the results in several large family-based and case-control late-onset familial Alzheimer's disease (LOAD) samples of SOX5 variants revealed several variants that show significant association with AD disease status. In addition, analysis for rare and highly penetrate functional variants revealed four novel variants/mutations in SOX5, which taken together with functional prediction analysis, suggests a strong role of SOX5 causing AD in the carrier families. Collectively, these findings indicate that SOX5 is a novel candidate gene for LOAD with an important role in neuronal function. The genetic findings warrant further studies to identify and characterize SOX5 variants that confer risk for AD, ALS and intellectual disability.
BACKGROUND: Alternative transcription start site (TSS) usage plays important roles in transcriptional control of mammalian gene expression. The growing interest in alternative TSSs and their role in genome diversification spawned many single-gene studies on differential usages of tissue-specific or temporal-specific alternative TSSs. However, exploration of the switching usage of alternative TSS usage on a genomic level, especially in the central nervous system, is largely lacking.
RESULTS: In this study, We have prepared a unique set of time-course data for the developing cerebellum, as part of the FANTOM5 consortium ( http://fantom.gsc.riken.jp/5/ ) that uses their innovative capturing of 5' ends of all transcripts followed by Helicos next generation sequencing. We analyzed the usage of all transcription start sites (TSSs) at each time point during cerebellar development that provided information on multiple RNA isoforms that emerged from the same gene. We developed a mathematical method that systematically compares the expression of different TSSs of a gene to identify temporal crossover and non-crossover switching events. We identified 48,489 novel TSS switching events in 5433 genes during cerebellar development. This includes 9767 crossover TSS switching events in 1511 genes, where the dominant TSS shifts over time.
CONCLUSIONS: We observed a relatively high prevalence of TSS switching in cerebellar development where the resulting temporally-specific gene transcripts and protein products can play important regulatory and functional roles.
2016
Cancer cell lines can be useful to model cancer stem cells. Infection with Mycoplasma species is an insidious problem in mammalian cell culture. While investigating stem-like properties in early passage melanoma cell lines, we noted poorly reproducible results from an aliquot of a cell line that was later found to be infected with Mycoplasma hyorhinis. Deliberate infection of other early passage melanoma cell lines aliquots induced variable and unpredictable effects on expression of putative cancer stem cell markers, clonogenicity, proliferation and global gene expression. Cell lines established in stem cell media (SCM) were equally susceptible. Mycoplasma status is rarely reported in publications using cultured cells to study the cancer stem cell hypothesis. Our work highlights the importance of surveillance for Mycoplasma infection while using any cultured cells to interrogate tumor heterogeneity.
The application of genomics technologies to medicine and biomedical research is increasing in popularity, made possible by new high-throughput genotyping and sequencing technologies and improved data analysis capabilities. Some of the greatest genetic diversity among humans, animals, plants, and microbiota occurs in Africa, yet genomic research outputs from the continent are limited. The Human Heredity and Health in Africa (H3Africa) initiative was established to drive the development of genomic research for human health in Africa, and through recognition of the critical role of bioinformatics in this process, spurred the establishment of H3ABioNet, a pan-African bioinformatics network for H3Africa. The limitations in bioinformatics capacity on the continent have been a major contributory factor to the lack of notable outputs in high-throughput biology research. Although pockets of high-quality bioinformatics teams have existed previously, the majority of research institutions lack experienced faculty who can train and supervise bioinformatics students. H3ABioNet aims to address this dire need, specifically in the area of human genetics and genomics, but knock-on effects are ensuring this extends to other areas of bioinformatics. Here, we describe the emergence of genomics research and the development of bioinformatics in Africa through H3ABioNet.