Publications

2017

Zhang, Peter, Emmanuel Dimont, Thomas Ha, Douglas J Swanson, FANTOM Consortium, Winston Hide, and Dan Goldowitz. [2017] 2017. “Relatively Frequent Switching of Transcription Start Sites During Cerebellar Development..” BMC Genomics 18(1):461. doi: 10.1186/s12864-017-3834-z.

BACKGROUND: Alternative transcription start site (TSS) usage plays important roles in transcriptional control of mammalian gene expression. The growing interest in alternative TSSs and their role in genome diversification spawned many single-gene studies on differential usages of tissue-specific or temporal-specific alternative TSSs. However, exploration of the switching usage of alternative TSS usage on a genomic level, especially in the central nervous system, is largely lacking.

RESULTS: In this study, We have prepared a unique set of time-course data for the developing cerebellum, as part of the FANTOM5 consortium ( http://fantom.gsc.riken.jp/5/ ) that uses their innovative capturing of 5' ends of all transcripts followed by Helicos next generation sequencing. We analyzed the usage of all transcription start sites (TSSs) at each time point during cerebellar development that provided information on multiple RNA isoforms that emerged from the same gene. We developed a mathematical method that systematically compares the expression of different TSSs of a gene to identify temporal crossover and non-crossover switching events. We identified 48,489 novel TSS switching events in 5433 genes during cerebellar development. This includes 9767 crossover TSS switching events in 1511 genes, where the dominant TSS shifts over time.

CONCLUSIONS: We observed a relatively high prevalence of TSS switching in cerebellar development where the resulting temporally-specific gene transcripts and protein products can play important regulatory and functional roles.

2016

Gedye, Craig, Tracy Cardwell, Nektaria Dimopoulos, Bee Shin Tan, Heather Jackson, Suzanne Svobodová, Matthew Anaka, Andreas Behren, Christopher Maher, Oliver Hofmann, Winston Hide, Otavia Caballero, Ian D Davis, and Jonathan Cebon. [2016] 2016. “Mycoplasma Infection Alters Cancer Stem Cell Properties in Vitro..” Stem Cell Reviews and Reports 12(1):156-61. doi: 10.1007/s12015-015-9630-8.

Cancer cell lines can be useful to model cancer stem cells. Infection with Mycoplasma species is an insidious problem in mammalian cell culture. While investigating stem-like properties in early passage melanoma cell lines, we noted poorly reproducible results from an aliquot of a cell line that was later found to be infected with Mycoplasma hyorhinis. Deliberate infection of other early passage melanoma cell lines aliquots induced variable and unpredictable effects on expression of putative cancer stem cell markers, clonogenicity, proliferation and global gene expression. Cell lines established in stem cell media (SCM) were equally susceptible. Mycoplasma status is rarely reported in publications using cultured cells to study the cancer stem cell hypothesis. Our work highlights the importance of surveillance for Mycoplasma infection while using any cultured cells to interrogate tumor heterogeneity.

Mulder, Nicola J, Ezekiel Adebiyi, Raouf Alami, Alia Benkahla, James Brandful, Seydou Doumbia, Dean Everett, Faisal M Fadlelmola, Fatima Gaboun, Simani Gaseitsiwe, Hassan Ghazal, Scott Hazelhurst, Winston Hide, Azeddine Ibrahimi, Yasmina Jaufeerally Fakim, Victor Jongeneel, Fourie Joubert, Samar Kassim, Jonathan Kayondo, Judit Kumuthini, Sylvester Lyantagaye, Julie Makani, Ahmed Mansour Alzohairy, Daniel Masiga, Ahmed Moussa, Oyekanmi Nash, Odile Ouwe Missi Oukem-Boyer, Ellis Owusu-Dabo, Sumir Panji, Hugh Patterton, Fouzia Radouani, Khalid Sadki, Fouad Seghrouchni, Özlem Tastan Bishop, Nicki Tiffin, Nzovu Ulenga, and H3ABioNet Consortium. [2016] 2016. “H3ABioNet, a Sustainable Pan-African Bioinformatics Network for Human Heredity and Health in Africa..” Genome Research 26(2):271-7. doi: 10.1101/gr.196295.115.

The application of genomics technologies to medicine and biomedical research is increasing in popularity, made possible by new high-throughput genotyping and sequencing technologies and improved data analysis capabilities. Some of the greatest genetic diversity among humans, animals, plants, and microbiota occurs in Africa, yet genomic research outputs from the continent are limited. The Human Heredity and Health in Africa (H3Africa) initiative was established to drive the development of genomic research for human health in Africa, and through recognition of the critical role of bioinformatics in this process, spurred the establishment of H3ABioNet, a pan-African bioinformatics network for H3Africa. The limitations in bioinformatics capacity on the continent have been a major contributory factor to the lack of notable outputs in high-throughput biology research. Although pockets of high-quality bioinformatics teams have existed previously, the majority of research institutions lack experienced faculty who can train and supervise bioinformatics students. H3ABioNet aims to address this dire need, specifically in the area of human genetics and genomics, but knock-on effects are ensuring this extends to other areas of bioinformatics. Here, we describe the emergence of genomics research and the development of bioinformatics in Africa through H3ABioNet.

Roccaro, Aldo M, Antonio Sacco, Jiantao Shi, Marco Chiarini, Adriana Perilla-Glen, Salomon Manier, Siobhan Glavey, Yosra Aljawai, Yuji Mishima, Yawara Kawano, Michele Moschetta, Mick Correll, Ma Reina Improgo, Jennifer R Brown, Luisa Imberti, Giuseppe Rossi, Jorge J Castillo, Steven P Treon, Matthew L Freedman, Eliezer M Van Allen, Winston Hide, Elaine Hiller, Irene Rainville, and Irene M Ghobrial. [2016] 2016. “Exome Sequencing Reveals Recurrent Germ Line Variants in Patients With Familial Waldenström Macroglobulinemia..” Blood 127(21):2598-606. doi: 10.1182/blood-2015-11-680199.

Familial aggregation of Waldenström macroglobulinemia (WM) cases, and the clustering of B-cell lymphoproliferative disorders among first-degree relatives of WM patients, has been reported. Nevertheless, the possible contribution of inherited susceptibility to familial WM remains unrevealed. We performed whole exome sequencing on germ line DNA obtained from 4 family members in which coinheritance for WM was documented in 3 of them, and screened additional independent 246 cases by using gene-specific mutation sequencing. Among the shared germ line variants, LAPTM5(c403t) and HCLS1(g496a) were the most recurrent, being present in 3/3 affected members of the index family, detected in 8% of the unrelated familial cases, and present in 0.5% of the nonfamilial cases and in <0.05 of a control population. LAPTM5 and HCLS1 appeared as relevant WM candidate genes that characterized familial WM individuals and were also functionally relevant to the tumor clone. These findings highlight potentially novel contributors for the genetic predisposition to familial WM and indicate that LAPTM5(c403t) and HCLS1(g496a) may represent predisposition alleles in patients with familial WM.

Salomonis, Nathan, Phillip J Dexheimer, Larsson Omberg, Robin Schroll, Stacy Bush, Jeffrey Huo, Lynn Schriml, Shannan Ho Sui, Mehdi Keddache, Christopher Mayhew, Shiva Kumar Shanmukhappa, James Wells, Kenneth Daily, Shane Hubler, Yuliang Wang, Elias Zambidis, Adam Margolin, Winston Hide, Antonis K Hatzopoulos, Punam Malik, Jose A Cancelas, Bruce J Aronow, and Carolyn Lutzko. [2016] 2016. “Integrated Genomic Analysis of Diverse Induced Pluripotent Stem Cells from the Progenitor Cell Biology Consortium..” Stem Cell Reports 7(1):110-25. doi: 10.1016/j.stemcr.2016.05.006.

The rigorous characterization of distinct induced pluripotent stem cells (iPSC) derived from multiple reprogramming technologies, somatic sources, and donors is required to understand potential sources of variability and downstream potential. To achieve this goal, the Progenitor Cell Biology Consortium performed comprehensive experimental and genomic analyses of 58 iPSC from ten laboratories generated using a variety of reprogramming genes, vectors, and cells. Associated global molecular characterization studies identified functionally informative correlations in gene expression, DNA methylation, and/or copy-number variation among key developmental and oncogenic regulators as a result of donor, sex, line stability, reprogramming technology, and cell of origin. Furthermore, X-chromosome inactivation in PSC produced highly correlated differences in teratoma-lineage staining and regulator expression upon differentiation. All experimental results, and raw, processed, and metadata from these analyses, including powerful tools, are interactively accessible from a new online portal at https://www.synapse.org to serve as a reusable resource for the stem cell community.

Grasso, Carole, Matthew Anaka, Oliver Hofmann, Ramakrishna Sompallae, Kate Broadley, Winston Hide, Michael Berridge V, Jonathan Cebon, Andreas Behren, and Melanie J McConnell. [2016] 2016. “Iterative Sorting Reveals CD133+ and CD133- Melanoma Cells As Phenotypically Distinct Populations..” BMC Cancer 16(1):726. doi: 10.1186/s12885-016-2759-2.

BACKGROUND: The heterogeneity and tumourigenicity of metastatic melanoma is attributed to a cancer stem cell model, with CD133 considered to be a cancer stem cell marker in melanoma as well as other tumours, but its role has remained controversial.

METHODS: We iteratively sorted CD133+ and CD133- cells from 3 metastatic melanoma cell lines, and observed tumourigenicity and phenotypic characteristics over 7 generations of serial xeno-transplantation in NOD/SCID mice.

RESULTS: We demonstrate that iterative sorting is required to make highly pure populations of CD133+ and CD133- cells from metastatic melanoma, and that these two populations have distinct characteristics not related to the cancer stem cell phenotype. In vitro, gene set enrichment analysis indicated CD133+ cells were related to a proliferative phenotype, whereas CD133- cells were of an invasive phenotype. However, in vivo, serial transplantation of CD133+ and CD133- tumours over 7 generations showed that both populations were equally able to initiate and propagate tumours. Despite this, both populations remained phenotypically distinct, with CD133- cells only able to express CD133 in vivo and not in vitro. Loss of CD133 from the surface of a CD133+ cell was observed in vitro and in vivo, however CD133- cells derived from CD133+ retained the CD133+ phenotype, even in the presence of signals from the tumour microenvironment.

CONCLUSION: We show for the first time the necessity of iterative sorting to isolate pure marker-positive and marker-negative populations for comparative studies, and present evidence that despite CD133+ and CD133- cells being equally tumourigenic, they display distinct phenotypic differences, suggesting CD133 may define a distinct lineage in melanoma.

2015

Karp, Peter D, Bonnie Berger, Diane Kovats, Thomas Lengauer, Michal Linial, Pardis Sabeti, Winston Hide, and Burkhard Rost. [2015] 2015. “ISCB Ebola Award for Important Future Research on the Computational Biology of Ebola Virus..” F1000Research 4:12. doi: 10.12688/f1000research.6038.1.

Speed is of the essence in combating Ebola; thus, computational approaches should form a significant component of Ebola research. As for the development of any modern drug, computational biology is uniquely positioned to contribute through comparative analysis of the genome sequences of Ebola strains as well as 3-D protein modeling. Other computational approaches to Ebola may include large-scale docking studies of Ebola proteins with human proteins and with small-molecule libraries, computational modeling of the spread of the virus, computational mining of the Ebola literature, and creation of a curated Ebola database. Taken together, such computational efforts could significantly accelerate traditional scientific approaches. In recognition of the need for important and immediate solutions from the field of computational biology against Ebola, the International Society for Computational Biology (ISCB) announces a prize for an important computational advance in fighting the Ebola virus. ISCB will confer the ISCB Fight against Ebola Award, along with a prize of US$2,000, at its July 2016 annual meeting (ISCB Intelligent Systems for Molecular Biology (ISMB) 2016, Orlando, Florida).

Lizio, Marina, Jayson Harshbarger, Hisashi Shimoji, Jessica Severin, Takeya Kasukawa, Serkan Sahin, Imad Abugessaisa, Shiro Fukuda, Fumi Hori, Sachi Ishikawa-Kato, Christopher J Mungall, Erik Arner, Kenneth Baillie, Nicolas Bertin, Hidemasa Bono, Michiel de Hoon, Alexander D Diehl, Emmanuel Dimont, Tom C Freeman, Kaori Fujieda, Winston Hide, Rajaram Kaliyaperumal, Toshiaki Katayama, Timo Lassmann, Terrence F Meehan, Koro Nishikata, Hiromasa Ono, Michael Rehli, Albin Sandelin, Erik A Schultes, Peter A C ’t Hoen, Zuotian Tatum, Mark Thompson, Tetsuro Toyoda, Derek W Wright, Carsten O Daub, Masayoshi Itoh, Piero Carninci, Yoshihide Hayashizaki, Alistair R R Forrest, Hideya Kawaji, and FANTOM Consortium. [2015] 2015. “Gateways to the FANTOM5 Promoter Level Mammalian Expression Atlas..” Genome Biology 16(1):22. doi: 10.1186/s13059-014-0560-6.

The FANTOM5 project investigates transcription initiation activities in more than 1,000 human and mouse primary cells, cell lines and tissues using CAGE. Based on manual curation of sample information and development of an ontology for sample classification, we assemble the resulting data into a centralized data resource (http://fantom.gsc.riken.jp/5/). This resource contains web-based tools and data-access points for the research community to search and extract data related to samples, genes, promoter activities, transcription factors and enhancers across the FANTOM5 atlas.

Dimont, Emmanuel, Jiantao Shi, Rory Kirchner, and Winston Hide. [2015] 2015. “EdgeRun: an R Package for Sensitive, Functionally Relevant Differential Expression Discovery Using an Unconditional Exact Test..” Bioinformatics (Oxford, England) 31(15):2589-90. doi: 10.1093/bioinformatics/btv209.

Next-generation sequencing platforms for measuring digital expression such as RNA-Seq are displacing traditional microarray-based methods in biological experiments. The detection of differentially expressed genes between groups of biological conditions has led to the development of numerous bioinformatics tools, but so far, few exploit the expanded dynamic range afforded by the new technologies. We present edgeRun, an R package that implements an unconditional exact test that is a more powerful version of the exact test in edgeR. This increase in power is especially pronounced for experiments with as few as two replicates per condition, for genes with low total expression and with large biological coefficient of variation. In comparison with a panel of other tools, edgeRun consistently captures functionally similar differentially expressed genes.

Karp, Peter D, Bonnie Berger, Diane Kovats, Thomas Lengauer, Michal Linial, Pardis Sabeti, Winston Hide, and Burkhard Rost. [2015] 2015. “Message from the ISCB: ISCB Ebola Award for Important Future Research on the Computational Biology of Ebola Virus..” Bioinformatics (Oxford, England) 31(4):616-7. doi: 10.1093/bioinformatics/btv019.

UNLABELLED: Speed is of the essence in combating Ebola; thus, computational approaches should form a significant component of Ebola research. As for the development of any modern drug, computational biology is uniquely positioned to contribute through comparative analysis of the genome sequences of Ebola strains and three-dimensional protein modeling. Other computational approaches to Ebola may include large-scale docking studies of Ebola proteins with human proteins and with small-molecule libraries, computational modeling of the spread of the virus, computational mining of the Ebola literature and creation of a curated Ebola database. Taken together, such computational efforts could significantly accelerate traditional scientific approaches. In recognition of the need for important and immediate solutions from the field of computational biology against Ebola, the International Society for Computational Biology (ISCB) announces a prize for an important computational advance in fighting the Ebola virus. ISCB will confer the ISCB Fight against Ebola Award, along with a prize of US$2000, at its July 2016 annual meeting (ISCB Intelligent Systems for Molecular Biology 2016, Orlando, FL).

CONTACT: dkovats@iscb.org or rost@in.tum.de.