Publications

2015

Singh, Nakul, Arun D Singh, and Winston Hide. [2015] 2015. “Inferring an Evolutionary Tree of Uveal Melanoma From Genomic Copy Number Aberrations..” Investigative Ophthalmology & Visual Science 56(11):6801-9. doi: 10.1167/iovs.15-16822.

PURPOSE: The purpose of this study is to study the genomic evolution of primary uveal melanoma.

METHODS: Primary uveal melanoma genomic DNA was assayed on the Illumina Human660W-Quad v1.0 DNA Analysis BeadChip. Raw signal intensity data were quantile normalized to estimate copy number aberration with the Genome Alteration Print algorithm. Distance between samples was calculated as the Manhattan distance between the copy number profiles of the tumors. From the distance matrix, a phylogenetic network (evolutionary relationship inference) was estimated using SplitsTree4.

RESULTS: Of the 57 tumors, one (1.8%) was discarded because of a failed assay, and seven (12.3%) were revealed to be mixtures of several cell populations that could not be resolved. Three clades of tumor were identified (A [59.2%], B [32.7%], and C [6.1%]), each following a distinct evolutionary path and each associated with metastatic status (P = 0.01). One tumor (2.0%) did not fit into any clade. From a normal diploid melanocyte, a few tumors (clade C) lose a large portion of chromosome 6q, but do not develop any mutations on 8q. In an alternate path, the vast majority of tumors (clade A and clade B [91.9%]) gain a copy of the telomeric half of 8q. A majority of these tumors (clade A) subsequently lose a copy of chromosome 3, as well as gain the centromeric half of 8q. The other tumors (clade B) gain copies of 6p, as well as regions on 11p and 22q.

CONCLUSIONS: Our data suggest that there is little overlap in the subtypes of uveal melanoma after divergence (identified as clades A and B) and that these distinct subtypes are not likely to crossover or transform from one major clade to another.

Nishi, Yuichi, Xiaoxiao Zhang, Jieun Jeong, Kevin A Peterson, Anastasia Vedenko, Martha L Bulyk, Winston A Hide, and Andrew P McMahon. [2015] 2015. “A Direct Fate Exclusion Mechanism by Sonic Hedgehog-Regulated Transcriptional Repressors..” Development (Cambridge, England) 142(19):3286-93. doi: 10.1242/dev.124636.

Sonic hedgehog (Shh) signaling patterns the vertebrate spinal cord by activating a group of transcriptional repressors in distinct neural progenitors of somatic motor neuron and interneuron subtypes. To identify the action of this network, we performed a genome-wide analysis of the regulatory actions of three key ventral determinants in mammalian neural tube patterning: Nkx2.2, Nkx6.1 and Olig2. Previous studies have demonstrated that each factor acts predominantly as a transcriptional repressor, at least in part, to inhibit alternative progenitor fate choices. Here, we reveal broad and direct repression of multiple alternative fates as a general mechanism of repressor action. Additionally, the repressor network targets multiple Shh signaling components providing negative feedback to ongoing Shh signaling. Analysis of chromatin organization around Nkx2.2-, Nkx6.1- and Olig2-bound regions, together with co-analysis of engagement of the transcriptional activator Sox2, indicate that repressors bind to, and probably modulate the action of, neural enhancers. Together, the data suggest a model for neural progenitor specification downstream of Shh signaling, in which Nkx2.2 and Olig2 direct repression of alternative neural progenitor fate determinants, an action augmented by the overlapping activity of Nkx6.1 in each cell type. Integration of repressor and activator inputs, notably activator inputs mediated by Sox2, is probably a key mechanism in achieving cell type-specific transcriptional outcomes in mammalian neural progenitor fate specification.

2014

Zook, Justin M, Brad Chapman, Jason Wang, David Mittelman, Oliver Hofmann, Winston Hide, and Marc Salit. [2014] 2014. “Integrating Human Sequence Data Sets Provides a Resource of Benchmark SNP and Indel Genotype Calls..” Nature Biotechnology 32(3):246-51. doi: 10.1038/nbt.2835.

Clinical adoption of human genome sequencing requires methods that output genotypes with known accuracy at millions or billions of positions across a genome. Because of substantial discordance among calls made by existing sequencing methods and algorithms, there is a need for a highly accurate set of genotypes across a genome that can be used as a benchmark. Here we present methods to make high-confidence, single-nucleotide polymorphism (SNP), indel and homozygous reference genotype calls for NA12878, the pilot genome for the Genome in a Bottle Consortium. We minimize bias toward any method by integrating and arbitrating between 14 data sets from five sequencing technologies, seven read mappers and three variant callers. We identify regions for which no confident genotype call could be made, and classify them into different categories based on reasons for uncertainty. Our genotype calls are publicly available on the Genome Comparison and Analytic Testing website to enable real-time benchmarking of any method.

Liu, Jing, Michaela Krautzberger, Shannan H Sui, Oliver M Hofmann, Ying Chen, Manfred Baetscher, Ivica Grgic, Sanjeev Kumar, Benjamin D Humphreys, Winston A Hide, and Andrew P McMahon. [2014] 2014. “Cell-Specific Translational Profiling in Acute Kidney Injury..” The Journal of Clinical Investigation 124(3):1242-54. doi: 10.1172/JCI72126.

Acute kidney injury (AKI) promotes an abrupt loss of kidney function that results in substantial morbidity and mortality. Considerable effort has gone toward identification of diagnostic biomarkers and analysis of AKI-associated molecular events; however, most studies have adopted organ-wide approaches and have not elucidated the interplay among different cell types involved in AKI pathophysiology. To better characterize AKI-associated molecular and cellular events, we developed a mouse line that enables the identification of translational profiles in specific cell types. This strategy relies on CRE recombinase-dependent activation of an EGFP-tagged L10a ribosomal protein subunit, which allows translating ribosome affinity purification (TRAP) of mRNA populations in CRE-expressing cells. Combining this mouse line with cell type-specific CRE-driver lines, we identified distinct cellular responses in an ischemia reperfusion injury (IRI) model of AKI. Twenty-four hours following IRI, distinct translational signatures were identified in the nephron, kidney interstitial cell populations, vascular endothelium, and macrophages/monocytes. Furthermore, TRAP captured known IRI-associated markers, validating this approach. Biological function annotation, canonical pathway analysis, and in situ analysis of identified response genes provided insight into cell-specific injury signatures. Our study provides a deep, cell-based view of early injury-associated molecular events in AKI and documents a versatile, genetic tool to monitor cell-specific and temporal-specific biological processes in disease modeling.

Li, Jonathan Z, Brad Chapman, Patrick Charlebois, Oliver Hofmann, Brian Weiner, Alyssa J Porter, Reshmi Samuel, Saran Vardhanabhuti, Lu Zheng, Joseph Eron, Babafemi Taiwo, Michael C Zody, Matthew R Henn, Daniel R Kuritzkes, Winston Hide, ACTG A5262 Study Team, Cara C Wilson, Baiba I Berzins, Edward P Acosta, Barbara Bastow, Peter S Kim, Sarah W Read, Jennifer Janik, Debra S Meres, Michael M Lederman, Lori Mong-Kryspin, Karl E Shaw, Louis G Zimmerman, Randi Leavitt, Guy De La Rosa, and Amy Jennings. [2014] 2014. “Comparison of Illumina and 454 Deep Sequencing in Participants Failing Raltegravir-Based Antiretroviral Therapy..” PloS One 9(3):e90485. doi: 10.1371/journal.pone.0090485.

BACKGROUND: The impact of raltegravir-resistant HIV-1 minority variants (MVs) on raltegravir treatment failure is unknown. Illumina sequencing offers greater throughput than 454, but sequence analysis tools for viral sequencing are needed. We evaluated Illumina and 454 for the detection of HIV-1 raltegravir-resistant MVs.

METHODS: A5262 was a single-arm study of raltegravir and darunavir/ritonavir in treatment-naïve patients. Pre-treatment plasma was obtained from 5 participants with raltegravir resistance at the time of virologic failure. A control library was created by pooling integrase clones at predefined proportions. Multiplexed sequencing was performed with Illumina and 454 platforms at comparable costs. Illumina sequence analysis was performed with the novel snp-assess tool and 454 sequencing was analyzed with V-Phaser.

RESULTS: Illumina sequencing resulted in significantly higher sequence coverage and a 0.095% limit of detection. Illumina accurately detected all MVs in the control library at ≥0.5% and 7/10 MVs expected at 0.1%. 454 sequencing failed to detect any MVs at 0.1% with 5 false positive calls. For MVs detected in the patient samples by both 454 and Illumina, the correlation in the detected variant frequencies was high (R2 = 0.92, P<0.001). Illumina sequencing detected 2.4-fold greater nucleotide MVs and 2.9-fold greater amino acid MVs compared to 454. The only raltegravir-resistant MV detected was an E138K mutation in one participant by Illumina sequencing, but not by 454.

CONCLUSIONS: In participants of A5262 with raltegravir resistance at virologic failure, baseline raltegravir-resistant MVs were rarely detected. At comparable costs to 454 sequencing, Illumina demonstrated greater depth of coverage, increased sensitivity for detecting HIV MVs, and fewer false positive variant calls.

CLST, FANTOM Consortium and the RIKEN PMI and, Alistair R R Forrest, Hideya Kawaji, Michael Rehli, Kenneth Baillie, Michiel J L de Hoon, Vanja Haberle, Timo Lassmann, Ivan Kulakovskiy V, Marina Lizio, Masayoshi Itoh, Robin Andersson, Christopher J Mungall, Terrence F Meehan, Sebastian Schmeier, Nicolas Bertin, Mette Jørgensen, Emmanuel Dimont, Erik Arner, Christian Schmidl, Ulf Schaefer, Yulia A Medvedeva, Charles Plessy, Morana Vitezic, Jessica Severin, Colin A Semple, Yuri Ishizu, Robert S Young, Margherita Francescatto, Intikhab Alam, Davide Albanese, Gabriel M Altschuler, Takahiro Arakawa, John A C Archer, Peter Arner, Magda Babina, Sarah Rennie, Piotr J Balwierz, Anthony G Beckhouse, Swati Pradhan-Bhatt, Judith A Blake, Antje Blumenthal, Beatrice Bodega, Alessandro Bonetti, James Briggs, Frank Brombacher, Maxwell Burroughs, Andrea Califano, Carlo Cannistraci V, Daniel Carbajo, Yun Chen, Marco Chierici, Yari Ciani, Hans C Clevers, Emiliano Dalla, Carrie A Davis, Michael Detmar, Alexander D Diehl, Taeko Dohi, Finn Drabløs, Albert S B Edge, Matthias Edinger, Karl Ekwall, Mitsuhiro Endoh, Hideki Enomoto, Michela Fagiolini, Lynsey Fairbairn, Hai Fang, Mary C Farach-Carson, Geoffrey J Faulkner, Alexander Favorov V, Malcolm E Fisher, Martin C Frith, Rie Fujita, Shiro Fukuda, Cesare Furlanello, Masaaki Furino, Jun-ichi Furusawa, Teunis B Geijtenbeek, Andrew P Gibson, Thomas Gingeras, Daniel Goldowitz, Julian Gough, Sven Guhl, Reto Guler, Stefano Gustincich, Thomas J Ha, Masahide Hamaguchi, Mitsuko Hara, Matthias Harbers, Jayson Harshbarger, Akira Hasegawa, Yuki Hasegawa, Takehiro Hashimoto, Meenhard Herlyn, Kelly J Hitchens, Shannan J Ho Sui, Oliver M Hofmann, Ilka Hoof, Furni Hori, Lukasz Huminiecki, Kei Iida, Tomokatsu Ikawa, Boris R Jankovic, Hui Jia, Anagha Joshi, Giuseppe Jurman, Bogumil Kaczkowski, Chieko Kai, Kaoru Kaida, Ai Kaiho, Kazuhiro Kajiyama, Mutsumi Kanamori-Katayama, Artem S Kasianov, Takeya Kasukawa, Shintaro Katayama, Sachi Kato, Shuji Kawaguchi, Hiroshi Kawamoto, Yuki I Kawamura, Tsugumi Kawashima, Judith S Kempfle, Tony J Kenna, Juha Kere, Levon M Khachigian, Toshio Kitamura, Peter Klinken, Alan J Knox, Miki Kojima, Soichi Kojima, Naoto Kondo, Haruhiko Koseki, Shigeo Koyasu, Sarah Krampitz, Atsutaka Kubosaki, Andrew T Kwon, Jeroen F J Laros, Weonju Lee, Andreas Lennartsson, Kang Li, Berit Lilje, Leonard Lipovich, Alan Mackay-Sim, Ri-ichiroh Manabe, Jessica C Mar, Benoit Marchand, Anthony Mathelier, Niklas Mejhert, Alison Meynert, Yosuke Mizuno, David A de Lima Morais, Hiromasa Morikawa, Mitsuru Morimoto, Kazuyo Moro, Efthymios Motakis, Hozumi Motohashi, Christine L Mummery, Mitsuyoshi Murata, Sayaka Nagao-Sato, Yutaka Nakachi, Fumio Nakahara, Toshiyuki Nakamura, Yukio Nakamura, Kenichi Nakazato, Erik van Nimwegen, Noriko Ninomiya, Hiromi Nishiyori, Shohei Noma, Shohei Noma, Tadasuke Noazaki, Soichi Ogishima, Naganari Ohkura, Hiroko Ohimiya, Hiroshi Ohno, Mitsuhiro Ohshima, Mariko Okada-Hatakeyama, Yasushi Okazaki, Valerio Orlando, Dmitry A Ovchinnikov, Arnab Pain, Robert Passier, Margaret Patrikakis, Helena Persson, Silvano Piazza, James G D Prendergast, Owen J L Rackham, Jordan A Ramilowski, Mamoon Rashid, Timothy Ravasi, Patrizia Rizzu, Marco Roncador, Sugata Roy, Morten B Rye, Eri Saijyo, Antti Sajantila, Akiko Saka, Shimon Sakaguchi, Mizuho Sakai, Hiroki Sato, Suzana Savvi, Alka Saxena, Claudio Schneider, Erik A Schultes, Gundula G Schulze-Tanzil, Anita Schwegmann, Thierry Sengstag, Guojun Sheng, Hisashi Shimoji, Yishai Shimoni, Jay W Shin, Christophe Simon, Daisuke Sugiyama, Takaai Sugiyama, Masanori Suzuki, Naoko Suzuki, Rolf K Swoboda, Peter A C ’t Hoen, Michihira Tagami, Naoko Takahashi, Jun Takai, Hiroshi Tanaka, Hideki Tatsukawa, Zuotian Tatum, Mark Thompson, Hiroo Toyodo, Tetsuro Toyoda, Elvind Valen, Marc van de Wetering, Linda M van den Berg, Roberto Verado, Dipti Vijayan, Ilya E Vorontsov, Wyeth W Wasserman, Shoko Watanabe, Christine A Wells, Louise N Winteringham, Ernst Wolvetang, Emily J Wood, Yoko Yamaguchi, Masayuki Yamamoto, Misako Yoneda, Yohei Yonekura, Shigehiro Yoshida, Susan E Zabierowski, Peter G Zhang, Xiaobei Zhao, Silvia Zucchelli, Kim M Summers, Harukazu Suzuki, Carsten O Daub, Jun Kawai, Peter Heutink, Winston Hide, Tom C Freeman, Boris Lenhard, Vladimir B Bajic, Martin S Taylor, Vsevolod J Makeev, Albin Sandelin, David A Hume, Piero Carninci, and Yoshihide Hayashizaki. [2014] 2014. “A Promoter-Level Mammalian Expression Atlas..” Nature 507(7493):462-70. doi: 10.1038/nature13182.

Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.

Froehlich, John W, Ali R Vaezzadeh, Marc Kirchner, Andrew C Briscoe, Oliver Hofmann, Winston Hide, Hanno Steen, and Richard S Lee. [2014] 2014. “An In-Depth Comparison of the Male Pediatric and Adult Urinary Proteomes..” Biochimica et Biophysica Acta 1844(5):1044-50. doi: 10.1016/j.bbapap.2013.05.008.

In this study, we performed an in-depth characterization of the male pediatric infant urinary proteome by parallel proteomic analysis of normal healthy adult (n=6) and infant (n=6) males and comparison to available published data. A total of 1584 protein groups were identified. Of these, 708 proteins were identified in samples from both cohorts. Although present in both cohorts, 136 of these common proteins were significantly enriched in urine from adults and 94 proteins were significantly enriched in urine from infants. Using Gene Ontology, we found that the infant-enriched or specific subproteome (743 proteins) had an overrepresentation of proteins that are involved in translation and transcription, cellular growth and metabolic processes. In contrast, the adult enriched or specific subproteome (364 proteins) showed an overexpression of proteins involved in immune response and cell adhesion. This study demonstrates that the non-diseased male urinary proteome is quantitatively affected by age, has age-specific subproteomes, and identifies a common subproteome with no age-dependent abundance variations. These findings highlight the importance of age-matching in urinary proteomics. This article is part of a Special Issue entitled: Biomarkers: A Proteomic Challenge.

Dimont, Emmanuel, Oliver Hofmann, Shannan J Ho Sui, Alistair R R Forrest, Hideya Kawaji, FANTOM Consortium, and Winston Hide. [2014] 2014. “CAGExploreR: an R Package for the Analysis and Visualization of Promoter Dynamics across Multiple Experiments..” Bioinformatics (Oxford, England) 30(8):1183-84.

UNLABELLED: Alternate promoter usage is an important molecular mechanism for generating RNA and protein diversity. Cap Analysis Gene Expression (CAGE) is a powerful approach for revealing the multiplicity of transcription start site (TSS) events across experiments and conditions. An understanding of the dynamics of TSS choice across these conditions requires both sensitive quantification and comparative visualization. We have developed CAGExploreR, an R package to detect and visualize changes in the use of specific TSS in wider promoter regions in the context of changes in overall gene expression when comparing different CAGE samples. These changes provide insight into the modification of transcript isoform generation and regulatory network alterations associated with cell types and conditions. CAGExploreR is based on the FANTOM5 and MPromDb promoter set definitions but can also work with user-supplied regions. The package compares multiple CAGE libraries simultaneously. Supplementary Materials describe methods in detail, and a vignette demonstrates a workflow with a real data example.

AVAILABILITY AND IMPLEMENTATION: The package is freely available under the MIT license from CRAN (http://cran.r-project.org/web/packages/CAGExploreR).

CONTACT: edimont@mail.harvard.edu Supplementary information: Supplementary data are available at Bioinformatics online.

Consortium, H3Africa, Charles Rotimi, Akin Abayomi, Alash’ le Abimiku, Victoria May Adabayeri, Clement Adebamowo, Ezekiel Adebiyi, Adebowale D Ademola, Adebowale Adeyemo, Dwomoa Adu, Dissou Affolabi, Godfred Agongo, Samuel Ajayi, Sally Akarolo-Anthony, Rufus Akinyemi, Albert Akpalu, Marianne Alberts, Orlando Alonso Betancourt, Ahmed Mansour Alzohairy, Gobena Ameni, Olukemi Amodu, Gabriel Anabwani, Kristian Andersen, Fatiu Arogundade, Oyedunni Arulogun, Danny Asogun, Rasheed Bakare, Naby Balde, Mary Lynn Baniecki, Christine Beiswanger, Alia Benkahla, Lara Bethke, Micheal Boehnke, Vincent Boima, James Brandful, Andrew I Brooks, Frank C Brosius, Chester Brown, Bruno Bucheton, David T Burke, Barrington G Burnett, Stacy Carrington-Lawrence, Nadia Carstens, John Chisi, Alan Christoffels, Richard Cooper, Heather Cordell, Nigel Crowther, Talishiea Croxton, Jantina de Vries, Leslie Derr, Peter Donkor, Seydou Doumbia, Audrey Duncanson, Ivy Ekem, Ahmed El Sayed, Mark E Engel, John C K Enyaru, Dean Everett, Faisal M Fadlelmola, Eyitayo Fakunle, Kenneth H Fischbeck, Anne Fischer, Onikepe Folarin, Junaid Gamieldien, Robert F Garry, Simani Gaseitsiwe, Rasheed Gbadegesin, Anita Ghansah, Maria Giovanni, Parham Goesbeck, Xavier Gomez-Olive, Donald S Grant, Ravnit Grewal, Mark Guyer, Neil A Hanchard, Christian T Happi, Scott Hazelhurst, Branwen J Hennig, Christiane Hertz-, , Winston Hide, Friedhelm Hilderbrandt, Christopher Hugo-Hamman, Muntaser E Ibrahim, Regina James, Yasmina Jaufeerally-Fakim, Carolyn Jenkins, Ute Jentsch, Pan-Pan Jiang, Moses Joloba, Victor Jongeneel, Fourie Joubert, Mukthar Kader, Kathleen Kahn, Pontiano Kaleebu, Saidi H Kapiga, Samar Kamal Kassim, Ishmael Kasvosve, Jonathan Kayondo, Bernard Keavney, Adeodata Kekitiinwa, Sheik Humarr Khan, Paul Kimmel, Mary-Claire King, Robert Kleta, Mathurin Koffi, Jeffrey Kopp, Matthias Kretzler, Judit Kumuthini, Samuel Kyobe, Catherine Kyobutungi, Daniel T Lackland, Karen A Lacourciere, Guida Landouré, Rita Lawlor, Thomas Lehner, Maia Lesosky, Naomi Levitt, Katherine Littler, Zane Lombard, Jeanne F Loring, Sylvester Lyantagaye, Annette Macleod, Ebony B Madden, Chengetai R Mahomva, Julie Makani, Manmak Mamven, Marape Marape, Graeme Mardon, Patricia Marshall, Darren P Martin, Daniel Masiga, Robin Mason, Michael Mate-Kole, Enock Matovu, Mary Mayige, Bongani M Mayosi, Jean Claude Mbanya, Sheryl A McCurdy, Mark I McCarthy, Helen McIlleron, S O Mc’Ligeyo, Corrine Merle, Ana Olga Mocumbi, Charles Mondo, John Moran V, Ayesha Motala, Marva Moxey-Mims, Wata Sununguko Mpoloka, Chisomo L Msefula, Thuli Mthiyane, Nicola Mulder, Gebregziab her Mulugeta, Dieuodonne Mumba, John Musuku, Mo Nagdee, Oyekanmi Nash, Daouda Ndiaye, Anh Quynh Nguyen, Mark Nicol, Oathokwa Nkomazana, Shane Norris, Betty Nsangi, Alexander Nyarko, Moffat Nyirenda, Eileen Obe, Reginald Obiakor, Abraham Oduro, Solomon F Ofori-Acquah, Okechukwu Ogah, Stephen Ogendo, Kwaku Ohene-Frempong, Akinlolu Ojo, Timothy Olanrewaju, John Oli, Charlotte Osafo, Odile Ouwe Missi Oukem-Boyer, Bruce Ovbiagele, Andrew Owen, Mayowa Ojo Owolabi, Lukman Owolabi, Ellis Owusu-Dabo, Guillaume Paré, Rulan Parekh, Hugh G Patterton, Margaret B Penno, Jane Peterson, Rembert Pieper, Jacob Plange-Rhule, Martin Pollak, Julia Puzak, Rajkumar S Ramesar, Michele Ramsay, Rebekah Rasooly, Shiksha Reddy, Pardis C Sabeti, Kwamena Sagoe, Tunde Salako, Oumar Samassékou, Manjinder S Sandhu, Osman Sankoh, Fred Stephen Sarfo, Marie Sarr, Gasnat Shaboodien, Issa Sidibe, Gustave Simo, Martin Simuunza, Liam Smeeth, Eugene Sobngwi, Himla Soodyall, Hermann Sorgho, Oumou Sow Bah, Sudha Srinivasan, Dan J Stein, Ezra S Susser, Carmen Swanepoel, Godfred Tangwa, Andrew Tareila, Özlem Tastan Bishop, Bamidele Tayo, Nicki Tiffin, Halidou Tinto, Ekaete Tobin, Stephen Meir Tollman, Mahamadou Traoré, Marsha J Treadwell, Jennifer Troyer, Masego Tsimako-Johnstone, Vincent Tukei, Ifeoma Ulasi, Nzovu Ulenga, Beverley van Rooyen, Ablo Prudence Wachinou, Salina P Waddy, Alisha Wade, Misaki Wayengera, James Whitworth, Louise Wideroff, Cheryl A Winkler, Sarah Winnicki, Ambroise Wonkam, Mengistu Yewondwos, Tadase sen, Nathan Yozwiak, and Heather Zar. [2014] 2014. “Research Capacity. Enabling the Genomic Revolution in Africa..” Science (New York, N.Y.) 344(6190):1346-8. doi: 10.1126/science.1251546.
Tan, Shen Mynn, Rory Kirchner, Jingmin Jin, Oliver Hofmann, Larry McReynolds, Winston Hide, and Judy Lieberman. [2014] 2014. “Sequencing of Captive Target Transcripts Identifies the Network of Regulated Genes and Functions of Primate-Specific MiR-522..” Cell Reports 8(4):1225-39. doi: 10.1016/j.celrep.2014.07.023.

Identifying microRNA (miRNA)-regulated genes is key to understanding miRNA function. However, many miRNA recognition elements (MREs) do not follow canonical "seed" base-pairing rules, making identification of bona fide targets challenging. Here, we apply an unbiased sequencing-based systems approach to characterize miR-522, a member of the oncogenic primate-specific chromosome 19 miRNA cluster, highly expressed in poorly differentiated cancers. To identify miRNA targets, we sequenced full-length transcripts captured by a biotinylated miRNA mimic. Within these targets, mostly noncanonical MREs were identified by sequencing RNase-resistant fragments. miR-522 overexpression reduced mRNA, protein levels, and luciferase activity of >70% of a random list of candidate target genes and MREs. Bioinformatic analysis suggested that miR-522 regulates cell proliferation, detachment, migration, and epithelial-mesenchymal transition. miR-522 induces G1 cell-cycle arrest and causes cells to detach without anoikis, become invasive, and express mesenchymal genes. Thus, our method provides a simple but effective technique for identifying miRNA-regulated genes and biological function.