Long-Read Sequencing (LRS) technologies offer capabilities for characterizing complex engineered DNA constructs from Golden Gate and barcoded DNA variant assemblies, CRISPR engineered libraries or Multiplexed Assays of Variant Effect (MAVE) experiments. However, the heterogeneity of such molecules, combined with potential structural and length variability, presents analytical challenges. We present SLICER (Sequencing Long-read Identifier of Complex Element Regions), a pipeline for analyzing LRS of such constructs. SLICER dynamically identifies and extracts user-defined barcode/core elements per-read using an anchor-based method, accommodating positional/length variations and aligns these back to reference sequences. If absent, SLICER is capable of de novo reference prediction, a feature that can be insightful to identify unpredicted/aberrant phasing/combinatorial events. When benchmarked, SLICER’s de novo reference prediction was accurate to within 1% of reference data. SLICER’s dynamic extraction and robust de novo reference capabilities provide an invaluable tool for synthetic and engineered biology applications, enabling comprehensive interrogation of complex barcoded DNA constructs and libraries. SLICER is available at https://github.com/mbassalbioinformatics/SLICER.
Publications
Forthcoming
2025
Stem cells reside in specialized microenvironments, termed niches, at several different locations in tissues1,2,3. The differential functions of heterogeneous stem cells and niches are important given the increasing clinical applications of stem-cell transplantation and immunotherapy. Whether hierarchical structures among stem cells at distinct niches exist and further control aspects of immune tolerance is unknown. Here we describe previously unknown new hierarchical arrangements in haematopoietic stem cells (HSCs) and bone marrow niches that dictate both regenerative potential and immune privilege. High-level nitric oxide-generating (NOhi) HSCs are refractory to immune attack and exhibit delayed albeit robust long-term reconstitution. Such highly immune-privileged, primitive NOhi HSCs co-localize with distinctive capillaries characterized by primary ciliated endothelium and high levels of the immune-checkpoint molecule CD200. These capillaries regulate the regenerative functions of NOhi HSCs through the ciliary protein IFT20 together with CD200, endothelial nitric oxide synthase and autophagy signals, which further mediate immunoprotection. Notably, previously described niche constituents, sinusoidal cells and type-H vessels2,3,4,5,6,7,8,9,10 co-localize with less immune-privileged and less potent NOlow HSCs. Together, we identify highly immune-privileged, late-rising primitive HSCs and characterize their immunoprotective niches comprising specialized vascular domains. Our results indicate that the niche orchestrates hierarchy in stem cells and immune tolerance, and highlight future immunotherapeutic targets.
Immunomodulatory imide drugs (IMiDs) degrade specific C2H2 zinc finger degrons in transcription factors, making them effective against certain cancers. SALL4, a cancer driver, contains seven C2H2 zinc fingers in three clusters, including an IMiD degron in zinc finger cluster one (ZFC1). Surprisingly, IMiDs do not inhibit the growth of SALL4-expressing cancer cells. To overcome this limit, we focused on a non-IMiD domain, SALL4 zinc finger cluster four (ZFC4). By combining ZFC4-DNA crystal structure and an in silico docking algorithm, in conjunction with cell viability assays, we screened several chemical libraries against a potentially druggable binding pocket, leading to the discovery of SH6, a compound that selectively targets SALL4-expressing cancer cells. Mechanistic studies revealed that SH6 degrades SALL4 protein through the CUL4A/CRBN pathway, while deletion of ZFC4 abolished this activity. Moreover, SH6 treatment led to a significant 87% tumor growth inhibition of SALL4+ patient-derived xenografts and demonstrated good bioavailability in pharmacokinetic studies. In summary, these studies represent a new approach for IMiD independent drug discovery targeting C2H2 transcription factors such as SALL4 in cancer.
Coordinated initiation of DNA replication is essential to ensure efficient and timely DNA synthesis. Yet, molecular mechanism describing how replication initiation is coordinated in eukar-yotic cells is not completely understood. Herein, we present data demonstrating a novel feature of RNAs transcribed in the proximity of actively replicating gene loci. We show that RNAs aN-Choring ORC1 (ANCORs) to the histone variant H2A.Z are licensors of the DNA replication process. This ANCOR-H2A.Z interaction is essential for cells to initiate duplication of their ge-netic material. Widespread and locus-specific perturbations of these transcripts correlate with anomalous replication patterns and a notable loss of the H2A.Z replicative marker at the origin site. Collectively, we present a previously undescribed RNA-mediated mechanism that is associ-ated with the generation of active replication origins in eukaryotic cells. Our findings delineate a strategy to modulate the origins of replication in human cells at a local and global level, with potentially broad biomedical implications.
2024
The transcription factor C/EBPα is a well characterized DNA binding protein with essential functions in controlling and regulating myeloid differentiation and lipid metabolism. Herein, we describe C/EBPα’s separate and distinct RNA binding characteristics.
Using Chromatin RNA Immunoprecipitation, we identified that C/EBPα interacts primarily with RNA introns in both HL-60 and THP-1 cells, with a preference for a palindromic GC-rich binding motif. Structural prediction in conjunction with RNA electrophoretic mobility shift assays show that C/EBPα interacts with RNA through two previously undescribed domains located towards the proteins N terminus. These domains are distinct from C/EBPα’s DNA binding b-ZIP domain which is instead located on the proteins C terminus.
Mouse bone marrow transplantation and in vitro cytokine assays reveal that C/EBPα RNA binding appears to be essential for macrophage maturation but not neutrophil differentiation. To better understand these phenotypic differences, we expanded our observations into the transcriptomic space by utilizing single-cell CITE-Seq. In line with biochemical data, notable mature cell populations were absent when the C/EBPα’s RNA binding domains are excluded from the protein. Intriguingly, differential gene expression revealed strong, selective upregulation of Lipoprotein Lipase (Lpl) in monocyte, but not neutrophil populations. It has been previously reported that dysregulation of Lpl affects bone marrow monocyte progenitor differentiation and results in dysregulated cellular mobilization from the bone marrow, a phenotype that is commonly seen in cancers such as AML. The increased abundance of monocyte and neutrophil populations in the C/EBPα RNA deletion samples is therefore suggestive that C/EBPα RNA binding is critical in regulating terminal differentiation and mobilization circuits which may become dysregulated or perturbed in diseases such as AML.
On-going analyses are investigating the RNA velocity trajectory of captured cell populations in an attempt to better understand at which stage C/EBPα RNA binding becomes essential for terminal maturation and differentiation of select cell populations. Taken together though, we demonstrate that C/EBPα is also an RNA binding protein with unique functions distinct from its DNA binding activity that is essential for monocyte maturation, differentiation and mobilization.
Transposable elements (TEs) are indispensable for human development, with critical functions in pluripotency and embryogenesis. TE sequences also contribute to human pathologies, especially cancer, with documented activities as cis/trans transcriptional regulators, as sources of non-coding RNAs, and as mutagens that disrupt tumor suppressors. Despite this knowledge, little is known regarding the involvement of TE-derived genes (TEGs) in tumor pathogenesis. Here, systematic analyses of TEG expression across human cancer reveal a prominent role for pogo TE derived with KRAB domain (POGK). We show that POGK acts as a tumor suppressor in triple-negative breast cancer (TNBC) cells and that it couples with the co-repressor TRIM28 to directly block the transcription of ribosomal genes RPS16 and RPS29, in turn causing widespread inhibition of ribosomal biogenesis. We report that POGK undergoes deactivation by isoform switching in clinical TNBC, altogether revealing its exapted activities in tumor growth control.
2023
Histone modifications play crucial roles in transcriptional activation, and aberrant epigenetic changes are associated with oncogenesis. Lysine (K) acetyltransferases 5 (TIP60, also known as KAT5) is reportedly implicated in cancer development and maintenance, although its function in lung cancer remains controversial. Here we demonstrate that TIP60 knockdown in non-small cell lung cancer cell lines decreased tumor cell growth, migration, and invasion. Furthermore, analysis of a mouse lung cancer model with lung-specific conditional Tip60 knockout revealed suppressed tumor formation relative to controls, but no apparent effects on normal lung homeostasis. RNA-seq and ChIP-seq analyses of inducible TIP60 knockdown H1975 cells relative to controls revealed transglutaminase enzyme (TGM5) as downstream of TIP60. Investigation of a connectivity map database identified several candidate compounds that decrease TIP60 mRNA, one that suppressed tumor growth in cell culture and in vivo. In addition, TH1834, a TIP60 acetyltransferase inhibitor, showed comparable antitumor effects in cell culture and in vivo. Taken together, suppression of TIP60 activity shows tumor-specific efficacy against lung cancer, with no overt effect on normal tissues. Our work suggests that targeting TIP60 could be a promising approach to treating lung cancer.