A large amount of genomic data for profiling three-dimensional genome architecture have accumulated from large-scale consortium projects as well as from individual laboratories. In this review, we summarize recent landmark datasets and collections in the field. We describe the challenges in collection, annotation, and analysis of these data, particularly for integration of sequencing and microscopy data. We introduce efforts from consortia and independent groups to harmonize diverse datasets. As the resolution and throughput of sequencing and imaging technologies continue to increase, more efficient utilization and integration of collected data will be critical for a better understanding of nuclear architecture.
In the context of human disease, the mechanisms whereby transcription factors reprogram gene expression in reparative responses to injury are not well understood. We have studied the mechanisms of transcriptional reprogramming in disease using murine kidney podocytes as a model for tissue injury. Podocytes are a crucial component of glomeruli, the filtration units of each nephron. Podocyte injury is the initial event in many processes that lead to end-stage kidney disease. Wilms tumor-1 (WT1) is a master regulator of gene expression in podocytes, binding nearly all genes known to be crucial for maintenance of the glomerular filtration barrier. Using murine models and human kidney organoids, we investigated WT1-mediated transcriptional reprogramming during the course of podocyte injury. Reprogramming the transcriptome involved highly dynamic changes in the binding of WT1 to target genes during a reparative injury response, affecting chromatin state and expression levels of target genes.
Cell behaviors are dictated by epigenetic and transcriptional programs. Little is known about how extracellular stimuli modulate these programs to reshape gene expression and control cell behavioral responses. Here, we interrogated the epigenetic and transcriptional response of endothelial cells to VEGFA treatment and found rapid chromatin changes that mediate broad transcriptomic alterations. VEGFA-responsive genes were associated with active promoters, but changes in promoter histone marks were not tightly linked to gene expression changes. VEGFA altered transcription factor occupancy and the distal epigenetic landscape, which profoundly contributed to VEGFA-dependent changes in gene expression. Integration of gene expression, dynamic enhancer, and transcription factor occupancy changes induced by VEGFA yielded a VEGFA-regulated transcriptional regulatory network, which revealed that the small MAF transcription factors are master regulators of the VEGFA transcriptional program and angiogenesis. Collectively these results revealed that extracellular stimuli rapidly reconfigure the chromatin landscape to coordinately regulate biological responses.
Bromodomain-containing protein 9 (BRD9) is a recently identified subunit of SWI/SNF(BAF) chromatin remodeling complexes, yet its function is poorly understood. Here, using a genome-wide CRISPR-Cas9 screen, we show that BRD9 is a specific vulnerability in pediatric malignant rhabdoid tumors (RTs), which are driven by inactivation of the SMARCB1 subunit of SWI/SNF. We find that BRD9 exists in a unique SWI/SNF sub-complex that lacks SMARCB1, which has been considered a core subunit. While SMARCB1-containing SWI/SNF complexes are bound preferentially at enhancers, we show that BRD9-containing complexes exist at both promoters and enhancers. Mechanistically, we show that SMARCB1 loss causes increased BRD9 incorporation into SWI/SNF thus providing insight into BRD9 vulnerability in RTs. Underlying the dependency, while its bromodomain is dispensable, the DUF3512 domain of BRD9 is essential for SWI/SNF integrity in the absence of SMARCB1. Collectively, our results reveal a BRD9-containing SWI/SNF subcomplex is required for the survival of SMARCB1-mutant RTs.
Epigenetic regulation of gene expression has a crucial role allowing for the self-renewal and differentiation of stem and progenitor populations during organogenesis. The mammalian kidney maintains a population of self-renewing stem cells that differentiate to give rise to thousands of nephrons, which are the functional units that carry out filtration to maintain physiological homeostasis. The polycomb repressive complex 2 (PRC2) epigenetically represses gene expression during development by placing the H3K27me3 mark on histone H3 at promoter and enhancer sites, resulting in gene silencing. To understand the role of PRC2 in nephron differentiation, we conditionally inactivated the Eed gene, which encodes a nonredundant component of the PRC2 complex, in nephron progenitor cells. Resultant kidneys were smaller and showed premature loss of progenitor cells. The progenitors in Eedmutant mice that were induced to differentiate did not develop into properly formed nephrons. Lhx1, normally expressed in the renal vesicle, was overexpressed in kidneys of Eed mutant mice. Thus, PRC2 has a crucial role in suppressing the expression of genes that maintain the progenitor state, allowing nephron differentiation to proceed.
Regulatory decisions in Drosophila require Polycomb group (PcG) proteins to maintain the silent state and Trithorax group (TrxG) proteins to oppose silencing. Since PcG and TrxG are ubiquitous and lack apparent sequence specificity, a long-standing model is that targeting occurs via protein interactions; for instance, between repressors and PcG proteins. Instead, we found that Pc-repressive complex 1 (PRC1) purifies with coactivators Fs(1)h [female sterile (1) homeotic] and Enok/Br140 during embryogenesis. Fs(1)h is a TrxG member and the ortholog of BRD4, a bromodomain protein that binds to acetylated histones and is a key transcriptional coactivator in mammals. Enok and Br140, another bromodomain protein, are orthologous to subunits of a mammalian MOZ/MORF acetyltransferase complex. Here we confirm PRC1-Br140 and PRC1-Fs(1)h interactions and identify their genomic binding sites. PRC1-Br140 bind developmental genes in fly embryos, with analogous co-occupancy of PRC1 and a Br140 ortholog, BRD1, at bivalent loci in human embryonic stem (ES) cells. We propose that identification of PRC1-Br140 "bivalent complexes" in fly embryos supports and extends the bivalency model posited in mammalian cells, in which the coexistence of H3K4me3 and H3K27me3 at developmental promoters represents a poised transcriptional state. We further speculate that local competition between acetylation and deacetylation may play a critical role in the resolution of bivalent protein complexes during development.
Blastocyst-derived embryonic stem cells (ESCs) and gonad-derived embryonic germ cells (EGCs) represent two classic types of pluripotent cell lines, yet their molecular equivalence remains incompletely understood. Here, we compare genome-wide methylation patterns between isogenic ESC and EGC lines to define epigenetic similarities and differences. Surprisingly, we find that sex rather than cell type drives methylation patterns in ESCs and EGCs. Cell fusion experiments further reveal that the ratio of X chromosomes to autosomes dictates methylation levels, with female hybrids being hypomethylated and male hybrids being hypermethylated. We show that the X-linked MAPK phosphatase DUSP9 is upregulated in female compared to male ESCs, and its heterozygous loss in female ESCs leads to male-like methylation levels. However, male and female blastocysts are similarly hypomethylated, indicating that sex-specific methylation differences arise in culture. Collectively, our data demonstrate the epigenetic similarity of sex-matched ESCs and EGCs and identify DUSP9 as a regulator of female-specific hypomethylation.
Genes encoding subunits of SWI/SNF (BAF) chromatin remodelling complexes are collectively altered in over 20% of human malignancies, but the mechanisms by which these complexes alter chromatin to modulate transcription and cell fate are poorly understood. Utilizing mouse embryonic fibroblast and cancer cell line models, here we show via ChIP-seq and biochemical assays that SWI/SNF complexes are preferentially targeted to distal lineage specific enhancers and interact with p300 to modulate histone H3 lysine 27 acetylation. We identify a greater requirement for SWI/SNF at typical enhancers than at most super-enhancers and at enhancers in untranscribed regions than in transcribed regions. Our data further demonstrate that SWI/SNF-dependent distal enhancers are essential for controlling expression of genes linked to developmental processes. Our findings thus establish SWI/SNF complexes as regulators of the enhancer landscape and provide insight into the roles of SWI/SNF in cellular fate control.
SMARCB1 (also known as SNF5, INI1, and BAF47), a core subunit of the SWI/SNF (BAF) chromatin-remodeling complex, is inactivated in nearly all pediatric rhabdoid tumors. These aggressive cancers are among the most genomically stable, suggesting an epigenetic mechanism by which SMARCB1 loss drives transformation. Here we show that, despite having indistinguishable mutational landscapes, human rhabdoid tumors exhibit distinct enhancer H3K27ac signatures, which identify remnants of differentiation programs. We show that SMARCB1 is required for the integrity of SWI/SNF complexes and that its loss alters enhancer targeting-markedly impairing SWI/SNF binding to typical enhancers, particularly those required for differentiation, while maintaining SWI/SNF binding at super-enhancers. We show that these retained super-enhancers are essential for rhabdoid tumor survival, including some that are shared by all subtypes, such as SPRY1, and other lineage-specific super-enhancers, such as SOX2 in brain-derived rhabdoid tumors. Taken together, our findings identify a new chromatin-based epigenetic mechanism underlying the tumor-suppressive activity of SMARCB1.
Genes encoding subunits of SWI/SNF (BAF) chromatin-remodeling complexes are collectively mutated in ∼20% of all human cancers. Although ARID1A is the most frequent target of mutations, the mechanism by which its inactivation promotes tumorigenesis is unclear. Here we demonstrate that Arid1a functions as a tumor suppressor in the mouse colon, but not the small intestine, and that invasive ARID1A-deficient adenocarcinomas resemble human colorectal cancer (CRC). These tumors lack deregulation of APC/β-catenin signaling components, which are crucial gatekeepers in common forms of intestinal cancer. We find that ARID1A normally targets SWI/SNF complexes to enhancers, where they function in coordination with transcription factors to facilitate gene activation. ARID1B preserves SWI/SNF function in ARID1A-deficient cells, but defects in SWI/SNF targeting and control of enhancer activity cause extensive dysregulation of gene expression. These findings represent an advance in colon cancer modeling and implicate enhancer-mediated gene regulation as a principal tumor-suppressor function of ARID1A.
BACKGROUND: For many genes, RNA polymerase II stably pauses before transitioning to productive elongation. Although polymerase II pausing has been shown to be a mechanism for regulating transcriptional activation, the extent to which it is involved in control of mammalian gene expression and its relationship to chromatin structure remain poorly understood. RESULTS: Here, we analyze 85 RNA polymerase II chromatin immunoprecipitation (ChIP)-sequencing experiments from 35 different murine and human samples, as well as related genome-wide datasets, to gain new insights into the relationship between polymerase II pausing and gene regulation. Across cell and tissue types, paused genes (pausing index > 2) comprise approximately 60 % of expressed genes and are repeatedly associated with specific biological functions. Paused genes also have lower cell-to-cell expression variability. Increased pausing has a non-linear effect on gene expression levels, with moderately paused genes being expressed more highly than other paused genes. The highest gene expression levels are often achieved through a novel pause-release mechanism driven by high polymerase II initiation. In three datasets examining the impact of extracellular signals, genes responsive to stimulus have slightly lower pausing index on average than non-responsive genes, and rapid gene activation is linked to conditional pause-release. Both chromatin structure and local sequence composition near the transcription start site influence pausing, with divergent features between mammals and Drosophila. Most notably, in mammals pausing is positively correlated with histone H2A.Z occupancy at promoters. CONCLUSIONS: Our results provide new insights into the contribution of RNA polymerase II pausing in mammalian gene regulation and chromatin structure.
Chromatin accessibility plays a fundamental role in gene regulation. Nucleosome placement, usually measured by quantifying protection of DNA from enzymatic digestion, can regulate accessibility. We introduce a metric that uses micrococcal nuclease (MNase) digestion in a novel manner to measure chromatin accessibility by combining information from several digests of increasing depths. This metric, MACC (MNase accessibility), quantifies the inherent heterogeneity of nucleosome accessibility in which some nucleosomes are seen preferentially at high MNase and some at low MNase. MACC interrogates each genomic locus, measuring both nucleosome location and accessibility in the same assay. MACC can be performed either with or without a histone immunoprecipitation step, and thereby compares histone and non-histone protection. We find that changes in accessibility at enhancers, promoters and other regulatory regions do not correlate with changes in nucleosome occupancy. Moreover, high nucleosome occupancy does not necessarily preclude high accessibility, which reveals novel principles of chromatin regulation.
The Polycomb group (PcG) proteins are key conserved regulators of development, initially discovered in Drosophila and now strongly implicated in human disease. Nevertheless, differing silencing properties between the Drosophila and mammalian PcG systems have been observed. While specific DNA targeting sites for PcG proteins called Polycomb response elements (PREs) have been identified only in Drosophila, involvement of non-coding RNAs for PcG targeting has been favored in mammals. Another difference lies in the distribution patterns of PcG proteins. In mouse and human cells, PcG proteins show broad distributions, significantly overlapping with H3K27me3 domains. In contrast, only sharp peaks on PRE regions are observed for most PcG proteins in Drosophila, raising the question of how large domains of H3K27me3, up to many tens of kilobases, are formed and maintained in Drosophila. In this Extra View, we provide evidence that PcG distributions on silent chromatin in Drosophila are considerably broader than previously detected. Using BioTAP-XL, a chromatin crosslinking and tandem affinity purification approach, we find a broad, rather than PRE-limited overlap of PcG proteins with H3K27me3, suggesting a conserved spreading mechanism for PcG in flies and mammals.
Cheloufi S, Elling U, Hopfgartner B, Jung YL, Murn J, Ninova M, Hubmann M, Badeaux AI, Euong Ang C, Tenen D, Wesche DJ, Abazova N, Hogue M, Tasdemir N, Brumbaugh J, Rathert P, Jude J, Ferrari F, Blanco A, Fellner M, Wenzel D, Zinner M, Vidal SE, Bell O, Stadtfeld M, Chang HY, Almouzni G, Lowe SW, Rinn J, Wernig M, Aravin A, Shi Y, Park PJ, Penninger JM, Zuber J, Hochedlinger K. The histone chaperone CAF-1 safeguards somatic cell identity. Nature 2015;528(7581):218-24.Abstract
Cellular differentiation involves profound remodelling of chromatic landscapes, yet the mechanisms by which somatic cell identity is subsequently maintained remain incompletely understood. To further elucidate regulatory pathways that safeguard the somatic state, we performed two comprehensive RNA interference (RNAi) screens targeting chromatin factors during transcription-factor-mediated reprogramming of mouse fibroblasts to induced pluripotent stem cells (iPS cells). Subunits of the chromatin assembly factor-1 (CAF-1) complex, including Chaf1a and Chaf1b, emerged as the most prominent hits from both screens, followed by modulators of lysine sumoylation and heterochromatin maintenance. Optimal modulation of both CAF-1 and transcription factor levels increased reprogramming efficiency by several orders of magnitude and facilitated iPS cell formation in as little as 4 days. Mechanistically, CAF-1 suppression led to a more accessible chromatin structure at enhancer elements early during reprogramming. These changes were accompanied by a decrease in somatic heterochromatin domains, increased binding of Sox2 to pluripotency-specific targets and activation of associated genes. Notably, suppression of CAF-1 also enhanced the direct conversion of B cells into macrophages and fibroblasts into neurons. Together, our findings reveal the histone chaperone CAF-1 to be a novel regulator of somatic cell identity during transcription-factor-induced cell-fate transitions and provide a potential strategy to modulate cellular plasticity in a regenerative setting.
The equivalence of human induced pluripotent stem cells (hiPSCs) and human embryonic stem cells (hESCs) remains controversial. Here we use genetically matched hESC and hiPSC lines to assess the contribution of cellular origin (hESC vs. hiPSC), the Sendai virus (SeV) reprogramming method and genetic background to transcriptional and DNA methylation patterns while controlling for cell line clonality and sex. We find that transcriptional and epigenetic variation originating from genetic background dominates over variation due to cellular origin or SeV infection. Moreover, the 49 differentially expressed genes we detect between genetically matched hESCs and hiPSCs neither predict functional outcome nor distinguish an independently derived, larger set of unmatched hESC and hiPSC lines. We conclude that hESCs and hiPSCs are molecularly and functionally equivalent and cannot be distinguished by a consistent gene expression signature. Our data further imply that genetic background variation is a major confounding factor for transcriptional and epigenetic comparisons of pluripotent cell lines, explaining some of the previously observed differences between genetically unmatched hESCs and hiPSCs.
The Polycomb group (PcG) proteins are key regulators of development in Drosophila and are strongly implicated in human health and disease. How PcG complexes form repressive chromatin domains remains unclear. Using cross-linked affinity purifications of BioTAP-Polycomb (Pc) or BioTAP-Enhancer of zeste [E(z)], we captured all PcG-repressive complex 1 (PRC1) or PRC2 core components and Sex comb on midleg (Scm) as the only protein strongly enriched with both complexes. Although previously not linked to PRC2, we confirmed direct binding of Scm and PRC2 using recombinant protein expression and colocalization of Scm with PRC1, PRC2, and H3K27me3 in embryos and cultured cells using ChIP-seq (chromatin immunoprecipitation [ChIP] combined with deep sequencing). Furthermore, we found that RNAi knockdown of Scm and overexpression of the dominant-negative Scm-SAM (sterile α motif) domain both affected the binding pattern of E(z) on polytene chromosomes. Aberrant localization of the Scm-SAM domain in long contiguous regions on polytene chromosomes revealed its independent ability to spread on chromatin, consistent with its previously described ability to oligomerize in vitro. Pull-downs of BioTAP-Scm captured PRC1 and PRC2 and additional repressive complexes, including PhoRC, LINT, and CtBP. We propose that Scm is a key mediator connecting PRC1, PRC2, and transcriptional silencing. Combined with previous structural and genetic analyses, our results strongly suggest that Scm coordinates PcG complexes and polymerizes to produce broad domains of PcG silencing.
The transcription factor Wilms' tumor suppressor 1 (WT1) is key to podocyte development and viability; however, WT1 transcriptional networks in podocytes remain elusive. We provide a comprehensive analysis of the genome-wide WT1 transcriptional network in podocytes in vivo using chromatin immunoprecipitation followed by sequencing (ChIPseq) and RNA sequencing techniques. Our data show a specific role for WT1 in regulating the podocyte-specific transcriptome through binding to both promoters and enhancers of target genes. Furthermore, we inferred a podocyte transcription factor network consisting of WT1, LMX1B, TCF21, Fox-class and TEAD family transcription factors, and MAFB that uses tissue-specific enhancers to control podocyte gene expression. In addition to previously described WT1-dependent target genes, ChIPseq identified novel WT1-dependent signaling systems. These targets included components of the Hippo signaling system, underscoring the power of genome-wide transcriptional-network analyses. Together, our data elucidate a comprehensive gene regulatory network in podocytes suggesting that WT1 gene regulatory function and podocyte cell-type specification can best be understood in the context of transcription factor-regulatory element network interplay.
MOTIVATION: Genome-wide mapping of chromatin states is essential for defining regulatory elements and inferring their activities in eukaryotic genomes. A number of hidden Markov model (HMM)-based methods have been developed to infer chromatin state maps from genome-wide histone modification data for an individual genome. To perform a principled comparison of evolutionarily distant epigenomes, we must consider species-specific biases such as differences in genome size, strength of signal enrichment and co-occurrence patterns of histone modifications. RESULTS: Here, we present a new Bayesian non-parametric method called hierarchically linked infinite HMM (hiHMM) to jointly infer chromatin state maps in multiple genomes (different species, cell types and developmental stages) using genome-wide histone modification data. This flexible framework provides a new way to learn a consistent definition of chromatin states across multiple genomes, thus facilitating a direct comparison among them. We demonstrate the utility of this method using synthetic data as well as multiple modENCODE ChIP-seq datasets. CONCLUSION: The hierarchical and Bayesian non-parametric formulation in our approach is an important extension to the current set of methodologies for comparative chromatin landscape analysis. AVAILABILITY AND IMPLEMENTATION: Source codes are available at https://github.com/kasohn/hiHMM. Chromatin data are available at http://encode-x.med.harvard.edu/data_sets/chromatin/.
The CAG repeat expansion in the Huntington's disease gene HTT extends a polyglutamine tract in mutant huntingtin that enhances its ability to facilitate polycomb repressive complex 2 (PRC2). To gain insight into this dominant gain of function, we mapped histone modifications genome-wide across an isogenic panel of mouse embryonic stem cell (ESC) and neuronal progenitor cell (NPC) lines, comparing the effects of Htt null and different size Htt CAG mutations. We found that Htt is required in ESC for the proper deposition of histone H3K27me3 at a subset of 'bivalent' loci but in NPC it is needed at 'bivalent' loci for both the proper maintenance and the appropriate removal of this mark. In contrast, Htt CAG size, though changing histone H3K27me3, is prominently associated with altered histone H3K4me3 at 'active' loci. The sets of ESC and NPC genes with altered histone marks delineated by the lack of huntingtin or the presence of mutant huntingtin, though distinct, are enriched in similar pathways with apoptosis specifically highlighted for the CAG mutation. Thus, the manner by which huntingtin function facilitates PRC2 may afford mutant huntingtin with multiple opportunities to impinge upon the broader machinery that orchestrates developmentally appropriate chromatin status.
De Los Angeles A*, Ferrari F*, Fujiwara Y, Mathieu R, Lee S, Lee S, Tu H-C, Ross S, Chou S, Nguyen M, Wu Z, Theunissen TW, Powell BE, Imsoonthornruksa S, Chen J, Borkent M, Krupalnik V, Lujan E, Wernig M, Hanna JH, Hochedlinger K, Pei D, Jaenisch R, Deng H, Orkin SH, Park PJ**, Daley GQ**. Failure to replicate the STAP cell phenomenon. Nature 2015;525(7570):E6-9.