UTX (also known as KDM6A) encodes a histone H3K27 demethylase and is an important tumour suppressor that is frequently mutated in human cancers1. However, as the demethylase activity of UTX is often dispensable for mediating tumour suppression and developmental regulation2,3,4,5,6,7,8, the underlying molecular activity of UTX remains unknown. Here we show that phase separation of UTX underlies its chromatin-regulatory activity in tumour suppression. A core intrinsically disordered region (cIDR) of UTX forms phase-separated liquid condensates, and cIDR loss caused by the most frequent cancer mutation of UTX is mainly responsible for abolishing tumour suppression. Deletion, mutagenesis and replacement assays of the intrinsically disordered region demonstrate a critical role of UTX condensation in tumour suppression and embryonic stem cell differentiation. As shown by reconstitution in vitro and engineered systems in cells, UTX recruits the histone methyltransferase MLL4 (also known as KMT2D) to the same condensates and enriches the H3K4 methylation activity of MLL4. Moreover, UTX regulates genome-wide histone modifications and high-order chromatin interactions in a condensation-dependent manner. We also found that UTY, the Y chromosome homologue of UTX with weaker tumour-suppressive activity, forms condensates with reduced molecular dynamics. These studies demonstrate a crucial biological function of liquid condensates with proper material states in enabling the tumour-suppressive activity of a chromatin regulator.
This is a preview of subscription content
Subscription info for Chinese customers
We have a dedicated website for our Chinese customers. Please go to naturechina.com to subscribe to this journal.
Rent or Buy article
Get time limited or full article access on ReadCube.
All prices are NET prices.
The high-throughput sequencing data, including RNA-seq, ChIP-seq, PRO-seq and HiChIP, have been deposited in the Gene Expression Omnibus with the accession number GSE149420. Cancer mutation data are available from cBioPortal (http://www.cbioportal.org/) and the TCGA database. Source data are provided with this paper.
The scripts used to analyse the data from this study are freely available at: https://github.com/zanglab/utx_code.
Wang, L. & Shilatifard, A. UTX mutations in human cancer. Cancer Cell 35, 168–176 (2019).
Gozdecka, M. et al. UTX-mediated enhancer and chromatin remodeling suppresses myeloid leukemogenesis through noncatalytic inverse regulation of ETS and GATA programs. Nat. Genet. 50, 883–894 (2018).
Andricovich, J. et al. Loss of KDM6A activates super-enhancers to induce gender-specific squamous-like pancreatic cancer and confers sensitivity to BET inhibitors. Cancer Cell 33, 512–526.e8 (2018).
Morales Torres, C., Laugesen, A. & Helin, K. Utx is required for proper induction of ectoderm and mesoderm during differentiation of embryonic stem cells. PLoS ONE 8, e60020 (2013).
Wang, C. et al. UTX regulates mesoderm differentiation of embryonic stem cells independent of H3K27 demethylase activity. Proc. Natl Acad. Sci. USA 109, 15324–15329 (2012).
Shpargel, K. B., Sengoku, T., Yokoyama, S. & Magnuson, T. UTX and UTY demonstrate histone demethylase-independent function in mouse embryonic development. PLoS Genet. 8, e1002964 (2012).
Shpargel, K. B., Starmer, J., Wang, C., Ge, K. & Magnuson, T. UTX-guided neural crest function underlies craniofacial features of Kabuki syndrome. Proc. Natl. Acad. Sci. USA 114, E9046–E9055 (2017).
Miller, S. A., Mohn, S. E. & Weinmann, A. S. Jmjd3 and UTX play a demethylase-independent role in chromatin remodeling to regulate T-box family member-dependent gene expression. Mol. Cell 40, 594–605 (2010).
Banani, S. F., Lee, H. O., Hyman, A. A. & Rosen, M. K. Biomolecular condensates: organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol. 18, 285–298 (2017).
Li, W. et al. Biophysical properties of AKAP95 protein condensates regulate splicing and tumorigenesis. Nat. Cell Biol. 22, 960–972 (2020).
Lin, Y., Protter, D. S., Rosen, M. K. & Parker, R. Formation and maturation of phase-separated liquid droplets by RNA-binding proteins. Mol. Cell 60, 208–219 (2015).
Alam, H. et al. KMT2D deficiency impairs super-enhancers to confer a glycolytic vulnerability in lung cancer. Cancer Cell 37, 599–617 (2020).
Mansour, A. A. et al. The H3K27 demethylase Utx regulates somatic and germ cell epigenetic reprogramming. Nature 488, 409–413 (2012).
Tran, N., Broun, A. & Ge, K. Lysine demethylase KDM6A in differentiation, development, and cancer. Mol. Cell. Biol. 40, e00341-20 (2020).
Shin, Y. et al. Spatiotemporal control of intracellular phase transitions using light-activated optoDroplets. Cell 168, 159–171.e14 (2017).
Kim, J. H. et al. UTX and MLL4 coordinately regulate transcriptional programs for cell proliferation and invasiveness in breast cancer cells. Cancer Res. 74, 1705–1717 (2014).
Janicki, S. M. et al. From silencing to gene expression: real-time analysis in single cells. Cell 116, 683–698 (2004).
Wang, S. P. et al. A UTX–MLL4–p300 transcriptional regulatory network coordinately shapes active enhancer landscapes for eliciting transcription. Mol. Cell 67, 308–321.e6 (2017).
Tie, F., Banerjee, R., Conrad, P. A., Scacheri, P. C. & Harte, P. J. Histone demethylase UTX and chromatin remodeler BRM bind directly to CBP and modulate acetylation of histone H3 lysine 27. Mol. Cell. Biol. 32, 2323–2334 (2012).
Mumbach, M. R. et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods 13, 919–922 (2016).
Fang, R. et al. Mapping of long-range chromatin interactions by proximity ligation-assisted ChIP-seq. Cell Res. 26, 1345–1348 (2016).
Wang, Z., Zhang, Y. & Zang, C. BART3D: inferring transcriptional regulators associated with differential chromatin interactions from Hi-C data. Bioinformatics https://doi.org/10.1093/bioinformatics/btab173 (2021).
Dunford, A. et al. Tumor-suppressor genes that escape from X-inactivation contribute to cancer sex bias. Nat. Genet. 49, 10–16 (2017).
Gazova, I., Lengeling, A. & Summers, K. M. Lysine demethylases KDM6A and UTY: the X and Y of histone demethylation. Mol. Genet. Metabol. 127, 31–44 (2019).
Li, X. et al. UTX is an escape from X-inactivation tumor-suppressor in B cell lymphoma. Nat. Commun. 9, 2720 (2018).
Sze, C. C. & Shilatifard, A. MLL3/MLL4/COMPASS family on epigenetic regulation of enhancer function and cancer. Cold Spring Harb. Perspect. Med. 6, a026427 (2016).
Herz, H. M. et al. The H3K27me3 demethylase dUTX is a suppressor of Notch- and Rb-dependent tumors in Drosophila. Mol. Cell. Biol. 30, 2485–2497 (2010).
Ma, L. et al. Co-condensation between transcription factor and coactivator p300 modulates transcriptional bursting kinetics. Mol. Cell 81, 1682–1697.e7 (2021).
Fasciani, A. et al. MLL4-associated condensates counterbalance Polycomb-mediated nuclear mechanical stress in Kabuki syndrome. Nat. Genet. 52, 1397–1411 (2020).
Boija, A. et al. Transcription factors activate genes through the phase-separation capacity of their activation domains. Cell 175, 1842–1855.e16 (2018).
Benyoucef, A. et al. UTX inhibition as selective epigenetic therapy against TAL1-driven T-cell acute lymphoblastic leukemia. Genes Dev. 30, 508–521 (2016).
Faralli, H. et al. UTX demethylase activity is required for satellite cell-mediated muscle regeneration. J. Clin. Invest. 126, 1555–1565 (2016).
Beyaz, S. et al. The histone demethylase UTX regulates the lineage-specific epigenetic program of invariant natural killer T cells. Nat. Immunol. 18, 184–195 (2017).
Bogershausen, N. et al. Mutation update for Kabuki syndrome genes KMT2D and KDM6A and further delineation of X-linked Kabuki syndrome subtype 2. Hum. Mutat. 37, 847–864 (2016).
Yang, Z. et al. The DPY30 subunit in SET1/MLL complexes regulates the proliferation and differentiation of hematopoietic progenitor cells. Blood 124, 2025–2033 (2014).
Jiang, H. et al. Role for Dpy-30 in ES cell-fate specification by regulation of H3K4 methylation within bivalent domains. Cell 144, 513–525 (2011).
Schulz, W. A., Lang, A., Koch, J. & Greife, A. The histone demethylase UTX/KDM6A in cancer: progress and puzzles. Int. J. Cancer 145, 614–620 (2019).
Wang, J. et al. A molecular grammar governing the driving forces for phase separation of prion-like RNA binding proteins. Cell 174, 688–699.e16 (2018).
Niaki, A. G. et al. Loss of dynamic RNA interaction and aberrant phase separation induced by two distinct types of ALS/FTD-linked FUS mutations. Mol. Cell 77, 82–94.e4 (2020).
Holehouse, A. S., Das, R. K., Ahad, J. N., Richardson, M. O. & Pappu, R. V. CIDER: resources to analyze sequence-ensemble relationships of intrinsically disordered proteins. Biophys. J. 112, 16–21 (2017).
Dignam, J. D., Lebovitz, R. M. & Roeder, R. G. Accurate transcription initiation by RNA polymerase II in a soluble extract from isolated mammalian nuclei. Nucleic Acids Res. 11, 1475–1489 (1983).
Fujioka, A. et al. Dynamics of the Ras/ERK MAPK cascade as monitored by fluorescent probes. J. Biol. Chem. 281, 8917–8926 (2006).
[No authors listed] Illuminating the dark proteome. Cell 166, 1074–1077 (2016).
Rossow, M. J., Sasaki, J. M., Digman, M. A. & Gratton, E. Raster image correlation spectroscopy in live cells. Nat. Protoc. 5, 1761–1774 (2010).
Digman, M. A. & Gratton, E. Analysis of diffusion and binding in cells using the RICS approach. Microsc. Res. Tech. 72, 323–332 (2009).
Day, C. A., Kraft, L. J., Kang, M. & Kenworthy, A. K. Analysis of protein and lipid dynamics using confocal fluorescence recovery after photobleaching (FRAP). Curr. Protoc. Cytometry Ch. 2, Unit2.19 (2012).
Kwak, H., Fuda, N. J., Core, L. J. & Lis, J. T. Precise maps of RNA polymerase reveal how promoters direct initiation and pausing. Science 339, 950–953 (2013).
Judd, J. et al. A rapid, sensitive, scalable method for precision run-on sequencing (PRO-seq). Preprint at bioRxiv https://doi.org/10.1101/2020.05.18.102277 (2020).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinformatics 12, 323 (2011).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
Zhou, Y. et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10, 1523 (2019).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Zhang, Y. et al. Model-based analysis of ChIP-seq (MACS). Genome Biol. 9, R137 (2008).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Zang, C. et al. A clustering approach for identification of enriched domains from histone modification ChIP-seq data. Bioinformatics 25, 1952–1958 (2009).
Juric, I. et al. MAPS: model-based analysis of long-range chromatin interactions from PLAC-seq and HiChIP experiments. PLoS Comput. Biol. 15, e1006982 (2019).
Li, D., Hsu, S., Purushotham, D., Sears, R. L. & Wang, T. WashU Epigenome Browser update 2019. Nucleic Acids Res. 47, W158–W165 (2019).
Wang, S. et al. Modeling cis-regulation with a compendium of genome-wide histone H3K27ac profiles. Genome Res. 26, 1417–1429 (2016).
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
We thank K. Kelly for MiaPaca2 cells; J.-B. Shin for COS-7 cells; T. Stukenberg for LacO array-containing cells; M. Mayo for the luciferase plasmid; T. Stukenberg and C. Brangwynne for the Cry2-containing plasmid; B. Ren for advice on HiChIP assays; M. Guertin and K. M. Sathyan for advice on PRO-seq assays; K. Ge for advice on immunoblotting for MLL3 and MLL4 and providing MLL3 antibody; Y. Bao and UVa Genome Analysis and Technology Core for deep sequencing services; and K. Siller, G. A. Bautista, J. Huband, R. Sun and E. Hall at the University of Virginia Research Computing Center for technical support. H.J. was supported by NIH grant 1 R21 CA257936-01 and start-up funds from the University of Virginia. The confocal microscopy system at the Keck Center of University of Virginia was supported by grants from NIH (OD016446 and S10 OD025156). Sequencing in this research was supported by NCI Cancer Center Support Grant 5P30CA044579. H.J. is a recipient of the American Cancer Society Research Scholar Award (RSG-15-166-01-DMC) and the Leukemia & Lymphoma Society Scholar Award (1354-19). C.Z. was supported by NIH grant R35 GM133712. Z.W. was supported by the Farrow Fellowship and by the NCI Cancer Center Support Grant P30 CA44579. F.P. and M.A.D. were supported in part by a grant from the NSF (MCB-1615701). M.A.D. and E.G. were funded by NIH grant number P41-GM103540.
The authors declare no competing interests.
Peer review information Nature thanks Brian Huntly, Tanja Mittag and Panagiotis Ntziachristos for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data figures and tables
a. Relative expression of UTX 555* mutant in 10 cancer patient samples that harbor this mutation and for which the mRNA level data are available. From cBioportal. Each bar is a patient sample. z-score indicates the number of standard deviations away from the mean of expression in the reference. It shows that 3 samples showed increase in mRNA level, and 4 samples showed reduction within 1 standard deviation of expression in reference samples, suggesting that this nonsense mutation does not usually lead to transcript decay and should express a truncated UTX (1–554). b. Top, a schematic of the full-length UTX. Bottom, a plot for number of non-synonymous mutations (type indicated in legend) within each 100 amino acids-long region on UTX, from 394 patients in TCGA. The plot is aligned with the top schematic. c. Top, a schematic showing division of the full-length UTX into 5 regions that were individually purified as fusion with Maltose-binding protein (MBP, to enhance solubility). See Constructs in Methods for rationale for region division. Bottom, phase contrast images of 50 µM MBP-UTX fragments corresponding to each indicated UTX region in 30 mM NaCl after treatment with TEV protease for 1 hr to remove the MBP tag. d. Coomassie blue staining image of purified EGFP and EGFP-cIDR. e. Fluorescence (left) and DIC (right) microscopy images of 30 µM EGFP-cIDR at increasing concentrations of NaCl at indicated times after dilution in the condensation buffer. Note the increase of droplet size and the occasional appearance of irregular shape at 45 min, suggesting growth and aging phenomena that are typical for phase separation. f. Fluorescence (left) and DIC (right) images showing wetting of 30 µM EGFP-cIDR at the surface of the coverslip immediately after dilution. g. Coomassie blue staining image of purified EGFP-UTX (419-848). h. Fluorescence (up) and DIC (bottom) images of EGFP-UTX (419–848) at indicated concentrations. i. Coomassie blue staining image of purified mEGFP and mEGFP-UTX (419–848). j. Fluorescence (up) and DIC (bottom) images of mEGFP and mEGFP-UTX (419–848) at indicated concentrations and time points after dilution in condensation buffer. k. Coomassie blue staining image of purified mCherry and mCherry-UTX (419–848). Note that, throughout this manuscript, a band of unknown identity, indicated by the asterisk, always appeared at ~15kD smaller than the purified mCherry or mCherry-tagged protein of interest, regardless of the identity of the protein of interest. We thus speculate that it is either a degradation product of mCherry or an unrelated protein tightly associated with mCherry. l. Fluorescence (up) and DIC (bottom) images of 40 µM mCherry and mCherry-UTX (419–848) at indicated time points after dilution. m. Immunoblotting by indicated antibody of total lysates from untransfected control or 293T cells transfected with indicated constructs that are fused to EGFP. n. Quantification of nuclear UTX concentration by anti-UTX immunoblotting. Total lysates from indicated number of 293T and mouse ES cells were loaded, along with indicated ng of purified EGFP-UTX (1–848). We determined that the nuclear concentration of endogenous UTX to be ~153 nM in 293T cancer cells and ~383 nM in mouse ESCs. o. Transfected UTX forms nuclear foci at a concentration that is on the same order of magnitude as the endogenous level. Left, immunoblotting (top, by anti-UTX antibody) and Ponceau S staining (bottom) of total lysates from untransfected control or 293T cells transfected with a very low dose of plasmid for UTX-EGFP, with a transfection efficiency of ~30% as examined under microscope. Three different amounts (1:2 serial dilution) of the lysates and purified EGFP-UTX (1-848) were loaded as indicated. UTX-EGFP amount was roughly 2 times that of the endogenous UTX, and estimated to be < 1 µM in the transfected cells, assuming that the negative cell under microscope had little contribution to the UTX-EGFP immunoblotting signal. The fluorescence image of the corresponding cells is shown on the right.
Extended Data Fig. 2 UTX cIDR and the key residues in cIDR are important for forming condensates and its tumour suppressive activity.
a. Growth curves of THP-1 cells electroporated with indicated constructs, shown as mean ± SD. n = 3 independent experiments. P value is between MT2 and ΔcIDR at the last time point. b. Immunoblotting by indicated antibody of total lysates from THP-1 cells electroporated with empty vector or indicated UTX WT or mutants. c. Fluorescence images of THP-1 cells electroporated with indicated UTX constructs fused to EGFP. d. Analysis of amino acid enrichment for UTX cIDR. By Composition Profiler, using indicated background sample. e. Fluorescence images of 293T cells transfected with the corresponding UTX WT or mutants fused with EGFP. f. Immunoblotting by indicated antibody of total lysates from 293T cells transfected with empty vector or indicated UTX WT or mutants fused to EGFP. g. Immunoblotting by indicated antibody of total lysates from THP-1 cells electroporated with empty vector or indicated UTX WT or mutants. h. Representative fluorescence microscopy images of foci in FRAP assays for full-length UTX WT and HY fused to mEGFP in nuclei following transfection into 293T cells. The photobleached focus is indicated by the dashed box and its images at indicated time points are amplified. i–n all used 20 µM mEGFP, mEGFP-UTX (419–848) WT or HY as indicated. i. Pictures and OD600 plotted as mean ± SD. n = 3 independent experiments. j. Representative fluorescence microscopy and DIC images at indicated times in condensation. Scale bar, 10 µm. k. FRAP recovery curves as mean ± SD at indicated early and late times in condensation. n = 13 and 12 droplets each for WT and HY, respectively, for early times, and n = 8 and 12 droplets each for WT and HY, respectively, for late times. l–n. Line RICS assays for WT (n = 32 droplets) and HY (n = 12) over the time range of 0-100 min in condensation. l. Fluorescence Images and line carpets. m. Line RICS autocorrelation curves. n. Plot showing the mean ± SD of the diffusion coefficients. *P<0.05, ***P<0.001, and ****P<0.0001, by two-sided t-test for all.
Extended Data Fig. 3 Tumour suppressive activity of UTX can be maintained by replacing its cIDR with unrelated protein IDRs.
a. Immunoblotting by anti-UTX or GAPDH of total lysates from THP-1 cells electroporated with empty vector or indicated UTX WT or mutants. b. Immunoblotting by anti-UTX (top) or GAPDH (bottom) of total lysates from THP-1 or MiaPaca2 cells that were transduced with empty vector or indicated UTX WT or mutants and induced with doxycycline. c. Confocal images of 293T cells transfected with indicated UTX chimeric constructs all fused to EGFP. d. Growth curves of THP-1 cells electroporated with indicated vector or UTX constructs, shown as average ± SD. n = 3 independent experiments. e. Representative results of colony formation assay of THP-1 cells transduced with indicated UTX WT or mutant construct. Bottom left, the number of colonies are plotted as mean ± SD (n = 3 independent experiments). Bottom right, the size of colonies are plotted as mean ± SD. n = 100 colonies for each except 73 colonies for UTX-eIFIDR. f. Kaplan Meier Curves for survival of the animals grafted with THP-1 cells stably transduced with indicated vector or UTX constructs. P = 0.0024 for WT vs. ΔcIDR, and 0.015 for ΔcIDR vs. UTX-eIFIDR. n = 8 mice each, except n = 10 for Vector. g. Representative crystal violet staining images of MiaPaca2 cells transduced with indicated UTX WT or mutant construct. Two thousand cells were seeded and cultured for 7 days in the absence or presence of doxycycline (to induce transgene expression) before staining. Right, staining absorbance for cells with doxycycline are plotted as mean ± SD from n = 3 independent repeats. h. MiaPaca2 cells stably transduced with indicated vector or UTX constructs were grafted into immune-deficient mice. Tumours from day 20 were imaged and their weights are plotted as average ± SD. n = 5 mice each for Vector and UTX-FUS(IDR), or 6 mice each for the others. i. Immunoblotting of total lysates (Input) and anti-FLAG antibody-mediated immunoprecipitation (IP) from 293T cells transfected with empty vector, or FLAG-tagged UTX WT or ΔcIDR, using indicated antibodies. j–o. RNA-seq analysis for gene expression in the THP-1 cells transduced with indicated UTX constructs (two samples each). j. Volcano plot. Blue and red dots represent genes that were significantly (adj. P<0.05) up- or down-regulated (over 1.25 fold), respectively, by WT versus vector control. The adjusted P values on y axis were from DESeq2. The detailed gene information is in Supplementary Table 1. k. Gene ontology analysis for the indicated gene clusters from the heat map in Fig. 1j. Red plot shows the significantly enriched gene functions for the 300 genes in the red bracket in Fig. 1j, which were strongly induced by WT and UTX-eIFIDR, but not as strongly by ΔcIDR, and not affected by ΔTPR. Blue shows the significantly enriched gene functions for the 200 genes in the blue bracket in Fig. 1j, which were markedly repressed by WT and UTX-eIFIDR, but not ΔcIDR and ΔTPR. l. Principal component analysis of quantile normalized gene expression (log2 of TPM, pseudo-count=1) using 1,271 DEGs in WT versus vector control. m. Scatter plots for gene expression changes between indicated samples. Black dashed line indicates the linear fit with correlation coefficient (r) labeled. Red and blue dots are genes significantly up- or down-regulated (over 1.25 fold) in WT over vector, respectively, as in j. n. Box plots for gene expression changes between indicated samples for the up- and down-regulated genes (defined in m) in WT over vector. Data are median (horizontal line), 25–75th percentiles (box) and 1.5 times the interquartile range recorded (whiskers). P values are shown for data significantly greater or less than 0 by two-sided one sample t-test. Left, n = 675 genes each; right, n = 596 genes each. o. Heatmap showing relative expression levels of genes that were significantly (adj. P<0.05) up- or down-regulated (1.25 fold each) by WT compared to vector control in the THP-1 cells transduced with vector control, UTX WT or catalytically inactive (MT2) mutant (two samples each). Each row represents a gene. P values by one-way ANOVA followed by Tukey’s post hoc test for all analyses except by two-sided log-rank test for g, two-sided modified Fisher’s exact test for k, and two-sided one sample t-test for n.
a. Schematic of FLAG-mEGFP Knock-in (KI) at Utx in ESCs. b. Genomic PCR products by indicated primers for different clones. The predicted sizes are 961 bp (by F1+R1) and 954 bp (by F2+R2) for the KI clones. c. Immunoblotting for KI clones by green circles (neighbouring lanes are clones of mixed population). d. Schematic for generating a Utx KO ESC clone. e. Sequencing results showing the 4 bp loss in Utx in the KO clone. f. Immunoblotting for the Utx KO clone. g. Schematic for generating Utx ΔcIDR ESC clones. These clones were generated by cIDR deletion followed with selection for in-frame rejoined Utx. h. Genomic PCR products for different clones. The bottom right gel used the primer pair in yellow shown in g, with predicted size of 205 bp for ΔcIDR. The other gels used the primer pair in black in g, with predicted size of 1040 bp for ΔcIDR. i. Sequencing of the indicated ΔcIDR clones. The in-frame deleted genomic regions are shown in the square brackets. The deletion of amino acids additional to the cIDR was due to the design of the gRNAs to ensure efficient recognition of CRISPR/Cas9. These residues are all in the disordered region. A phenylalanine was also added (in red) by the repair process in clone #72. j. Immunoblotting for the Utx ΔcIDR clones. k. Schematic of FLAG-mEGFP KI at Utx in the Utx ΔcIDR ESC clones. l. Genomic PCR products for different clones using the F2 and R2 primer pair shown in a. m. Immunoblotting for mEGFP-tagged WT and UtxΔ cIDR clones. n. Representative confocal microscopic images of ESCs that did not have mEGFP KI (Negative) or had mEGFP KI in the Utx WT or ΔcIDR clones. Two different samples of ΔcIDR clone #25 that had relatively high and low mEGFP signal levels are here to show that the signal levels did not affect the fuzziness or how dispersed UTX was. o. Granularity of the mEGFP signal was plotted as mean ± SD. n = 28 Utx WT and 38 ΔcIDR cells, each containing two different clones shown in panel n. Granularity, or mEGFP signal fluctuation, was calculated as SD of the relative signal intensity of a line plot (normalized to the mean signal intensity of the entire line) at each pixel on the line through an individual cell. P value by two-sided t-test. The line plot was generated by ImageJ, and an example is shown as the yellow line on a mEGFP-ΔcIDR clone in n. p. Venn diagrams for genes that, compared to the corresponding WT embryoid bodies, were downregulated (left) or upregulated (right) by KO (cyan) or ΔcIDR (salmon, two clones) in late embryoid bodies (day 8 or 9 in differentiation). Fold changes are indicated. From RNA-seq of two 2 independent differentiation assays. P values by two-sided Fisher’s exact test. q. Significantly enriched Biological Processes in embryonic development for genes downregulated (↓) or upregulated (↑) over 2 fold by Utx KO or ΔcIDR in embryoid bodies late in differentiation, compared to the corresponding WT embryoid bodies. By Metascape. P values by two-sided hypergeometric test.
Extended Data Fig. 5 UTX co-condenses with and enhance chromatin recruitment of MLL4 and p300 in cells.
a. A schematic showing the domain compositions of MLL4, MLL4f, and MLL4C(N). b. Left, a schematic showing the optoDroplet system used for the right, which are confocal images of 293T cells expressing MLL4C(N)-mCherry-Cry2, before and after light activation. c. Left, a schematic showing the optoDroplet system used for the right, which are confocal images of 293T cells expressing UTX-mCherry-Cry2, before and after light activation. d. Left, a schematic showing the optoDroplet system used for the right, which are confocal images of 293T cells expressing both UTX (WT or indicated mutant)-mCherry-Cry2 and MLL4C(N)-EGFP, before and after light activation. e. Left, a schematic showing the LacO-array system. Middle, fluorescence microscopy images of LacO-cells transfected with EGFP-LacI (control) or UTX (WT or mutant)-EGFP-LacI, each together with p300(251-550)-mCherry. Right, recruitment coefficients of p300(251-550) relative to the LacO-bound control or UTX (WT or mutant) are determined as described in Methods and plotted as average ± SD. n = 11, 19, 17, 24 cells for control, WT, ΔcIDR, and ΔTPR, respectively. P values by one-way ANOVA followed by Tukey’s post hoc test.
Extended Data Fig. 6 Purified UTX and MLL4 or p300 regions form co-condensates through specific interaction mediated by TPR.
a. Coomassie blue staining image of purified EGFP-UTX (419-848), EGFP-UTX (1-848), and mCherry-MLL4C(N). b. Fluorescence images of purified proteins individually diluted to indicated concentrations. c. Fluorescence images of purified proteins mixed and diluted to the final concentration as indicated, equal to the concentrations in b. d. Fluorescence images of purified proteins individually diluted to indicated concentrations. e. Fluorescence images of purified proteins mixed and diluted to the final concentration equal to that of the corresponding protein in d. The exposure time for GFP in the top row was longer than those in the other two rows to adjust the GFP brightness due to the much lower concentration of the GFP-fused proteins in the top row. f. Recruitment coefficient for MLL4C(N) was determined as the relative ratio of the mCherry signal amount inside the GFP condensates over that inside the mCherry condensates, and plotted as mean ± SD. n = 3 independent fields each. Note that each field had ~100 total droplets. **P<0.01 by one-way ANOVA followed by Tukey’s post hoc test. g. Domain schematic of p300. P300 (251-550) containing ZF1 domain is labeled. h. Fluorescence images of purified proteins mixed to the indicated final concentrations.
a. Schematic of the procedures to isolate the UTX (WT or ΔcIDR) and MLL4f proteins for in vitro histone methylation (HMT) and imaging analyses. Note that all HMT assays involved endogenous MLL3/4 associated with isolated UTX. b. Immunoblotting for UTX in the UTX (WT or ΔcIDR) pulldowns, along with indicated ng of purified EGFP-UTX (1-848). This allowed an estimate of the concentration of WT-GFP or ΔcIDR-GFP to be ~40 nM in the pulldowns. c. Fluorescence images of the UTX (WT or ΔcIDR) pulldowns in the condensation buffer, with ~25 nM UTX protein and 150 mM NaCl. d. Immunoblotting for the indicated proteins in the isolated UTX (WT or ΔcIDR) complexes. e. Representative results of in vitro HMT assays using the UTX (WT or ΔcIDR)-GFP pulldown. UTX and indicated histone modifications at indicated time points of HMT assay were determined by Immunoblotting. Total Histones are shown by Ponceau S staining. Right, Relative H3K4me1 increase (from time 0) is plotted as mean from 2 independent experiments. f. Representative results of in vitro HMT assays using the UTX (WT or 555*) pulldown. Top, immunoblotting by indicated antibodies and Ponceaus S staining for total histones in HMT assay. Bottom, H3K4me1 activity was determined as relative H3K4me1 increase from preexisting methylation (in control) and plotted as mean ± SD. n = 3 independent experiments. P values by two-sided t-test. g. Immunoblotting using anti-FLAG antibody for the isolated UTX (WT or ΔcIDR) complexes, MLL4f-mCherry, or their mixture as indicated. UTX (WT or ΔcIDR) and MLL4f all had FLAG tag. h. Fluorescence microscopy images of indicated protein complex, either as individual protein (top row) or mixed as indicated (bottom two rows) in the condensation buffer. The concentration of each protein in the mixture was the same as that of the corresponding individual protein shown on top row. i. Immunostaining for Myc (for Myc-tagged UTX), H3K27me3, and DAPI in COS-7 cells transfected with indicated UTX WT or mutant construct. Cells with UTX overexpression (indicated by arrows) were scored for the signal intensity of H3K27me3 and quantified in j. j. Quantification of the cells showing Decrease (↓), no change (-), or increase (↑) of H3K27me3. Shown are the percentage of cells showing the indicated changes in one out of a total of five biological repeats. In parenthesis is the number of cells showing the changes/total number of cells scored. In each repeat, around 100 cells were scored for each sample.
Extended Data Fig. 8 UTX condensates regulate genomic histone modifications and enhancer transcription.
a. A Venn diagram for genomic sites bound by UTX WT and ΔcIDR. The overlap number refers to the number of WT peaks that overlapped with ΔcIDR peaks. b. Genomic features of all 15,554 UTX WT-bound sites. The number of sites for each feature is shown in the parenthesis. c. Number and percentage of the genomic sites bound by either UTX WT (normal) or specifically by ΔcIDR (aberrant, not bound by WT) that overlap (within indicated distance) with any of the activating histone marks (H3K4me1, 2, 3, and H3K27ac) merged across all cell samples. d. Heatmap (left) and composite plot (right) for all 15,554 UTX WT-binding peaks in indicated cells. Heatmap rows were ranked by UTX peak levels (MACS2 peaks q-value, high to low). UTX ChIP-seq signals per 20bp non-overlapped bins covering 4kb regions centered at UTX peaks were shown. Details in Supplementary Table 3. e, f. Heatmap (left) and composite plot (right) for H3K4me1 (e) or H3K27ac (f) peaks in indicated cells at the UTX-binding sites (number indicated) with higher H3K4me1 (e) or H3K27ac (f) intensity in WT than vector expressing cells. Heatmap rows were ranked by UTX peak significance (MACS2 q-value, high to low). ChIP-seq signals from indicated histone modification per 20bp non-overlapped bins covering 4kb regions centered at UTX peaks were shown. g–j. Left, scatter plot using the indicated H3K4me3 (g), H3K4me2 (h), H3K27me3 (i), and MLL4 binding (j) signal level change in cells indicated on the x and y axes, for all 15,554 UTX WT-bound sites. Black dashed line indicates the linear fit with correlation coefficient (r) labeled. Right, composite plot for indicated modification or binding signals centered at UTX peaks in indicated cells. k. Scatter plots using the indicated histone modification signal level change in cells indicated on the x and y axes, for all 15,554 UTX WT-bound sites. Black dashed line indicates the linear fit with correlation coefficient (r) labeled. l. Top, scatter plot for PRO-seq signal change at all 4,806 UTX WT-bound enhancers in indicated cells. Black dashed line indicates the linear fit with correlation coefficient (r) labeled. Bottom, composite plot for the PRO-seq signals centered at PRO-seq peaks in indicated cells. m. Heatmap (left) and composite plot (right) for PRO-seq signals at UTX WT-bound enhancers that had increased (top row) or decreased (bottom row) PRO-seq signals over 1.2 fold in WT compared to vector expressing cells, in indicated cells. PRO-seq signals centered at PRO-seq peaks were shown.
a. Distribution of types of genomic interactions. P, promoter (TSS ± 2kb); E, enhancer (promoter non-overlapped, union DNase I hypersensitive sites); Desert: all other interactions, <1% in all. b. Distribution of the chromatin loop size in indicated cells. A loop is the connection of two different 5 kb chromatin regions by high-confidence contact reads. c. Number of chromatin loops identified in indicated cells. For simplicity, Venn diagrams for only Vector, WT, and ΔcIDR cells are shown. d. Box plots for the number of normalized HiChIP reads per loop for identified loops in indicated cells. The normalized read counts in a loop represents the relative intensity of the interaction. For H3K4me3 HiChIP (left), n = 57632, 67755, 88993, 95696, 90566 loops in Vector, WT, ΔcIDR, UTX-eIFIDR, ΔcTPR samples, respectively. For H3K27ac HiChIP (right), n = 20912, 27794, 37811, 31232, 26556 loops in Vector, WT, ΔcIDR, UTX-eIFIDR, ΔcTPR samples, respectively. e. Scatter plots for all 26,237 genes using the promoter H3K27ac HiChIP DCI of cells indicated on the x and y axes. Dashed line shows the linear fit with indicated correlation coefficient (r). Red and blue dots represent 479 and 112 genes with DCI of WT over vector > 5 and < −5, respectively. f. Box plots for DCI scores comparing cells indicated on x axis for genes represented by the red (left) and blue (right) dots from e. P values are shown for data significantly greater or less than 0 by two-sided one sample t-test. Left, n = 479 genes each; right, n = 112 genes each. g. Box plots for DCI scores at promoter (TSS ± 2kb) of cell samples (indicated on top of each plot) for genes whose expression was up- or downregulated (indicated on x axis) in the corresponding cells (indicated on top of each plot). Genes were selected as those both have expression and DCI data. The trend of change and the P value between indicated two groups of genes are shown. n = indicated number on x axis. h. Gene pathway analysis using DCI scores from indicated HiChIP data. Red shows pathways relevant to cancer or leukaemia inhibition among the top 20 significantly enriched Biological Processes for genes with promoter chromatin interactions substantially increased in WT over vector expressing cells. Blue shows pathways particularly relevant to cell growth and survival among the top 20 significantly enriched Biological Processes for genes with promoter chromatin interactions substantially decreased in WT over vector expressing cells. All by Metascape. Details in Supplementary Table 5. i, j. BART3D results showing transcriptional regulators that were likely to be associated with genomic regions that had increased (i) or decreased (j) chromatin interactions in WT over vector expressing cells from indicated HiChIP datasets. Top 10 ranked transcriptional regulators are labeled in red and their names are shown. For all box plots, data are median (horizontal line), 25–75th percentiles (box) and 1.5 times the interquartile range recorded (whiskers). P values by two-sided t-test for all except by hypergeometric test for h, and from BART3D for i, j.
a. Normalized RNA-seq reads of indicated genes that are shown for integrated genomic analysis, in Vector (V), WT (W), ΔcIDR (c), UTX-eIFIDR (e), ΔcTPR (T) samples. One repeat of the Vector sample was set to 1. HK2 gene (boxed) differs from the other genes in that it was repressed by UTX WT but not ΔcIDR. b. Integrated genomic analysis for MX2-MX1 cluster, OAS1-3-2 cluster, OASL, UBE2W-LY96 cluster, KCNJ2, and HK2. as examples for UTX condensates-regulated gene expression associated with chromatin interactions. Top, PRO-seq 3′ end coverage tracks (red, + strand; blue, – strand), with the same scale for all samples. A higher MX2 transcriptional activity in WT than ΔcIDR cells is shown by the denser WT PRO-seq signals when IGV-zoomed in in the dashed box on top. Also note the higher transcriptional activity of LY96 in UTX WT than vector and ΔcIDR cells, as shown by the PRO-seq signal density and the - strand eRNA (in green oval) at both LY96 promoter and gene-downstream enhancer. Green arrows show the higher H3K4me1 or H3K27ac signal in WT than the other two cell samples at OASL promoter. PRO-seq confirmed transcriptional changes and, together with the H3K4me1 and H3K27ac peaks, revealed actively and bi-directionally transcribing enhancers. Note that H3K27ac and/or H3K4me1 were increased at MX2, OASL, and KCNJ2 promoters or enhancers upon WT but not ΔcIDR expression. Bottom, H3K27ac HiChIP arc plots of loops are shown on WashU EpiGenome Browser. All arc plots have the same scale for statistical significance (arc darkness) of the loop calls. Black arrows indicate normal loops in WT-expressing cells that were lost in vector, ΔcIDR, or ΔTPR cells. Cyan arrows indicate loops normally suppressed by UTX WT but aberrantly formed in vector, ΔcIDR, or ΔTPR cells.
Extended Data Fig. 11 UTY is a weaker tumour suppressor than UTX due to its altered physical property of condensates.
a. Immunoblotting by indicated antibody of total lysates from THP-1 cells electroporated with empty vector or indicated Myc-tagged UTX WT or mutants. b. Amino acid sequence alignment of UTX and UTY cIDR. Middle row shows identical residues (by letter) and conservative mutations (by “+”). Y and F are labeled with green box. Negatively charged residues (E and D) are in red font, and positively charged residues (R and K) are in blue font. Charged residues that are unique in UTX or UTY cIDR are bold and underlined. c. Net charge per residue for cIDR of UTX and UTY, determined by CIDER. d. Images of Ni resin after binding with bacterial lysates during purification of indicated proteins. Red arrows point to the unusual aggregation of the resin bound to UTY cIDR. e. Coomassie blue staining image of purified UTX cIDR and UTY cIDR with indicated fluorescence tag. A non-specific band is indicated by the asterisk. f. Indicated proteins all at 30 µM protein and in 150 mM NaCl were subjected to centrifugation. The unspun mixture (“mixed”), supernatant (“super”), and pellets were resolved on SDS-PAGE gel followed with coomassie blue staining. Partition Percentage is plotted (right) as described in Methods as mean ± SD of 3 repeats. ***P<0.001 by two-sided t-test. g. Fluorescence and DIC images of mCherry-UTX cIDR or mCherry-UTY cIDR at indicated concentrations and time points after dilution into the condensation buffer. h. Fluorescence microscopy images of mEGFP-UTX cIDR or mEGFP-UTY cIDR at increasing concentrations of protein and NaCl. A shorter exposure time was used for all images of 20 µM protein. i. Fluorescence Image and line carpet of mEGFP-UTX cIDR and mEGFP-UTY cIDR droplets. j. Line RICS autocorrelation curves of 50 µM mEGFP-UTX cIDR and mEGFP-UTY cIDR at early (< or = 100 min) or late (> 170 min) time of phase separation after dilution into the condensation buffer, shown as mean ± SD. n UTX early = 8, n UTY early = 9, n UTX late = 8, n UTY late = 9 droplets. k. Representative fluorescence microscopy images for FRAP assays of UTX or UTX-UTYIDR fused to EGFP in nuclei following transfection into 293T cells. The photobleached focus was boxed and also amplified in the images on the right. Photobleached at time 0.
All panels in this figure used 20 µM (unless otherwise indicated in a) mEGFP-UTX (419-848) WT or mutants as indicated. S781Y and S814F mutation was found in one and two patients, respectively, and both were predicted by ICGC to have high functional impact. 5M is a compound missense mutation containing S674Y, S781Y, H808Y, S814F, and S818L, all found in one cancer patient. See Methods “Cancer database analysis for mutations” for details. a. Pictures and relative turbidity as OD600 normalized to the value of WT and plotted as mean ± SD. n = 3 independent experiments. Bottom, coomassie blue staining of the corresponding proteins, sometimes with finely adjusted (10%) different concentrations (for S814F) to ensure fair comparison with WT. b. Representative fluorescence microscopy and DIC images at indicated times in condensation. Scale bar, 10 µm. c. FRAP recovery curves as mean ± SD at indicated early and late times in condensation. Top two panels, n = 13 droplets for WT and S781Y each at early times, and n = 9 for each of them at late times. Bottom two panels, n = 9, 10, 10 for WT, 5M, S814F, respectively, at early times, and n = 8 for each of them at late times. The statistics is for the last time points, but, for WT and S814F at late condensation times, is for 0-143 seconds after photobleaching. d–f. Line RICS assays for WT (n = 21 droplets), S781Y (n = 12), S814F (n = 10), and 5M (n = 11) over the time range of 0-100 min in condensation. d. Fluorescence Images and line carpets. e. Line RICS autocorrelation curves. f. Plot showing the mean ± SD of the diffusion coefficients. ***P<0.001, and ****P<0.0001 by two-sided t-test for all.
This file contains gel source images
Supplementary Table 1 Changes of gene expression by expression of WT UTX and mutants in leukaemia cells
RNA-seq analyses (TPM) for selected genes in THP-1 cells that were stably transduced with vector, UTX WT, ΔcIDR, UTX(eIFIDR), ΔTPR or UTX(FUSIDR) (in sheet 1), or with vector, UTX WT or MT2 (in sheet 2). Two biological repeats each. These genes were significantly (adjusted P < 0.05, from DESeq2) up- or down-regulated (1.25-fold each) by WT compared with vector control.
Supplementary Table 2 Changes of gene expression by deletion of endogenous UTX or cIDR in mouse ES cells and embryoid bodies
Normalized reads by RNA-seq analyses for all genes in the mouse genome in the WT, genomically edited Utx-KO and ΔcIDR ES cells and late embryoid body from the differentiation assays. Sorted by the mean reads in all four ES cell samples. The differential expression details including the log2 fold change and adjusted P values were from DESeq2.
Based on UTX ChIP-seq for THP-1 cells that were stably transduced with vector, UTX WT or ΔcIDR (DEL in the table). UTX peaks were called using vector as control. The ‘ChIP-seq binding at WT UTX’ sheet also shows the histone modification and UTX and MLL4 binding signals at the WT-bound sites in different cells. The peak details including q-values were from MACS2.
HiChIP analysis in THP-1 cells that were stably transduced with vector, UTX WT, ΔcIDR, UTX(eIFIDR) or ΔTPR. There are four sheets in this file. Sheet ‘QC’ lists numbers of reads of indicated characteristics. The other three sheets show called chromatin loops in HiChIP assays indicated by the sheet name. The raw count and FDR were from MAPS results, and the raw count was normalized by per 100 million mapped pairs after duplicate removal (from MAPS QC file).
Integrated genomic analysis for all genes in the genome in THP-1 cells that were stably transduced with vector, UTX WT, ΔcIDR (DEL), UTX(eIFIDR) (EIF) or ΔTPR (TPR). Each gene is shown with its expression levels (from RNA-seq) and DCI scores. The differential expression details including the log2 fold change and adjusted P values were from DESeq2.
Information for oligonucleotides used for CRISPR–Cas9-mediated genome editing in this work.
Purified UTX cIDR formed droplets immediately upon dilution into the condensation buffer and reached 30 µM protein and 150 mM NaCl. The red arrows point to fusion events immediately prior to the fusion at different times throughout the video. Played at 100× speed.
Representative FRAP assays for UTX–EGFP transfected into 293T cells. Played at 5× speed.
Example 1 for OptoDroplets formed by MLL4C(N)–mCherry–Cry2 and UTX (WT, ΔcIDR or ΔTPR)–EGFP in 293T cells. Played at 5× speed.
Example 2 for OptoDroplets formed by MLL4C(N)–mCherry–Cry2 and UTX (WT, ΔcIDR or ΔTPR)–EGFP in 293T cells. Played at 5× speed.
Example for OptoDroplets formed by UTX (WT, ΔcIDR or ΔTPR)–mCherry–Cry2 and MLL4C(N)–EGFP in 293T cells. Played at 5× speed.
Representative FRAP assays for purified mEGFP–UTX or UTY cIDR in vitro. Played at 5× speed.
About this article
Cite this article
Shi, B., Li, W., Song, Y. et al. UTX condensation underlies its tumour-suppressive activity. Nature 597, 726–731 (2021). https://doi.org/10.1038/s41586-021-03903-7