Figures
Abstract
SF3B1, a component of the U2 snRNP pre-mRNA splicing factor, plays a critical role in splicing and is frequently mutated in cancer, particularly hematologic malignancies. We investigated the effects of the most common SF3B1 mutation, heterozygous substitution of Lysine 700 to Glutamate (K700E), in human embryonic stem cells (hESC), using CRISPR-Cas9 to generate heterozygous SF3B1K700E clones. Interestingly, we observed the upregulation of several key transcription regulators associated with hematopoiesis and a broad range of immune genes in SF3B1K700E hESCs. Despite differences in the transcriptional and splicing profiles between hESC and myelodysplastic syndrome (MDS) cells harboring the SF3B1K700E mutation, several common immune gene programs were identified in both cell types. To elucidate the molecular mechanisms underlying dysregulated gene expression in SF3B1K700E hESCs, we mapped actively engaged RNA polymerase II (RNA Pol II) using Precision Run-On sequencing (PRO-seq). These analyses revealed that the SF3B1K700E mutation alters RNA Pol II elongation properties. Specifically, we observed a general increase in pause release in SF3B1K700E hESCs, consistent with recent work in leukemia cells suggesting that the SF3B1K700E mutation affects early transcription elongation. Taken together, our study identifies several candidate genes that could contribute to the SF3B1 mutated phenotype and clarifies the role for the U2 snRNP and pre-spliceosome assembly on transcription by RNA Pol II. Further, our data suggest that mutations of SF3B1 impact immune gene expression independent of cell type, providing new insights into the role of SF3B1K700E in hematologic malignancies.
Citation: Dastpak M, Mimoso CA, Farshchian M, Paraggio CL, Leonard CE, Yin S, et al. (2025) SF3B1K700E mutation in human embryonic stem cells causes aberrant expression of immune-related genes. PLoS One 20(12): e0334361. https://doi.org/10.1371/journal.pone.0334361
Editor: Eric A. Shelden, Washington State University, UNITED STATES OF AMERICA
Received: April 16, 2025; Accepted: September 23, 2025; Published: December 17, 2025
Copyright: © 2025 Dastpak et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: RNA-seq and PRO-seq data reported in this paper were uploaded to Gene Expression Omnibus (GEO) database, accession number GSE278364.
Funding: This work was supported by NIH grant NIGMS GM122524 to RR, NIGMS GM139960 to KA, and Boston Children’s Hospital Office of Faculty Development to RGR.No sponsor played any role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: K.A. received research funding from Novartis not related to this work, consults for Syros Pharmaceuticals and Odyssey Therapeutics, and is on the SAB of CAMP4 Therapeutics. No aspect of this work is related to my consulting or other activities. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Introduction
SF3B1 is a component of the essential splicing factor U2 snRNP (small nuclear ribonucleoprotein) and is the most frequently mutated spliceosomal gene in cancer [1–3]. Pan-cancer genome analyses indicate that heterozygous SF3B1 mutations are primarily associated with hematological cancers, however, the fitness advantage these mutations provide to cancer cells is poorly understood. Pathogenic SF3B1 mutations are most frequent in myelodysplastic syndromes (MDS) and are also found in acute myeloid leukemia (AML), chronic lymphocytic leukemia (CLL) and several solid tumors, such as breast cancer (BRCA) and uveal melanoma (UVM) [4–9].
SF3B1 functions early in spliceosome assembly, and plays a central role in recognition of the 3’ end of introns [10]. Previously, we cloned and characterized components of the U2 snRNP-associated complexes SF3a and SF3b [11–16]. Our studies, along with recent cryo-EM and crystallographic work, revealed that the SF3B1 component of SF3b binds to pre-mRNA on both sides of the branchpoint sequence (BPS) at the spliceosome core [15–20]. Mutations in SF3B1 cause a variety of mis-splicing events, leading to alterations in cellular pathways that may drive pathogenesis [7,21,22]. For example, recent studies revealed that mutant SF3B1 leads to mis-splicing of the NF-κB signaling inhibitor, MAP3K7, which was proposed to play a role in MDS pathogenesis [22–24]. However, another study reported that MAP3K7 mis-splicing leads to downregulation of GATA1, a master regulator of erythroid differentiation, and that this in turn results in the severe anemia associated with MDS [22–25].
To date, studies of cancer-associated SF3B1 mutations have been carried out in blood cancer patient cells (MDS, AML, CLL), blood cells from mouse models, blood cancer cell lines (e.g., NALM and K562), or non-blood transformed cell lines (e.g., HeLa and HEK293 cells) [23,26–33]. In our study, we sought to define the fundamental effects of the K700E mutant of SF3B1 by examining it in a non-transformed, ground state pluripotent cell. The SF3B1K700E mutation is the sole independent predictor of worse overall survival in MDS [34], with conditional knock-in mouse models developing macrocytic anemia, erythroid dysplasia, and long-term hematopoietic stem cell expansion [35]. Interestingly, this specific mutation also plays a tumorigenic role in other cancers, such as pancreatic ductal adenocarcinoma, suggestive of diverse roles in derivatives of multiple germ layers [36].
We CRISPR-edited hESCs to harbor a heterozygous K700E mutation and used RNA-seq to examine changes in gene expression and splicing. Surprisingly, several master transcription regulators of hematopoiesis and numerous immune genes were upregulated in the SF3B1K700E hESCs, suggesting that wild-type SF3B1 plays a global role in ensuring proper regulation of immune genes. Intriguingly, in blood and non-blood cancers with the SF3B1 mutation, immune pathways are among the top downregulated pathways. To gain a better understanding of the underlying molecular mechanisms, we employed nascent RNA analysis to examine the transcriptional landscape in SF3B1K700E hESCs. These analyses demonstrated that RNA Pol II elongation properties are altered in hESCs with the SF3B1K700E mutation, specifically showing a general increase in pause release.
Materials and methods
Cell culture
H9 hESC lines were cultured using the mTeSR1TM1 kit (STEMCELL technology) and 1% penicillin-streptomycin (Sigma) on Matri-coated (CORNING) tissue culture plates and incubated at 37°C in humidified 5% (vol/vol) CO2.
CRISPR editing of SF3B1K700E in hESCs
H9 ES cells (WA09) obtained from WiCell Research Institute (Madison, WI) were cultured in E8 medium on Matrigel-coated plates at 37 °C with 5% (vol/vol) CO2 [37]. CRISPR/CAS9 was used by the HMS Cell Biology Initiative for Genome Editing and Neurodegeneration to generate lines containing an A > G point mutation resulting in K700E amino acid substitution in the SF3B1 gene. The CRISPR guide sequence was 5’-TGGATGAGCAGCAGAAAGTT-3’. The Ultramer sequence used to introduce the mutation was: 5’-TGTAACTTAGGTAATGTTGGGGCATAGTTAAAACCTGTGTTTGGTTTTGTAGGTCTTGTGGATGAGCAGCAGgAAGTTCGGACCATCAGTGCTTTGGCCATTGCTGCCTTGGCTGAAGCAGCAACTCCTTATGGTATCGAATCTTTTGAT-3 (point mutation lowercase; PAM region underscored). To create H9 cells harboring a heterozygous K700E mutation in SF3B1, 0.6 μg sgRNA was incubated with 3 μg SpCas9 protein for 10 minutes at room temperature and electroporated into 2x105 H9 cells along with the Ultramer repair template. Mutants were identified by Illumina MiSeq. In addition, K700E mutation status was further confirmed by PCR amplification of genomic DNA, followed by Sanger sequencing using a primer set specific for SF3B1. The primer sequences were as follows: forward, 5’-AATTTGGGCTACTGATTTGGGGA-3’; reverse, 5’-GCCTTCAAGAAAGCAGCCAAAC-3’.
Bulk RNA sequencing and data processing
Total RNA isolation from SF3B1K700E and WT hESCs was performed using the RNeasy Kit (Qiagen) following manufacturer’s instructions. Bulk mRNA sequencing of triplicate samples was performed using NEBNext® Ultra™ II RNA Library preparation (NEB #E7775) with PolyA selection on the Illumina HiSeq4000 using the standard protocol for Illumina. On average, ~ 31M paired-end reads were obtained per sample.
All custom scripts described are available on the AdelmanLab GitHub (https://github.com/AdelmanLab/NIH_scripts). For Figs 2 and 6, the custom script (trim_and_filter_PE.pl) was used to trim FASTQ read pairs to 75 bp per mate and read pairs with a minimum average base quality score of 20 retained. Read pairs were further trimmed using cutadapt 1.14 to remove adapter sequences and low-quality 3’ bases (--match-read-wildcards -m 20 -q 10). Reads were aligned to human (hg38) genome using parameters --quantMode TranscriptomeSAM GeneCounts --outMultimapperOrder Random --outSAMattrIHstart 0 --outFilterType BySJout --outFilterMismatchNmax 4 --alignSJoverhangMin 8 --outSAMstrandField intronMotif --outFilterIntronMotifs RemoveNoncanonicalUnannotated --alignIntronMin 20 --alignIntronMax 1000000 --alignMatesGapMax 1000000 --outFilterScoreMinOverLread 0 --outFilterMatchNminOverLread 0. Duplicates were also removed using STAR. Stranded coverage bedGraph files were generated from deduplicated BAM files using STAR. BedGraphs were depth-normalized using the normalize_bedGraph custom script. For Fig 2D, featurecounts was used to count RNA-seq reads within exons per gene and DESeq2 was run using default parameters to generate a list of differentially expressed genes between SF3B1K700E mutant and WT hESCs. For Fig 6G, depth-normalized BedGraph files were converted to the bigWig format, and merged bedGraphs for each experimental condition were generated using bigWigMerge (UCSC tools). Merged bedGraphs were then converted to the bigWig format for visualization.
(A) A schematic of SF3B1 gene showing domain structure and position of K700E mutation. Target sequence for sgRNA is labeled in red, followed by PAM sequence labeled in blue. (B) Sanger sequencing validation of heterozygous K700E mutation in three independent hESC clones. (C) Distribution of mis-splicing events in SF3B1K700E hESCs. ∆PSI ≥ 10% and p-value < 0.05 were used as thresholds. SES, single-exon skipping; MES, multiple-exon skipping; MXS, mutually exclusive splicing; A5ss, alternative 5′ splice site; A3ss, alternative 3′ splice site. (D) Volcano plot depicting mis-splicing events in SF3B1K700E ES cells. Representative genes with top ΔPSI% are labeled. (E) Representative mis-spliced genes in SF3B1K700E hESCs and SF3B1MT blood cancers (MDS, AML and CLL), blood cell lines (K562 and NALM6) and non-blood cancers (BRCA and UVM) with immune functions. (F) Integrated genome viewer (IGV) browser shots show mis-splicing in SF3B1K700E versus WT hESCs (highlighted in pink). Plots are from 3 biological replicates of an hESC line. See also S2 and S3 Tables.
(A) Distance heatmap of RNA-seq data from SF3B1K700E and WT ES cells in 3 biological replicates. (B) PCA plot of samples from WT and SF3B1K700E mutant hES cells. (C) Volcano plot depicting differentially expressed genes (among N = 14,946 active genes) in SF3B1K700E mutant vs. WT cells. Affected genes were defined by DESeq2 (p < 0.05 and Fold Change >1.5). (D) Several representative master hematopoietic genes and immune genes FC: Fold Change. (E) Top 20 gene sets with the most positive normalized enrichment scores (NES), indicative of gene upregulation in SF3B1K700E ES lines (NOM p-value < 0.05). See also S1 Fig and S4,S6 Tables.
(A) Gene set enrichment analysis, showing the most downregulated gene sets in SF3B1MT MDS patients. Hematopoietic/Immune pathways are highlighted in green. Gene sets with significant NES (NOM p-value < 0.05) are shown. (B) Gene co-expression network in SF3B1MT patients using WGCNA shows clustering of 16 module eigengenes in MDS patients. SF3B1MT module is labeled in pink. (C) Eigengene adjacency heatmap indicates the correlation between modules. Red and blue represent positive and negative correlations, respectively. Asterisks indicate modules negatively associated with immune pathways. (D) Heatmap of module-trait relationship displaying correlation coefficients between module eigengenes and SF3B1MT phenotype. Colors indicate positive (red) or negative (green) correlations. Values without parentheses indicate correlation coefficient, corresponding p-values are in parentheses at the bottom of each row. Modules with significant p-values are in bold. (E, F) Enriched pathways in modules M6_purple and M7_salmon, respectively, are listed. See also S2 Fig and S7-S9 Tables.
(A, B) Significant immune gene sets affected by SF3B1 mutation in blood and non-blood cancers, respectively. The number of gene sets that are positively enriched (upregulated genes) as compared to downregulated genes are indicated. (C, D) Top gene sets with the most negative normalized enrichment scores (NES), indicative of gene downregulation in SF3B1MT BRCA and UVM patients, respectively. Hematopoietic/immune pathways are highlighted in green. Gene sets with significant NES (NOM p-value < 0.05) are shown. See also S7,S10, S11 Tables.
(A) Shared immune GSEA gene sets downregulated in SF3B1MT blood and non-blood cancers. Leukocyte migration gene set is highlighted in green. (B) Leukocyte migration is the only gene set shared among all splicing factors examined in this study. NES and NOM p-value are shown. (C) Venn diagram showing CCR1 as the only shared downregulated gene among the indicated SF3B1MT cancers and U2AF1 MT MDS. Additional CCRs present in the indicated cancers are shown. (D) Balloon plot showing downregulation of CCL genes in splicing factors (SF) cancers. See also S12 Table.
(A) Density scatter plot depicting the relationship between the fold change in RNA-seq and gene body PRO-seq signal (+250 to TES) at genes changed in RNA-seq (N = 1,754). (B) RNA-seq and PRO-seq at an example upregulated gene FDXR (left) and downregulated gene MAP3K7 (right). PRO-seq signal was re-scaled downstream of the TSS to highlight gene body signal. (C) Metagene plot of PRO-seq signal at upregulated genes. Reads were summed in 25 nt bins, aligned to the TSS. Inset highlights gene body PRO-seq signal (TSS + 250 nt to + 2250 nt). At right are box plots reporting gene body PRO-seq reads (TSS + 250 to +2250nt) at upregulated genes per indicated condition. P-values generated using the paired t-test. (D) Same as C, but for downregulated genes. (E) Box plot reporting the fold change in promoter (TSS to +100 nt) PRO-seq signal at up- and down-regulated genes in WT and SF3B1 mutant cells. P-values were calculated with the paired t-test. (F) Pausing index was calculated at upregulated and downregulated genes as the ratio of promoter to gene body PRO-seq read density. The cumulative distribution of pausing indices in SF3B1K700E mutant and WT cells is shown. P-values generated using the paired t-test. See also S13 Table.
Data processing of publicly available MDS and CLL patient data
Raw FASTQ files for MDS and CLL patients, K562 and NALM-6 cell lines, were downloaded from NCBI-GEO datasets (accession numbers GSE85712, GSE128805, GSE128429, GSE72790, GSE95011) (S1 Table). Data related to Fig 1 (hESC RNA-seq) and public datasets were aligned using STAR version 2.5.2b with the following parameters:: --outSAMtype BAM –SortedByCoordinate --outSAMunmapped Within --outSAMattributes Standard --quantMode GeneCounts [38]. Ensembl database (human release 99) and Human assembly hg38 (GRCh38) were used as the gene annotation and reference genome, respectively. In addition, TCGA-BRCA, TCGA-UVM, and AML from the BEATAML1.0-COHORT, including aligned BAM files and STAR 2-Pass Genome counts, were downloaded from the GDC data portal (https://portal.gdc.cancer.gov/).
Identification of differential splicing events
To estimate alternative splicing (AS) events, PSI-Sigma v1.9j was used to provide a comprehensive analysis of AS events [39]. We used the default parameters of PSI-sigma, which required at least 10 supporting reads to detect splicing events with Homo sapiens GRCh38.99 as a reference genome. For GDC aligned BAM files, SJ.out files were generated. To quantify changes in splicing patterns, percent-spliced-in (PSI) index and differential PSI (∆PSI) values were calculated to find all isoforms in a specific gene region. Splicing events with ∆PSI ≥ 10% and p-value < 0.05 were included in analyses. Additionally, Integrative Genomic Viewer (IGV_2.8.9) was used for visualizing aberrant splicing events in SF3B1 mutant and wildtype samples.
Gene set enrichment analysis (GSEA)
Gene ontology (GO) analysis was performed using GSEA (release v4.1.0) with gene set collection ‘c5.go.bp.v7.4.symbols.gmt’. The pre-ranked gene list was analyzed to identify biological processes positively or negatively associated with SF3B1 mutations. Gene sets with significant NES (NOM p-value < 0.05) were evaluated.
Co-expression and weighted gene co-expression network analysis (WGCNA)
Weighted gene co-expression network analysis (WGCNA) was used to construct a transcriptional network from RNA expression profiles. WGCNA (R package, ‘WGCNA’ version 1.70–3) identifies the key modules and module-trait relationship using topological overlap measure (TOM) and detects similarly expressed gene sets across MDS samples [40].
Precision run-on sequencing (PRO-seq)
The cell permeabilization was performed according to the protocols established by the Nascent Transcriptomics Core at Harvard Medical School, as described in Mimoso and Goldman [41]. Aliquots of frozen (−80°C) permeabilized cells were thawed on ice and pipetted gently to fully resuspend. Aliquots were removed and permeabilized cells were counted using a Luna FX7 (Logos Biosystems) instrument. For each sample, 1 million permeabilized cells were used for nuclear run-on with 50,000 permeabilized Drosophila S2 cells added for normalization. Nuclear run on assays and library preparation were performed as described in Mimoso and Goldman [41]. Pooled libraries were sequenced using the Illumina NovaSeq platform.
PRO-seq data analysis
All custom scripts described herein are available on the AdelmanLab GitHub (https://github.com/AdelmanLab/NIH_scripts). Dual, 6 nt Unique Molecular Identifiers (UMIs) were extracted from read pairs using UMI-tools [42]. Read pairs were trimmed using cutadapt 1.14 to remove adapter sequences (-O 1 --match-read-wildcards -m 26). The UMI length was trimmed off the end of both reads to prevent read-through into the mate’s UMI, which will happen for shorter fragments. An additional nucleotide was removed from the end of read 1 (R1), using seqtk trimfq (https://github.com/lh3/seqtk), to preserve a single mate orientation during alignment. The paired end reads were then mapped to a combined genome index, including both the spike (dm6) and primary (hg38) genomes, using bowtie2 [43]. Properly paired reads were retained. These read pairs were then separated based on the genome (i.e., spike-in vs primary) to which they mapped, and both these spike and primary reads were independently deduplicated, again using UMI-tools. Reads mapping to the reference genome were separated according to whether they were R1 or R2, sorted via samtools 1.3.1 (-n), and subsequently converted to bedGraph format using a custom script (bowtie2stdBedGraph.pl). We note that this script counts each read once at the exact 3’ end of the nascent RNA. Because R1 in PRO-seq reveals the position of the RNA 3’ end, the “+” and “-“ strands were swapped to generate bedGraphs representing 3’ end positions at single nucleotide resolution.
Annotated transcription start sites were obtained from human (GRCh38.99) GTFs from Ensembl. After removing transcripts with {immunoglobulin, Mt, Mt_tRNA, rRNA} biotypes, PRO-seq signal in each sample was calculated in the window from the annotated TSS to +150 nt downstream, using a custom script, make_heatmap.pl. This script counts each read one time, at the exact 3’ end location of the nascent RNA. Good agreement (spearman rho > 0.96) between replicates was observed.
BedGraphs were normalized using the normalize_bedGraph script. We observed a consistent increase in reads mapping to dm6 after introducing the SF3B1K700E mutation in hESCs (Average percentage of reads mapping to dm6: WT hESCs = 1.6%, SF3B1K700E hESCs = 2.1%), indicating a global decrease in transcription in SF3B1K700E mutant hESCs. As a result, the PRO-seq libraries were spike-in normalized. To generate the spike-in normalization factors per replicate, the number of reads mapping to dm6 for each sample was divided by the sample with the fewest number of reads mapping to dm6 (WT Replicate #1). bedgraphs2stdBedgraph was used to generate combined replicate bedGraphs by summing counts per nucleotide across replicates for each condition. BedGraphs were converted to the bigWig format for visualization.
An annotation of the single dominant transcription start site (TSS) and transcription end site (TES) per active gene in mESCs was obtained as described in Mimoso and Adelman 2023 with the following modifications: PRO-seq and RNA-seq data from WT and SF3B1K700E cells was used as input for the custom script GetGeneAnnotation (available on the Adelman Lab Github (get_gene_annotations.sh, https://github.com/AdelmanLab/GeneAnnotationScripts). The custom script make_heatmap was used to generate count matrices aligned to dominant TSSs [44]. Metagene plots were generated by summing reads within bins at each indicated position with respect to the TSS and dividing by the number of annotations. For all metagene plots, the bin size and number of investigated annotations are indicated in the figure legends. Box plots have a line at the median and whiskers depicting 10–90th percentile. GraphPad Prism was used to visualize metagene and box plots.
Ethics statement
The patients’ RNA-seq data presented in the current publication are based on the use of study data downloaded from the dbGaP website (S1 Table). This study does not meet the definition of human subjects research due to the fact that the data used was completely de-identified. Therefore, IRB approval was not required.
Results and discussion
The SF3B1K700E mutation results in specific RNA mis-splicing in hESCs
We established three independent heterozygous SF3B1K700E hESC lines using CRISPR/Cas9-mediated genome editing (Fig 1A), which were validated by Sanger sequencing (Fig 1B). To determine how the K700E mutation affected gene expression and splicing in hESCs we carried out RNA-seq on each of the three SF3B1K700E lines in triplicate, as compared to the wild type (WT) parental hESC line. These data confirmed that mutant and WT SF3B1 alleles were expressed at similar levels in the SF3B1K700E hESC lines. Consistent with previous studies, the predominant mis-splicing event in SF3B1K700E hESCs was the use of cryptic, alternative 3’ splice sites (Fig 1C and 1D; S2 Table). We observed mis-splicing of many genes, including MAP3K7, DLST, DVL2, TMEM14C, DYNLL1, SNRPN, and BRD9, that were reported in studies of mutant SF3B1 cancers/ cell lines (Figs 1E and 1F) [9,23,28,35]. Moreover, the mis-splicing events occurred at identical coordinates in SF3B1K700E hESCs and SF3B1 cancers/transformed cell lines (Fig 1E; S3 Table).
Immune genes are upregulated in SF3B1K700E hESCs
We next investigated the effect of the SF3B1 mutation on gene expression in hESCs. Samples clustered well by genotype, with lines harboring the SF3B1K700E mutation clearly separating from the WT hESCs (Figs 2A-B). As shown in Fig 2C, we identified 1,177 up- and 577 down-regulated genes in SF3B1K700E hESCs (fold change ≥ 1.5, p-value < 0.05, S4 Table). Interestingly, several of the upregulated genes in the mutant hESC lines act as master regulators of hematopoiesis, with key roles in hematopoietic stem cells (HSCs), including PITX2, ARID5B, and MAFB. Additionally, we identified upregulated genes that contribute to the immune response [45–48]. These include SERPINE genes, which are serine protease inhibitors, and GDF15, a well-known marker for ineffective erythropoiesis (Fig 2D) [49,50].
To gain additional insight into the dysregulated genes in SF3B1K700E hESC lines, we carried out a gene set enrichment analysis (GSEA). This revealed that gene sets related to chromatin organization and DNA replication were downregulated, indicative of downregulation in SF3B1K700E hESC cells (S5 Table). These results are consistent with previous findings in cells treated with an inhibitor of SF3B1, which showed defective DNA replication and chromosome integrity [51]. Gene sets upregulated in SF3B1K700E hESCs were primarily associated with immune and inflammatory responses (Fig 2E, S5 Table), supporting a broad upregulation of related genes. Across all three lines, 18% of the total upregulated genes play roles in the immune response. A list of immune genes that were significantly upregulated in the SF3B1K700E hESC lines is shown in S6 Table. These data suggest that the essential splicing factor SF3B1 has a specific role in ensuring proper expression of hematopoietic and immune genes [22,35,52–54].
Immune genes are downregulated in SF3B1MT MDS
Considering our observation that immune genes are upregulated in SF3B1K700E hESCs, we used GSEA to investigate gene expression in SF3B1MT MDS using published RNA-seq data (S1 Table) [21,35,55]. MDS patients who were WT for SF3B1 were used as controls. In striking contrast to hESCs bearing SF3B1K700E mutation, immune genes were typically downregulated in SF3B1MT MDS. In fact, a majority of the downregulated gene sets in SF3B1MT MDS samples are related to immunity (highlighted in green in Fig 3A; full list shown in S7 Table), suggesting broad downregulation of immune genes. Indeed, few immune gene sets were upregulated in SF3B1MT MDS (S7 Table). To further examine these unexpected findings, we applied weighted gene correlation network analysis (WGCNA) to identify gene networks correlated to SF3B1MT MDS gene expression [40]. Sixteen modules of densely interconnected genes were identified (Fig 3B; S2 Fig); the pathways within these modules are listed in S8 Table. High levels of similarity in the SF3B1MT MDS gene expression data sets are shown by the clustering of the 16 modules (Fig 3B) and by the eigengene adjacency heat map (Fig 3C). The M6_purple (r = 0.61, p-value = 3e-06) and M7_salmon (r = 0.31 and p-value = 0.03) modules, designated by the asterisks in Fig 3D, were negatively associated with immune pathways (Fig 3E,F). Among these are MHC class II pathways as well as pathways related to T and B cells (list of gene interactions see S9 Table). Together, these results support the conclusion that immune genes are downregulated in SF3B1MT MDS. We considered that this could be a direct effect of SF3B1 mutation, or an indirect effect of perturbed immune cell differentiation in SF3B1-mutated progenitors.
Immune gene sets are broadly downregulated in SF3B1MT cancers
We also carried out GSEA of SF3B1MT CLL and AML using published datasets with the corresponding WT splicing factor patient data as controls (S1 Table) [28,56]. As observed with SF3B1MT MDS, the strong negative enrichment of immune gene sets implies broad gene downregulation in AML or CLL-bearing mutant SF3B1 (Fig 4A; S10 Table). We conclude that immune gene sets/genes are consistently downregulated in multiple different SF3B1MT blood cancers.
As SF3B1 mutation is also associated with non-blood cancers, we next asked whether the strong negative enrichment of immune gene sets in MDS/CLL/AML was due to cell type. Accordingly, we carried out GSEA using two SF3B1 mutant non-blood cancers, breast cancer (BRCA) and uveal melanoma (UVM). Strikingly, we obtained the same results with these cancer cell types as with the blood cancers. The immune gene sets with significant enrichment scores in comparisons of SF3B1MT versus SF3B1WT BRCA and UVM exhibit negative associations, indicating downregulation of immune genes in SF3B1MT solid cancers (Figs 4B-D, S11 Table).
Together, our data indicate that, although the mis-splicing patterns are similar between SF3B1K700E ES cells and SF3B1MT cancers, there is a striking difference in expression of the immune gene sets in SF3B1MT ES cells versus cancer cells. Consistent with previous studies, we conclude that the function of the SF3B1K700E mutation in ES or other cancers will be affected by cell state, as well as genomic and transcriptomic background [57].
Downregulation of genes associated with leukocyte migration is common among SF3B1 cancers and U2AF1, SRSF2, and ZRSR2-mutated MDS
To identify common hallmarks between different damaging mutations in splicing factors in a variety of cancers and MDS, we examined whether any of the GSEA immune gene sets overlapped in the public datasets of blood and non-blood cancers associated with SF3B1 mutations. This analysis revealed 7 gene sets in common. Among them were lymphocyte pathways and leukocyte migration (Fig 5A). We then examined GSEA data for SRSF2MT and ZRSR2MT MDS. Mutations in SRSF2, which binds to exonic splicing enhancers, alter exon inclusion [58]. ZRSR2 is mainly associated with the minor spliceosome, and its mutation primarily results in intron retention [59]. Remarkably, the leukocyte migration gene set was also downregulated in these splicing factor-mutated cancers (Fig 5B). Moreover, this gene set was the only one in common among all the splicing factor cancers that we examined, including SF3B1MT MDS, AML, CLL, UVM and BRCA as well as MDS associated with mutations in SRSF2, ZRSR2 or U2AF1, a protein that interacts with U2AF2 to recognize the 3’ splice site [58–60] (Fig 5B).
Additionally, at the level of individual genes, we identified one gene, CCR1, that is downregulated in all 5 SF3B1MT cancers examined in this study. CCR1 is one of ~20 C-C motif chemokine receptors (CCRs), which are 7 membrane proteins that couple to G proteins for signal transduction [61]. CCRs are key regulators of migration and chemotaxis of multiple blood cell types, including thymocytes, granulocytes, myeloid cells, and additional types of leukocytes. We observed downregulation of CCR1 by as much as 256-fold in CLL to ~2–3 fold in other SF3B1MT cancers (S12 Table). Notably, CCR1 knockout mice show impaired hematopoiesis as well as defective immune and inflammatory responses [62]. The potential importance of CCR1 in splicing factor-mutant cancers is further suggested by the observation that this gene is downregulated in U2AF1MT MDS (Fig 5C). Moreover, in SRSF2 and ZRSR2 mutant MDS, CCR2 is downregulated, and this CCR may substitute for the downregulation of CCR1 as both receptors share a ligand (CCL7). In general, CCRs may play an important role in cancers associated with splicing factor mutations, since additional CCRs are downregulated in SF3B1 blood and non-blood cancers, as well as U2AF1 MDS, with downregulation ranging from 2 to 50-fold (Fig 5C; S12 Table).
The CCRs have multiple ligands which are usually C-C motif chemokine ligands (CCLs). We found that many of the CCLs are also downregulated in SF3B1MT cancers, and in other cancers associated with splicing factor mutations (Fig 5D; S12 Table). For example, the CCL ligands for CCR1 that are downregulated in SF3B1 MDS include CCL3, CCL4, CCL5, CCL7, CCL14 and CCL23 (Fig 5D). Critically, although SF3B1 is best known for its role in splicing, we did not detect mis-splicing of the CCR/CCLs (S2 Table), suggesting that SF3B1 mutations might also impact the process of transcription, 3’ end formation or RNA decay.
Splicing factor mutation increases transcriptional pause release in hESCs
Analyses of gene activity in SF3B1MT cells, whether in hESCs or cancer cells, have demonstrated that changes in gene expression are not merely the consequence of alterations in splicing (S2 and S4 Tables). This intriguing observation suggests a potential influence of the SF3B1K700E mutation on the process of transcription. To test this idea directly, we performed PRO-seq, which maps the position of engaged RNA Pol II at single nucleotide resolution [41,63] This involves the transcriptional incorporation of a biotin-NTP, which is used to halt transcription and stringently isolate nascent RNAs. To allow for absolute quantification of differences between WT and SF3B1K700E hESCs, exogenous RNAs were spiked into each sample.
We first investigated all genes differentially expressed in RNA-seq (as in Fig 2C), comparing the fold changes in RNA-seq vs. PRO-seq signal between SF3B1K700E and WT cells. This analysis revealed a strong positive correlation between the two datasets (Fig 6A, r = 0.66), indicating that the observed changes in steady state RNA levels largely reflect altered transcription. This result is consistent with the sizeable changes in gene expression observed in SF3B1K700E mutant cells, despite only modest splicing defects, and is in agreement with previous work showing that the U2 snRNP can influence transcription [64,65]. To probe this result further, we evaluated PRO-seq signal at several example genes that were differentially expressed genes in RNA-seq. At upregulated gene FDXR, we observed elevated signal in both RNA-seq and PRO-seq (Fig 6B, left), and at downregulated gene MAP3K7, we observed reduced signal in both assays (Fig 6B, right). Indeed, investigation of all upregulated genes showed a significant increase in gene body PRO-seq signal (Fig 6C), in agreement with the RNA-seq. Similarly, we observe a significant decrease in gene body PRO-seq signal at genes downregulated in RNA-seq (Fig 6D). This analysis confirms that the SF3B1K700E mutation elicits changes in transcription that contribute to altered steady state RNA levels.
We next wished to determine the mechanism by which genes are differentially transcribed in SF3B1K700E mutant cells. Interestingly, we observed a significant decrease in the peak of promoter RNA Pol II occupancy at both up- and down-regulated genes in SF3B1K700E mutant cells compared to WT (Fig 6E). Notably, this result is consistent with previous work evaluating changes in RNA Pol II elongation properties in SF3B1K700E knock-in K562 cells [66], which also reported broadly lower RNA Pol II density near promoters. Thus, we wished to further probe the nature of this phenomenon. We considered that a reduction in promoter RNA Pol II levels might signify lower levels of transcription initiation in cells with the SF3B1K700E mutation, as suggested previously [66]. However, lower transcription initiation in SF3B1K700E cells would be expected to reduce gene expression and thus could not explain why most genes have unchanged or upregulated expression in SF3B1K700E hESCs. We thus investigated whether the broadly lower levels of promoter-associated RNA Pol II suggests that the SF3B1K700E mutation stimulates the release of paused RNA Pol II into productive elongation. To address this possibility, we calculated the pausing index, which is the ratio of promoter-proximal to gene body PRO-seq signal. Graphing the cumulative distribution of pausing indices in WT and SF3B1K700E mutant cells revealed a significant reduction of pausing in cells with the SF3B1K700E mutation (Fig 6F), at both upregulated (left) and downregulated genes (right). These results demonstrate that SF3B1K700E mutant cells have more efficient pause release globally, which could explain the increased expression of upregulated genes. We envision that the hematopoietic and immune-related genes upregulated in SF3B1K700E hESCs are normally rate limited at the level of pause release, such that a stimulation of pause release associated with the SF3B1K700E mutation is sufficient to increase gene activity. In contrast, the downregulated genes appear to have lower rates of transcription initiation, such that the RNA Pol II released from promoters is not effectively replaced by newly initiated RNA Pol II. In support of this idea, recent work in leukemia cells demonstrated that precocious pause release caused by SF3B1 mutations could lead to reduced chromatin accessibility and epigenetic marks of transcription activity [66].
Conclusions
The sophisticated interplay between transcription and splicing factors means that alterations in assembly of the spliceosome can have profound effects on the transcription process. Utilizing CRISPR-Cas9 technology, we developed a human embryonic stem cell (hESC) model with the SF3B1K700E mutation and assessed differential splicing, gene expression and nascent RNA synthesis. Our studies of SF3B1 function in a non-cancer cell line provides further insights into how this mutation alters the dynamics of RNA Pol II and mature mRNA output. We show that SF3B1 mutations directly impact transcription, increasing pause release. We note that a recent study probing changes in RNA Pol II elongation after short-term inhibition of U2 snRNP with chemical inhibitors suggested that U2 snRNP stabilizes pausing, however our results and those from SF3B1 mutant leukemia cell lines indicate that pause release is more efficient in cells with SF3B1 mutations. Indeed, recent work has demonstrated that chemical inhibitors and splicing factor mutations elicit different effects on gene expression. Our analysis of patients’ bulk RNA-seq data reflects a mixture of signals from different cell types, and future studies will be needed to assess gene expression changes within distinct cell populations from individuals with SF3B1 mutations. Taken together with studies in cancer systems, our results support that formation of the pre-spliceosome including the U2 snRNP stimulates transcription elongation. We conclude that mutations in splicing factors have effects on RNA formation that go well beyond altered splicing profiles.
We suggest that more efficient pause release is a general feature of transcription in SF3B1K700E mutant cells, yet that the effect of the SF3B1K700E mutation on RNA levels can vary from cell type to cell type, dependent on whether transcription initiation or pause release is limiting at that gene. Certainly, the chromatin contexts of many genes differ between MDS compared to hESCs, which would undoubtedly influence the consequences of increased pause release. Nonetheless, we speculate that the similar mis-splicing patterns of SF3B1K700E mutation in hESCs and cancers cells reflect global roles of SF3B1 in the assembly and function of the pre-spliceosome.
Further investigation will be necessary to develop targeted strategies for inhibiting or ameliorating splicing-disrupted diseases such as MDS and AML. Among the different gene programs that could be exploited by cancer cells, mutation of splicing factors such as SF3B1 could increase the transcriptional output not only by affecting spicing itself but also by tuning RNA Pol II dynamics. Our results in hESC as an undifferentiated model showed that this specific mutation does not change the expression of pluripotency and stemness genes but drives a transcriptional signature related to hematopoiesis and immune responses. This finding supports the role of SF3B1K700E in regulating immune signatures across multiple tumors and demonstrates that this could occur via a common mechanism impacting pausing by RNA Pol II. Future studies will demonstrate whether tumor-specific patterns of transcriptional dysregulation are controlled by this mechanism. While mutational landscapes in tumor or MDS samples may complicate the interpretation of the impact of SF3B1 mutation on tumorigenesis, our hESC model could be useful to uncover the mechanisms underlying gene dysregulation in splicing factor-mutant cancers. Taken together, our study provides insights regarding the most common SF3B1 mutation that might be applicable for future targeted therapeutics.
Supporting information
S1 Fig. Gene set enrichment analysis (GSEA) comparing _SF3B1_mutant versus WT ES cells.
(A) Bubble plot summarizing the top enriched biological pathways among differentially expressed genes. The x-axis shows the normalized enrichment score (NES), with positive values indicating upregulation in SF3B1_mutants and negative values indicating downregulation. Bubble size reflects gene set size, and bubble color encodes the adjusted p-value (padj), only pathways with padj < 0.1 and |NES| > 1 are shown. (B) Representative enrichment plots for two leading pathways. Left, protein–DNA complex assembly (NES = −1.45; padj = 0.068) is negatively enriched in SF3B1 mutants. Right, regulation of inflammatory response (NES = 1.35; padj = 0.09) is positively enriched. For each plot, the running enrichment score (top, red) is displayed across the ranked gene list (middle barcode plot; red = up in SF3B1_mutant, blue = down), with the rank metric distribution shown below. (C) Bar plot of selected hallmark, GO, and Reactome pathways ranked by NES. Red bars show pathways upregulated in SF3B1_mutants; blue bars denote downregulated pathways. The x-axis shows NES.
https://doi.org/10.1371/journal.pone.0334361.s001
(TIF)
S2 Fig. Gene co-expression network analysis using WGCNA in SF3B1MU MDS patients.
(A) Determination of soft-thresholding power in WGCNA. Scale-free topology index and mean connectivity for each power are shown. In this study, the threshold was reached for a power of 3. (B) Dendrogram of differentially expressed genes clustered based on the difference metrics (1-TOM). (C) Sample dendrogram and trait heatmap based on expression data from 3 MDS datasets.
https://doi.org/10.1371/journal.pone.0334361.s002
(TIF)
S1 Table. SF3B1MT and SF3B1WT patient information in GEO, TCGA and BeatAML cohorts.
https://doi.org/10.1371/journal.pone.0334361.s003
(XLSX)
S2 Table. Mis-splicing events in RNA-seq from 3 biological replicates of SF3B1K700E compared to SF3B1WT ES cells.
https://doi.org/10.1371/journal.pone.0334361.s004
(XLSX)
S3 Table. ∆PSI% and p-values for shared mis-splicing events in SF3B1K700E ES cells and SF3B1MT cancers/cell lines.
https://doi.org/10.1371/journal.pone.0334361.s005
(XLSX)
S4 Table. Genes that are differentially expressed in SF3B1K700E compared to SF3B1WT ES cells.
https://doi.org/10.1371/journal.pone.0334361.s006
(XLSX)
S5 Table. GSEA of dysregulated genes in SF3B1K700E ES cells.
https://doi.org/10.1371/journal.pone.0334361.s007
(XLSX)
S6 Table. Upregulated immune genes in SF3B1K700E ES cells.
https://doi.org/10.1371/journal.pone.0334361.s008
(XLSX)
S7 Table. GSEA analysis in SF3B1MT and SF3B1WT MDS patients.
https://doi.org/10.1371/journal.pone.0334361.s009
(XLSX)
S8 Table. Enriched pathways in module eigengenes, related to Fig 3.
https://doi.org/10.1371/journal.pone.0334361.s010
(XLSX)
S9 Table. Gene interactions in the M6_purple module and M7_salmon module.
https://doi.org/10.1371/journal.pone.0334361.s011
(XLSX)
S10 Table. GSEA analysis in SF3B1MT and SF3B1WT AML and CLL patients.
https://doi.org/10.1371/journal.pone.0334361.s012
(XLSX)
S11 Table. GSEA analyses of RNA-seq data in SF3B1MT versus SF3B1WT BRCA and UVM patients.
https://doi.org/10.1371/journal.pone.0334361.s013
(XLSX)
S12 Table. Downregulated Chemokine Receptors (CCRs) and their ligands (CCLs) in splicing factor mutant cancers.
https://doi.org/10.1371/journal.pone.0334361.s014
(XLSX)
S13 Table. RNA Pol II transcription levels measured using PRO-seq.
https://doi.org/10.1371/journal.pone.0334361.s015
(XLSX)
Acknowledgments
We thank Dr. Thorsten M. Schlaeger for technical assistance with ES cell culture and characterization. We are grateful to Drs. John N. Hutchinson, Kuan-Ting Lin, and Research Computing Consultant groups at HMS for assistance in RNA-seq analysis. We thank WiCell Research Institute (Madison, WI) for H9 ES cells (WA09). The authors would like to thank the Nascent Transcriptomics Core at Harvard Medical School, Boston, MA for generating PRO-seq libraries. We thank Dr. Danesh Moazed for his critical comments on the manuscript.
References
- 1. Anczuków O, Krainer AR. Splicing-factor alterations in cancers. RNA. 2016;22(9):1285–301. pmid:27530828
- 2. Zhou Z, Gong Q, Wang Y, Li M, Wang L, Ding H, et al. The biological function and clinical significance of SF3B1 mutations in cancer. Biomark Res. 2020;8:38. pmid:32905346
- 3. Bradley RK, Anczuków O. RNA splicing dysregulation and the hallmarks of cancer. Nat Rev Cancer. 2023;23(3):135–55. pmid:36627445
- 4. Harbour JW, Roberson EDO, Anbunathan H, Onken MD, Worley LA, Bowcock AM. Recurrent mutations at codon 625 of the splicing factor SF3B1 in uveal melanoma. Nat Genet. 2013;45(2):133–5. pmid:23313955
- 5. Jenkins JL, Kielkopf CL. Splicing factor mutations in myelodysplasias: insights from spliceosome structures. Trends Genet. 2017;33(5):336–48. pmid:28372848
- 6. Palomo L, Solé F. SF3B1: the lord of the rings in MDS. Blood. 2020;136(2):149–51.
- 7. Pellagatti A, Armstrong RN, Steeples V, Sharma E, Repapi E, Singh S, et al. Impact of spliceosome mutations on RNA splicing in myelodysplasia: dysregulated genes/pathways and clinical associations. Blood. 2018;132(12):1225–40. pmid:29930011
- 8.
Wan Y, Wu CJ. SF3B1 mutations in chronic lymphocytic leukemia. 2013;121(23):8.
- 9. Wang L, Brooks AN, Fan J, Wan Y, Gambe R, Li S, et al. Transcriptomic Characterization of SF3B1 Mutation Reveals Its Pleiotropic Effects in Chronic Lymphocytic Leukemia. Cancer Cell. 2016;30(5):750–63. pmid:27818134
- 10. Gozani O, Potashkin J, Reed R. A potential role for U2AF-SAP 155 interactions in recruiting U2 snRNP to the branch site. Mol Cell Biol. 1998;18(8):4752–60. pmid:9671485
- 11. Bennett M, Reed R. Correspondence between a mammalian spliceosome component and an essential yeast splicing factor. Science. 1993;262(5130):105–8. pmid:8211113
- 12. Champion-Arnaud P, Reed R. The prespliceosome components SAP 49 and SAP 145 interact in a complex implicated in tethering U2 snRNP to the branch site. Genes Dev. 1994;8(16):1974–83.
- 13. Das R, Reed R. Resolution of the mammalian E complex and the ATP-dependent spliceosomal complexes on native agarose mini-gels. RNA. 1999;5(11):1504–8. pmid:10580479
- 14. Das R, Zhou Z, Reed R. Functional association of U2 snRNP with the ATP-independent spliceosomal complex E. Mol Cell. 2000;5(5):779–87. pmid:10882114
- 15. Gozani O, Feld R, Reed R. Evidence that sequence-independent binding of highly conserved U2 snRNP proteins upstream of the branch site is required for assembly of spliceosomal complex A. Genes Dev. 1996;10(2):233–43. pmid:8566756
- 16. Wang C, Chua K, Seghezzi W, Lees E, Gozani O, Reed R. Phosphorylation of spliceosomal protein SAP 155 coupled with splicing catalysis. Genes Dev. 1998;12(10):1409–14. pmid:9585501
- 17. Reed R. Mechanisms of fidelity in pre-mRNA splicing. Curr Opin Cell Biol. 2000;12(3):340–5. pmid:10801464
- 18. Will CL, Luhrmann R. Spliceosome Structure and Function. Cold Spring Harbor Perspectives in Biology. 2011;3(7):a003707.
- 19. Cretu C, Schmitzová J, Ponce-Salvatierra A, Dybkov O, De Laurentiis EI, Sharma K, et al. Molecular Architecture of SF3b and structural consequences of its cancer-related mutations. Mol Cell. 2016;64(2):307–19. pmid:27720643
- 20. Finci LI, Zhang X, Huang X, Zhou Q, Tsai J, Teng T, et al. The cryo-EM structure of the SF3b spliceosome complex bound to a splicing modulator reveals a pre-mRNA substrate competitive mechanism of action. Genes Dev. 2018;32(3–4):309–20.
- 21. Madan V, Li J, Zhou S, Teoh WW, Han L, Meggendorfer M, et al. Distinct and convergent consequences of splice factor mutations in myelodysplastic syndromes. Am J Hematol. 2020;95(2):133–43. pmid:31680297
- 22. Pollyea DA, Harris C, Rabe JL, Hedin BR, De Arras L, Katz S, et al. Myelodysplastic syndrome-associated spliceosome gene mutations enhance innate immune signaling. Haematologica. 2019;104(9):e388–92. pmid:30846499
- 23. Lee SC-W, North K, Kim E, Jang E, Obeng E, Lu SX, et al. Synthetic lethal and convergent biological effects of cancer-associated spliceosomal gene mutations. Cancer Cell. 2018;34(2):225-241.e8. pmid:30107174
- 24. Sallman DA, List A. The role of innate immunity in MDS pathogenesis. Hemasphere. 2019;3(Suppl):135–7. pmid:35309825
- 25. Lieu YK, Liu Z, Ali AM, Wei X, Penson A, Zhang J. SF3B1 mutant-induced missplicing of MAP3K7 causes anemia in myelodysplastic syndromes. Proc Natl Acad Sci USA. 2022;119(1):e2111703119.
- 26. Alsafadi S, Houy A, Battistella A, Popova T, Wassef M, Henry E, et al. Cancer-associated SF3B1 mutations affect alternative splicing by promoting alternative branchpoint usage. Nat Commun. 2016;7:10615. pmid:26842708
- 27. Alsafadi S, Dayot S, Tarin M, Houy A, Bellanger D, Cornella M. Genetic alterations of SUGP1 mimic mutant-SF3B1 splice pattern in lung adenocarcinoma and other cancers. Oncogene. 2021;40(1):85–96.
- 28. Darman RB, Seiler M, Agrawal AA, Lim KH, Peng S, Aird D, et al. Cancer-associated SF3B1 Hotspot Mutations Induce Cryptic 3’ splice site selection through use of a different branch point. Cell Rep. 2015;13(5):1033–45. pmid:26565915
- 29. Ten Hacken E, Valentin R, Regis FFD, Sun J, Yin S, Werner L, et al. Splicing modulation sensitizes chronic lymphocytic leukemia cells to venetoclax by remodeling mitochondrial apoptotic dependencies. JCI Insight. 2018;3(19):e121438. pmid:30282833
- 30. Mupo A, Seiler M, Sathiaseelan V, Pance A, Yang Y, Agrawal AA, et al. Hemopoietic-specific Sf3b1-K700E knock-in mice display the splicing defect seen in human MDS but develop anemia without ring sideroblasts. Leukemia. 2017;31(3):720–7. pmid:27604819
- 31. Paolella BR, Gibson WJ, Urbanski LM, Alberta JA, Zack TI, Bandopadhayay P, et al. Copy-number and gene dependency analysis reveals partial copy loss of wild-type SF3B1 as a novel cancer vulnerability. eLife. 2017.
- 32. Saez B, Walter MJ, Graubert TA. Splicing factor gene mutations in hematologic malignancies. Blood. 2017;129(10):1260–9. pmid:27940478
- 33. Yin S, Gambe RG, Sun J, Martinez AZ, Cartun ZJ, Regis FFD. A murine model of chronic lymphocytic leukemia based on B cell-restricted expression of Sf3b1 mutation and Atm deletion. Cancer Cell. 2019;35(2):283-296.e5.
- 34. Kanagal-Shamanna R, Montalban-Bravo G, Sasaki K, Darbaniyan F, Jabbour E, Bueso-Ramos C, et al. Only SF3B1 mutation involving K700E independently predicts overall survival in myelodysplastic syndromes. Cancer. 2021;127(19):3552–65. pmid:34161603
- 35. Obeng EA, Chappell RJ, Seiler M, Chen MC, Campagna DR, Schmidt PJ, et al. Physiologic Expression of Sf3b1(K700E) Causes impaired erythropoiesis, aberrant splicing, and sensitivity to therapeutic spliceosome modulation. Cancer Cell. 2016;30(3):404–17. pmid:27622333
- 36. Simmler P, Ioannidi EI, Mengis T, Marquart KF, Asawa S, Van-Lehmann K. Mutant SF3B1 promotes malignancy in PDAC. eLife. 2023;12:e80683.
- 37. Chen G, Gulbranson DR, Hou Z, Bolin JM, Ruotti V, Probasco MD, et al. Chemically defined conditions for human iPSC derivation and culture. Nat Methods. 2011;8(5):424–9. pmid:21478862
- 38. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. pmid:23104886
- 39. Lin KT, Krainer AR. PSI-Sigma: a comprehensive splicing-detection method for short-read and long-read RNA-seq analysis. Bioinformatics. 2019;35(23):5048–54.
- 40. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. pmid:19114008
- 41. Mimoso CA, Goldman SR. PRO-seq: Precise Mapping of Engaged RNA Pol II at single-nucleotide resolution. Curr Protoc. 2023;3(12):e961. pmid:38149731
- 42. Smith T, Heger A, Sudbery I. UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy. Genome Res. 2017;27(3):491–9. pmid:28100584
- 43. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–9. pmid:22388286
- 44. Martin B, Adelman K, Nelson G. AdelmanLab/NIH_scripts: AdelmanLab/NIH_scripts/v1 (Version v1). Zenodo. 2021.
- 45. Brunet de la Grange P, Armstrong F, Duval V, Rouyez M-C, Goardon N, Romeo P-H, et al. Low SCL/TAL1 expression reveals its major role in adult hematopoietic myeloid progenitors and stem cells. Blood. 2006;108(9):2998–3004. pmid:16849639
- 46. Parente R, Clark SJ, Inforzato A, Day AJ. Complement factor H in host defense and immune evasion. Cell Mol Life Sci. 2017;74(9):1605–24. pmid:27942748
- 47. Vo KK, Jarocha DJ, Lyde RB, Hayes V, Thom CS, Sullivan SK, et al. FLI1 level during megakaryopoiesis affects thrombopoiesis and platelet biology. Blood. 2017;129(26):3486–94. pmid:28432223
- 48. Zhang HZ, Rogulina S, Chen W, Degar BA, Forget BG. Effects of the Homeodomain Gene Pitx2 on Self-Renewal of Hematopoietic Stem Cells. Blood. 2007;110(11):95–95.
- 49. Gettins PGW. Serpin structure, mechanism, and function. Chem Rev. 2002;102(12):4751–804. pmid:12475206
- 50. Ranjbaran R, Abbasi M, Rahimian E, Dehbidi GR, Seyyedi N, Zare F, et al. GDF-15 negatively regulates excess erythropoiesis and its overexpression is involved in erythroid hyperplasia. Exp Cell Res. 2020;397(2):112346. pmid:33164866
- 51. Wheeler EC, Martin BJE, Doyle WC, Neaher S, Conway CA, Pitton CN, et al. Splicing modulators impair DNA damage response and induce killing of cohesin-mutant MDS and AML. Sci Transl Med. 2024;16(728):eade2774. pmid:38170787
- 52. De La Garza A, Cameron RC, Nik S, Payne SG, Bowman TV. Spliceosomal component Sf3b1 is essential for hematopoietic differentiation in zebrafish. Experimental Hematology. 2016 Sep;44(9):826–37.e4.
- 53. Matsunawa M, Yamamoto R, Sanada M, Sato A, Shiozawa Y, Yoshida K. Role of Sf3b1 on hematopoiesis. Blood. 2013;122(21):600.
- 54. Wang C, Sashida G, Saraya A, Ishiga R, Koide S, Oshima M, et al. Depletion of Sf3b1 impairs proliferative capacity of hematopoietic stem cells but is not sufficient to induce myelodysplasia. Blood. 2014;123(21):3336–43. pmid:24735968
- 55. Zhang J, Ali AM, Lieu YK, Liu Z, Gao J, Rabadan R, et al. Disease-Causing Mutations in SF3B1 Alter Splicing by Disrupting Interaction with SUGP1. Mol Cell. 2019;76(1):82-95.e7. pmid:31474574
- 56. Tyner JW, Tognon CE, Bottomly D, Wilmot B, Kurtz SE, Savage SL, et al. Functional genomic landscape of acute myeloid leukaemia. Nature. 2018;562(7728):526–31. pmid:30333627
- 57. Seiler M, Peng S, Agrawal AA, Palacino J, Teng T, Zhu P, et al. Somatic Mutational Landscape of Splicing Factor Genes and Their Functional Consequences across 33 Cancer Types. Cell Rep. 2018;23(1):282-296.e4. pmid:29617667
- 58. Kim E, Ilagan JO, Liang Y, Daubner GM, Lee SC-W, Ramakrishnan A, et al. SRSF2 Mutations Contribute to Myelodysplasia by Mutant-Specific Effects on Exon Recognition. Cancer Cell. 2015;27(5):617–30. pmid:25965569
- 59. Bonnal SC, López-Oreja I, Valcárcel J. Roles and mechanisms of alternative splicing in cancer - implications for care. Nat Rev Clin Oncol. 2020;17(8):457–74. pmid:32303702
- 60. Ilagan JO, Ramakrishnan A, Hayes B, Murphy ME, Zebari AS, Bradley P, et al. U2AF1 mutations alter splice site recognition in hematological malignancies. Genome Res. 2015;25(1):14–26. pmid:25267526
- 61. Allen SJ, Crown SE, Handel TM. Chemokine:receptor structure, interactions, and antagonism. Annu Rev Immunol. 2007;25(1):787–820.
- 62. Gao JL, Wynn TA, Chang Y, Lee EJ, Broxmeyer HE, Cooper S, et al. Impaired host defense, hematopoiesis, granulomatous inflammation and type 1-type 2 cytokine balance in mice lacking CC chemokine receptor 1. J Exp Med. 1997;185(11):1959–68. pmid:9166425
- 63. Kwak H, Fuda NJ, Core LJ, Lis JT. Precise maps of RNA polymerase reveal how promoters direct initiation and pausing. Science. 2013;339(6122):950–3.
- 64. Caizzi L, Monteiro-Martins S, Schwalb B, Lysakovskaia K, Schmitzova J, Sawicka A, et al. Efficient RNA polymerase II pause release requires U2 snRNP function. Mol Cell. 2021;81(9):1920-1934.e9. pmid:33689748
- 65. Castillo-Guzman D, Hartono SR, Frederick M, Sanz LA, Sereiva T, Chédin F. SF3B1-targeted Splicing Inhibition Triggers Transcriptional Stress Response and Global Alterations in R-Loop Landscape. Cold Spring Harbor Laboratory; 2020.
- 66. Boddu PC, Gupta AK, Roy R, De La Peña Avalos B, Olazabal-Herrero A, Neuenkirchen N, et al. Transcription elongation defects link oncogenic SF3B1 mutations to targetable alterations in chromatin landscape. Mol Cell. 2024;84(8):1475-1495.e18. pmid:38521065