Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Islands of genomic stability in the face of genetically unstable metastatic cancer

  • Kirsten Bowland,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, United States of America

  • Jiaying Lai,

    Roles Formal analysis, Investigation, Software, Writing – review & editing

    Affiliation Institute for Computational Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America

  • Alyza Skaist,

    Roles Formal analysis, Software, Writing – review & editing

    Affiliation Department of Oncology, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, United States of America

  • Yan Zhang,

    Roles Software, Writing – review & editing

    Affiliation Department of Oncology, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, United States of America

  • Selina Shiqing K. Teh,

    Roles Conceptualization, Writing – review & editing

    Affiliation Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, United States of America

  • Nicholas J. Roberts,

    Roles Supervision, Writing – review & editing

    Affiliations Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, United States of America, Department of Oncology, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, United States of America

  • Elizabeth Thompson,

    Roles Formal analysis, Supervision, Validation, Writing – review & editing

    Affiliation Department of Oncology, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, United States of America

  • Sarah J. Wheelan,

    Roles Formal analysis, Software, Writing – review & editing

    Affiliation National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, United States of America

  • Ralph H. Hruban,

    Roles Conceptualization, Supervision, Writing – review & editing

    Affiliations Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, United States of America, Department of Oncology, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, United States of America

  • Rachel Karchin,

    Roles Formal analysis, Resources, Supervision, Writing – review & editing

    Affiliations Institute for Computational Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America

  • Matthew H. Bailey,

    Roles Conceptualization, Formal analysis, Methodology, Writing – review & editing

    Affiliation Department of Biology, Brigham Young University, Provo, UT, United States of America

  • Christine A. Iacobuzio-Donahue,

    Roles Conceptualization, Data curation, Investigation, Methodology, Resources, Supervision, Writing – review & editing

    Affiliations Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, United States of America, Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY, United States of America

  • James R. Eshleman

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    jeshlem@jhmi.edu

    Affiliations Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, United States of America, Department of Oncology, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, United States of America

Abstract

Introduction

Metastatic cancer affects millions of people worldwide annually and is the leading cause of cancer-related deaths. Most patients with metastatic disease are not eligible for surgical resection, and current therapeutic regimens have varying success rates, some with 5-year survival rates below 5%. Here, we test the hypothesis that metastatic cancer can be genetically targeted by exploiting single base substitution mutations unique to individual cells that occur as part of normal aging prior to transformation. These mutations are targetable because ~10% of them form novel tumor-specific “NGG” protospacer adjacent motif (PAM) sites targetable by CRISPR-Cas9.

Methods

Whole genome sequencing was performed on five rapid autopsy cases of patient-matched primary tumor, normal and metastatic tissue from pancreatic ductal adenocarcinoma decedents. CRISPR-Cas9 PAM targets were determined by bioinformatic tumor-normal subtraction for each patient and verified in metastatic samples by high-depth capture-based sequencing.

Results

We found that 90% of PAM targets were maintained between primary carcinomas and metastases overall. We identified rules that predict PAM loss or retention, where PAMs located in heterozygous regions in the primary tumor can be lost in metastases (private LOH), but PAMs occurring in regions of loss of heterozygosity (LOH) in the primary tumor were universally conserved in metastases.

Conclusions

Regions of truncal LOH are strongly retained in the presence of genetic instability and, therefore, represent genetic vulnerabilities in pancreatic adenocarcinomas. A CRISPR-based gene therapy approach targeting these regions may be a novel way to genetically target metastatic cancer.

Introduction

Adult stem cells accumulate mutations during the normal aging process [1, 2]. Most of these are passenger mutations found in intergenic regions with minimal, if any, impact on cell fitness. Recent studies showed that the rate of accumulation varies based on tissue type but averages approximately 40 mutations per year per cell [3, 4]. Over time, this results in a unique set of mutations in each adult stem cell that is clonally propagated to daughter cells during cell division [5, 6]. In the event of oncogenic transformation, it follows that the mutational burden in the cancer-initiating cell (CIC) should be propagated not only to the growing tumor but also to metastases that arise from that cancer. These mutations should not be present in the non-neoplastic cells in the same individual. Conversely, since cancer is known to be genetically unstable, it is also possible that cancer-specific mutations that exist in the primary tumor might be lost in some or all metastases.

CRISPR-Cas9 induces site-specific DNA double-strand breaks (DSBs) which the cell must repair through endogenous DSB repair pathways [7]. As engineered for eukaryotic systems, the endonuclease component, Cas9, is guided to specific DNA target sequences by a single guide RNA (sgRNA) [8, 9]. Cleaving the target DNA by Cas9 depends on the recognition of a protospacer adjacent motif (PAM) directly downstream of the gRNA target. With SpCas9, the most used CRISPR endonuclease, the PAM recognition sequence is any base followed by two guanine bases, abbreviated “NGG” [10].

Recently, CRISPR-Cas systems have been used for cancer gene therapy. Strategies have included insertion of suicide genes [11], targeting mutations in oncogenic driver genes that contain PAM sites [12, 13], unique sequences in oncoviruses such as E6 and E7 [14], and fusion oncogenes [15, 16]. We recently reported that somatic single base substitutions (SBS) forming novel PAM sites specific to cancer cells are targetable by CRISPR-Cas9. We discovered that approximately 10% of somatic SBS mutations in cancer cells (that are not present in patient-matched normal cells) produce novel PAM sites. We further demonstrated selective cell killing of tumor cells by exploiting these novel PAMs as targets for CRISPR-Cas9 induced DNA DSBs, finding that targeting 8–12 PAMs is sufficient to reduce pancreatic ductal adenocarcinoma (PDAC) cell populations by about 95%. We showed that patient-matched normal cells were unaffected [17].

Rapid autopsy programs, where tumor tissues are harvested from patients within hours of their death, provide an invaluable tool for studying cancer heterogeneity and tumor evolution. In this study, we used samples from the Gastrointestinal Cancer Rapid Medical Donation Program [18] (PI Dr. Christine A. Iacobuzio-Donahue), many of which were established as xenograft tumors or cell lines by our group [19]. We sought to identify regions of genomic vulnerability as a precision gene therapy approach. We explored the proportion of novel PAMs present in primary pancreatic tumors that were maintained throughout metastases and the mechanism of loss when PAMs were not maintained. Using deep sequencing of patient-matched primary carcinomas and metastases, we examined the evolutionary conservation of PAMs by focusing on the earliest (truncal) mutations. We find that in 5 PDAC cases, an average of ~90% of truncal PAMs from the primary tumor are maintained throughout each patient’s metastatic lesions and that the primary mechanism when PAMs are lost is loss of heterozygosity (LOH) in metastases. Critically, we find that PAMs occurring in regions of LOH in the primary tumor adjacent to tumor suppressor and oncogenes are 100% maintained in metastases. Counterintuitively, these copy number alterations that contributed to oncogenesis may represent a genetic vulnerability to cancer as essential genes in these conserved regions have been reduced to one copy (with exception of copy-neutral LOH). Building on our previous work, this finding further supports CRISPR-Cas9 as a potential gene therapy for metastatic cancer.

Results

Next-generation sequencing reveals truncal tumor-specific PAMs in pancreatic cancer

To determine whether CRISPR targeting of tumor-specific somatic PAMs could be used as a potential gene therapy agent against metastatic disease, it is important to understand what proportion of the targets identified in the primary tumor are maintained in each metastasis. The clonal nature of cancer establishes a theoretical foundation for propagation and maintenance of mutations from the CIC. However, cancer is inherently genomically unstable, and such instability might lead to target deletion thereby impacting the efficacy of therapies targeting these mutations.

First, we determined the truncal PAMs for each case, defined here as those PAMs that descended from the original CIC. An experimental and analytical workflow is provided (Fig 1A and 1B). We performed whole genome sequencing (WGS) on metastatic cell lines derived from five rapid autopsy cases and case-matched normal samples (Table 1). In two of the cases, patients A38 and A32, multiple metastatic cell lines were available for initial whole genome sequencing (WGS) (Fig 1C and 1D). In these cases, the union of all PAMs from any of the cell lines sequenced was included for downstream investigation.

thumbnail
Fig 1. PAM discovery and experimental design.

A) Example of novel PAM, where the germline sequence is NTG, and the mutation of T > G forms a novel NGG PAM. B) Experimental and analytical workflow. C) All PAMs discovered by WGS in the four A38 cell lines and their overlap, D) also shown for the two cell lines in case A32. E) The GC content for sgRNA targeting PAMs in all five cases and F) the genomic context of PAMs in all five cases.

https://doi.org/10.1371/journal.pone.0298490.g001

thumbnail
Table 1. Sample summary for whole genome sequencing PAM discovery and PAM validation in metastases.

https://doi.org/10.1371/journal.pone.0298490.t001

Tumor-specific PAMs were bioinformatically determined using PAMfinder (GitHub) after subtracting matched tumor and normal variant call files [17]. An average of 1236 PAMs (range 379 to 2195) were discovered from each tumor-normal subtraction. PAMs were further analyzed by variant allele fraction (VAF) of 95% or higher to identify a subset of targets in regions of LOH. An average of 217 PAMs (range 18 to 448) were discovered for each subtraction with a VAF ≥ 95% (Table 2). As high-quality WGS of primary tumor tissue was unavailable due to formalin fixation, the PAMs captured in any one of the primary samples for each case were considered truncal for this study.

thumbnail
Table 2. Tumor minus normal subtractions and the total somatic variants, PAMs, and high VAF PAMs discovered from each subtraction.

https://doi.org/10.1371/journal.pone.0298490.t002

As the ultimate purpose for tumor-specific PAM discovery is for CRISPR-Cas9 targeting, sgRNA for each PAM site were identified by PAMfinder and analyzed for efficient targeting and genomic context. Optimal sgRNA design includes GC content between 40–80%, and efficiency is best when targeting regions of open chromatin [20]. For the five autopsy cases studied, 58% (range 44% to 66%) of sgRNA possessed a GC balance within this optimal window. An average of 41.8% were AT rich and suboptimal for CRISPR targeting (Fig 1E). Additionally, the efficiency of CRISPR editing in vivo has been shown to be impacted by the genomic context of the targeted PAM, with open regions of chromatin exhibiting the highest rates [21]. Aggregated across all five cases, 62% of PAMs were located in introns or near coding regions, and 38% were intragenic. Less than 1% of PAMs were found to be in coding regions of the genome (Fig 1F).

90% of PAMs from the primary tumor are maintained in metastases

We used DNA capture hybridization probes to investigate the percentage of truncal PAMs that were maintained in multiple metastases for each case (Table 1). As a preliminary analysis, all captured sequences from the primary tumor, normal tissue, and metastases for each case were visualized in Integrated Genome Viewer (IGV) [22] to confirm the presence of the PAM in the tumor samples and absence in the patient-matched normal sample (Fig 2A). As a quality control check for each case, the percent truncal for each metastasis was plotted against the average read depth for that sample to determine if low cellularity of a particular tissue sample might obscure the presence of a given PAM (S1 Fig). For each case, only metastases in the linear portion of these plots were included in the final analysis to compensate for low tumor cellularity or contamination of tumor tissue with surrounding normal cells. For each case, the subset of PAMs present in each tumor sample was plotted to show sample overlap (Fig 2B and S2 Fig).

thumbnail
Fig 2. Maintenance of truncal PAMs in metastases.

A) Screenshot of capture probe sequencing from IGV at an A>G PAM creating mutation. The PAM is absent in the captured normal tissue but present in all metastatic samples. B) Venn diagram of primary and metastatic samples for case A10 showing PAM overlap between all samples. C) Overlap of PAMs in primary samples from case A10 and D) case A38. E) The distribution of metastases in which each PAM in each case is maintained. Each data point is an individual PAM with the mean % truncal and SEM calculated. F) The percentage of truncal PAMs that are maintained in each metastasis for each case. Each data point is an individual metastasis. No statistical significance was found between any cases except A6 by one-way ANOVA (p = 0.0011). G) The percentage of PAMs shared by all lesions in each case.

https://doi.org/10.1371/journal.pone.0298490.g002

Two cases, A10 and A38, each had three samples of primary tumor. Using the capture approach, the overlap between those samples was determined, providing insight into heterogeneity within the primary tumor for these cases. A10 primary samples shared 94% of PAMs (Fig 2C). Case A38 shared 90% of PAMs between primary sections but each had distinct PAMs not detected in the other two samples (Fig 2D).

For each case, each metastasis was evaluated for the percentage of truncal PAMs it contained (percent truncal), and each individual PAM was evaluated for how many metastases in which it was present (percent maintained). A2, A6, A10, A32, and A38 had 76, 183, 120, 180, and 1360 truncal PAMs, respectively. These PAMs were maintained in 91.7%, 81.3%, 93.1%, 92.6%, and 88.9% of metastases for each case, respectively (Fig 2E). We also evaluated each metastasis to determine the percentage of truncal PAMs it contained. A2 had 6 metastases and was 89.2% truncal. A6 had 6 metastases and was 81.3% truncal. A10 had 6 metastases and was 93.6% truncal. Case A32, with 3 metastases, was 92.6% truncal. A38 had 11 metastases and was 92.2% truncal (Fig 2F). Of note, A2 lacked a sample of primary and so for this case alone, truncal was inferred as mutations present in more than one metastasis. Across all five cases, the average percentage of truncal PAMs maintained in each metastasis was 89.8% and each PAM was present in an average of 89.5% of metastases for each case. Importantly, for cases A2, A10, A32, and A38, there was no significant difference between the percentage of truncal PAMs in each metastasis by one-way ANOVA. Metastases in case A6 were less truncal than the other four cases (p = 0.0016). In addition to examining maintenance of individual PAMs and metastases, the portion of truncal PAMs present in all lesions for each case was also calculated. This includes instances where all but one metastasis shares many PAMs, but a set of PAMs are missing in one lesion. This could be because of focal or regional deletions or LOH. Examining each case this way, we find that for A2, A6, A10, A32, and A38, 81.6%, 67.8%, 80.8%, 78.9% and 65.1% of PAMs were shared between all lesions studied, respectively. Across all five cases, an average of 74.8% of PAMs discovered in the primary tumor were present in every lesion studied (Fig 2G). Phylogenetic clustering showing the microevolutionary relationship between the primary tumors and metastasis is provided (S3 Fig). In cases with multiple samples of primary tumor, A10 and A38, it is possible to trace the lineage of metastases from distinct regions of the primary tumor.

PAMs are 100% maintained in regions of truncal LOH proximal to tumor suppressor and oncogenes

A deeper analysis of copy number (CN) and LOH was performed on cases A32 and A38, as WGS of two and four tumor cell lines, respectively, and matched normal cell lines were available. Zygosity and copy number directly affected the PAM VAF in sequencing data as diagrammed (Fig 3A). We then mapped chromosomal location of PAMs together with their VAF and copy number for each tumor cell line and found that truncal PAMs were more likely to have a VAF of 100% and be in haploid regions (Fig 3B and 3C, S4 and S5 Figs).

thumbnail
Fig 3. Truncal PAMs have higher VAFs than private PAMs.

A) Diagram depicting relationship between variant allele fraction (VAF), ploidy, and copy number (CN) when interpreting next generation sequencing data. WGS from cell lines show truncal PAMs with high VAF frequently occur more frequently in haploid regions of the genome. Shown is B) A32-T1, representative for case A32, and C) A38 cell line T1, representative for case A38.

https://doi.org/10.1371/journal.pone.0298490.g003

To investigate the relationship between VAF, copy number, and PAM maintenance further, we cross-referenced the list of PAMs found in all three captured samples of the A38 primary tumor with the WGS cell line data. First, we quantified the number of truncal PAMs on each chromosome by VAF and copy number (S6 and S7 Figs). We found that on chromosomes where all PAMs were present at a VAF of 100%, all PAMs were present in all four tumor cell lines (Fig 4A and 4B). On chromosomes that were haploid in one or two cell lines but not all, indicating that the copy loss occurred later in the disease, we found that PAMs were partially lost in the haploid regions. PAMs were 100% maintained in the regions that retained heterozygosity (Fig 4C and 4D). Furthermore, PAMs were 100% maintained on chromosomes that were diploid or greater and heterozygous in all four cell lines (Fig 4E and 4F).

thumbnail
Fig 4. PAMs are maintained in regions of truncal LOH and partially lost in regions of private LOH.

All data shown is from case A38. A) All truncal PAMs on chromosome 18 in all four cell lines have a VAF of 1, and B) All truncal PAMs discovered are present in all four cell lines with a copy number of 1 (haploid). C) A38 cell lines T2-T4 are at least diploid on chromosome 14 while T1 is haploid based on VAF. D) Truncal PAMs are 50% lost in T1, corresponding to a VAF of 1 and a CN of 1. PAMs are 100% maintained in T2-T4 where the VAF remains 0.5, indicative of heterozygosity. E) Chromosome 6 is at least diploid in all cell lines. Notably, T1 and T2 are triploid as indicated by VAF peaks at 0.35 and 0.7. F) All truncal PAMs are maintained in all four cell lines as heterozygosity is maintained. G) Percent PAM maintenance, average CN, and average VAF plotted by chromosome arm for A38 cell line T1. PAM loss occurs in haploid regions, where the VAF and CN are both 1. H) PAM maintenance, average VAF, and average CN across chromosome arms aggregated for all four A38 cell lines. Notably, areas of LOH common to all cell lines (truncal LOH) present 100% PAM maintenance, while PAM loss is limited to regions of private LOH. Regions of truncal LOH can be identified where CN = 1 in all samples and/or VAF = 1 in all samples. I) Summary data of truncal PAMs in case A38 WGS data. Truncal PAMs are 100% maintained in regions of truncal LOH, 67% maintained in regions of private LOH, and 99% maintained in heterozygous regions. Pairwise comparisons by Kruskal-Wallis test. J) Plot of truncal PAM VAF comparing PAMs maintained in all four cell lines and truncal PAMs lost in at least 1 cell line. Truncal PAMs that are maintained have a significantly higher mean VAF (0.6029) than those susceptible to loss (0.4598) (Mann-Whitney Rank Sum, p = 2.267x10-16). Additionally, maintained PAMs are more likely to have a VAF of 1 indicative of their occurrence in regions of LOH. In contrast, PAMs that were lost had a maximum VAF of 0.6483. n = 1080 for maintained PAMs, n = 146 for lost PAMs. All error presented in SEM.

https://doi.org/10.1371/journal.pone.0298490.g004

We then examined the genome of each metastasis, calculating the average PAM VAF, CN, and the percentage of maintained PAMs for each chromosome arm. Strikingly, PAMs were overall maintained or lost at a rate of about 50% per copy, i.e., 50% lost in haploid regions and 33% lost in polyploid regions (Fig 4G). Aggregating data from all four A38 cell lines, we found that PAM loss was limited to regions of private LOH, where individual samples had undergone LOH, but others had not (Fig 4H and S8 Fig). Further examining patterns of PAM maintenance and loss, we found that PAMs were 100% maintained in areas of truncal LOH but lost at a rate of 30% on chromosome arms that contained private LOH. Likewise, in A32, PAMs were 100% maintained in regions of truncal LOH, and were only lost in regions of private LOH or regional deletions (S9 Fig). In A38, PAMs are maintained at a rate of 99% where heterozygosity is maintained, indicating private LOH as the primary mechanism of PAM loss (Fig 4I and S10A Fig). Further supporting this hypothesis, we found that truncal PAMs that were 100% maintained had a higher average VAF and are more likely to be in regions of haploid LOH than truncal PAMs that had been lost in at least one metastasis (Fig 4J). A similar trend was seen in case A32, although statistical significance was limited by a small number–only 3—of lost truncal PAMs (S10B Fig).

Finally, when the percent maintained was plotted as a function of VAF, we saw that truncal PAMs maintained in all four A38 cell lines have a broad VAF distribution. Truncal PAMs lost in at least one cell line, however, were 100% maintained at a VAF of 50% (regions of retained heterozygosity), or 50% lost at a VAF of 100% (regions of LOH), supporting the hypothesis that LOH was responsible for loss of individual PAMs (S10C Fig).

Discussion

Haploid conserved islands of essential genes represent a therapeutic vulnerability

In both cases for which we possessed cell lines from multiple lesions, A32 and A38, we observed haploid regions that were consistently maintained between the primary tumors and metastases. Unlike other areas of private LOH representing ongoing genomic instability, these regions were haploid in the primary tumor and 100% conserved in metastases. These regions of truncal LOH frequently occurred around tumor suppressor and oncogenes commonly mutated in PDAC and likely contributed to transformation in the cancer initiating cell. Counterintuitively, these regions may represent a therapeutic vulnerability, as many genes shown to be essential in PDAC are also located in these haploid islands [23]. LOH islands surrounding mutated oncogenes like KRAS should be under selective pressure if oncogenic addiction is present to the driver gene.

In case A38, the areas of truncal LOH included chromosome arms 1p, 3p, 8p, 9p, 15q, and the entire chromosome 18. Chromosome 8p was largely deleted in cell line A38 T2, but the regions containing truncal PAMs were conserved. These chromosome arms contain tumor suppressor genes frequently deleted or mutated in pancreatic cancer, such as CDKN2A and SMAD4. In A32, regions of truncal LOH were present on chromosomes 12p and 20p. Chromosome 12p harbors activating KRAS mutations in the majority of PDACs, including A32. It is possible that targeting PAMs in regions of haploid LOH proximal to mutated tumor suppressor genes would ensure that the targeted PAMs are maintained in all metastases, and further increase the toxicity of the incurred DSBs as those regions lack a template for homology directed repair of the DNA lesion.

CRISPR-Cas9 guided tumor-specific DSBs as a cancer gene therapy

Our previous work and that of others showed that a limited number of double strand breaks is catastrophic for cancer cells [2426]. Mechanistically, our group has found that multiple simultaneous DSBs trigger the formation of cytogenetic lesions such as radial, dicentric, and ring chromosomes. These lesions lead to catastrophic genomic instability and cell death after a period of 7 to 21 days [27].

Our group recently performed proof of principle work to demonstrate CRISPR-Cas9 as a targeted cell killing tool against cancer [17]. Here we find that 90% of tumor targets can be found in the primary tumor and metastases, demonstrating that our approach should be applicable for treating metastatic cancer. In vivo studies with mice xenografted with patient derived cancer cells and patient derived organoids would be helpful to demonstrate cell killing of multiple tumors with the same set of targets. Also, in vivo delivery of CRISPR agents is still under investigation by many researchers [28] and much optimization is needed [29]. However, one could envision an approach with lipid nanoparticle encapsulation of Cas9 mRNA and sgRNAs, similar to the COVID-19 vaccine developed by Moderna and Pfizer [30]. Moreover, as researchers dedicated to drug delivery optimize in vivo approaches for a myriad of therapeutic purposes, it is critical that basic science researchers develop therapies in parallel so that the two fields will meld in the future to the benefit of patients.

Implications of PAM maintenance on patient sampling

Determining the clonal PAMs in each metastasis should indicate how many biopsies must be taken from a patient to ensure precision targeting of a patient’s entire disease. As we show here that 90% of truncal PAMs were indeed present in metastases, it should not be necessary to extensively biopsy a patient to narrow down the set of targetable mutations to generate enough DSBs in each cancer cell to achieve cell death. Additionally, we found that PAMs were maintained at a rate of 100% in areas of truncal LOH. In these cases, the PAMs may be selected for in metastases, as loss of the PAM would mean loss of all genetic information in that region. Identifying PAMs in regions of LOH in the primary tumor should then bias the analysis towards PAMs that will be maintained in all metastases. This conclusion is supported by a recent study which found that mutations in haploid or amplified regions of the genome constituted a persistent mutation burden in cancer which was more likely to be maintained [31].

Pancreatic cancer metastases can be polyclonal in origin [32], so sampling multiple sections of the primary tumor might elucidate much of this primary tumor heterogeneity passed on to metastases. Computational modeling supports this conclusion. A recent study using spatial modeling of intratumoral heterogeneity predicted that 3–5 biopsies of a primary tumor were sufficient to identify clonal mutations, depending on tumor heterogeneity and clonal complexity [33]. Similarly, sequencing of homogenized tumor samples from surgical pathology "waste" was shown to reveal heterogenous mutations present in primary tumors [34]. One could envision our approach as an adjuvant treatment targeting residual disease after primary tumor resection–in such a scenario, multiple sequencing samples from the primary tumor should be sufficient to reveal the truncal PAMs. In cases where a patient is diagnosed after metastatic spread, which is frequently the case, or as recurrent disease after resection, a small number of biopsies should be sufficient to distinguish truncal PAMs from other subclonal PAMs and allow targeting of all metastatic sites.

Maintenance of truncal PAMs in metastases

Most studies focused on the appearance and propagation of driver mutations in tumor evolution. Previous work with case A38, published as sample Pa08, revealed a stepwise progression of driver mutations occurring subclonally in the primary tumor, allowing for clocking of initial tumorigenesis and metastatic spread [35]. Notably, patient A38 had a higher total number of somatic mutations and a commensurately higher count of novel PAMs than the other autopsy cases. This patient presented with a strong family history of cancer and was a registered member of the National Familial Pancreatic Cancer Registry, so it is likely that a germline mutation contributed to the higher mutational burden in this patient. A recent study of clonality in melanoma metastases revealed multiple metastatic lineages with distinct mutational profiles [36]. Here, however, we focused on passenger mutations, largely intergenic, which were subject to less selective pressure than drivers. Frequent events such as LOH, chromosomal deletions, and structural variants could also remove or affect the copy number of truncal PAM sites. In the context of CRISPR-based gene therapy, these alterations could impact how many DNA DSBs would be generated per target or the choice of DNA repair pathway employed after the break [37], affecting downstream cytotoxicity. To cause the highest cytotoxicity with the smallest number of DSBs to the most cancer cells, it is crucial to understand the genomic landscape of novel PAMs throughout a patient’s disease.

For the purposes of this study, we defined “truncal” as those mutations present in the primary tumor. Truncal has been previously defined as “ancestral mutations in the trunk of the phylogenetic tree that are shared by all clones” [38]. While it is impossible to sequence the CIC, clonal mutations can be inferred from multiple samples of a primary tumor. Here we included PAMs discovered in any sample of primary tumor. Targeted deep sequencing of sonicated DNA has been shown to introduce spurious SBS mutations in sequencing data [39] and somatic PAMs are frequently located in repetitive areas of the genome leading to poor probe specificity and low sequencing coverage of those regions. Multiple samples of primary tumor were available for two cases (A10 and A38). To maximize the number of PAMs considered for this study and to eliminate the chance that the PAMs are present throughout a primary tumor but not detected because of technical artifacts, all PAMs discovered in any sample of the primary tumor were included for downstream analysis of clonality.

There are some limitations to this study. Cell lines were used to generate whole genome sequencing to identify somatic PAMs, and these are not available in all cases of cancer. Ideally, one would determine this information by performing whole genome sequencing (WGS) directly on primary tumor samples, however in some cancers like pancreatic cancer, such analysis can be compromised by low primary neoplastic cellularity. Analyses can be further complicated by sample age, oxidation, and crosslinking in formalin fixed paraffin embedded (FFPE) blocks. To begin with the largest pool of PAMs possible, we sequenced high quality DNA from metastatic cell lines for each patient and designed capture probes complementary to the combination of all PAMs found in each case. Naturally such a method included many private mutations for each metastasis, including those that arose during passage in cell culture. We were able to narrow down the broad pool of mutations for each case to those specific to the primary tumor by hybridization with FFPE primary samples. It is possible that some truncal PAMs present in other subclones of the primary tumor not evolutionarily upstream of the WGS metastatic cell lines were excluded with this approach and that starting analysis with PAMs already propagated to metastases biased the analysis. However, CIC mutations should be present in every subclone of the primary tumor prior to metastasis, and so isolating the mutations present in multiple metastases and the primary tumor helped limit this potential bias.

As LOH is an event unique to cancer, investigators have attempted to exploit this signature using various methods for some time [40]. Recently, Kinzler et al. explored a CAR-T immunotherapy approach targeting LOH at HLA alleles in cancer [41]. As PAMs are preferentially maintained in haploid regions of LOH, our approach allows for prediction of targets that could be efficacious in treating metastatic disease. Identification of conserved islands of LOH surrounding known tumor suppressor and oncogenes has implications beyond our study and warrant further investigation. Our approach may allow for genetically targeting LOH in cancer for a range of therapeutic applications.

Materials and methods

WGS PAM discovery

DNA was extracted from cell pellets or frozen tissue with a Qiagen UCP Micro DNA Kit (Ref 56204) per the manufacturer’s instructions. Matched tumor and normal samples were verified by STR profiling. WGS was performed on cancer samples to a target depth of 60x coverage, and 30x coverage for normal tissue. Sentieon-genomics-202010.02 bwa v0.7.17 (mem) was used for running the alignments against hg19 genome using default parameters. Sentieon-genomics-202010.02’s THhaplotyper was used to call somatic variants between the tumor-normal pairs. CNVkit v0.9.4 (batch) was used for copy number variation detection using “wgs” as a method parameter. The resulting integer copy numbers in the call.cns files from teh CNVkit outputs were used for downstream analysis. Sentieon-genomics-202010.02’s Haplotyper was used to call germline variants in normal samples with default parameters. The variants were filtered by the mapping quality ≥20, depth ≥10 and an allele frequency ≥0.05. PAMfinder (previously described) was then used to call somatic PAMs specific to tumor samples. PAMfinder is available for download at https://github.com/selinateh/PAMfinder.

FFPE tissue macrodissection

5 μM H&E slides were cut from each FFPE rapid autopsy blocks and reviewed by an anatomical GI pathologist (ET) with marking of tumor and normal regions prior to macrodissection. 10-μM sections were cut adjacent to the H&E section and were not stained. Sections were deparaffinized with xylene and ethanol. The PinPoint slide DNA isolation system (Zymo Research, USA) was used according to manufacturer’s directions to dissect and digest desired areas of tissue. Samples were digested with proteinase K overnight and DNA was extracted the next day by QIAamp DNA Mini Kit (ref 51306) (Qiagen, USA) with washes performed twice. DNA was quantified by Qubit HS Assay (ThermoFisher Scientific, USA).

Capture probe design

xGen LockDown Hybridization Probes (IDT, USA) were custom designed. Probe pools consisted of 120bp biotinylated oligonucleotides with 2x tiling over the desired areas, where possible. Probes were designed to straddle novel PAMs found from PAMfinder script as described above. To prevent nonspecific binding, probes were not included for loci with specificity scores greater than 100 as determined by the manufacturer’s algorithm (range 1–500, >50 considered risky). Samples were broken into two pools, with A2 and A32 combined in Pool 1 (3379 probes) and A38, A10, and A6 in Pool 2 (9239 probes).

Library preparation

Macrodissected DNA samples were sheared on a Covaris S2 Focus Ultrasonicator (Covaris, USA) to a target length of 250–350 bases. Libraries were prepared by xGen cfDNA & FFPE DNA Library Prep kit (IDT, USA) per manufacturer’s instructions. Libraries were amplified with KAPA HiFi HotStart Ready Mix (Roche, USA) and bead purified by AMPure XP beads (Beckman Coulter, USA). DNA was quantified by Qubit HS Assay or D1000 High Sensitivity Screen Tape (Agilent, USA).

Hybridization

Libraries were multiplexed with equal mass ratio in 4, 5, or 8-plexes. Combined libraries were dried down with universal blockers and human cot DNA in a Savant SpeedVac (ThermoFisher Scientific) vacuum concentrator. Dried down samples were hybridized overnight with custom panels of xGen LockDown Probes (IDT, USA). Washes and bead captures were performed the following day with xGen Hybridization reagents per the manufacturer’s instructions. Captured libraries were amplified with KAPA HiFi HotStart Ready Mix (Roche, USA) and purified with AMPure XP beads. Final capture libraries were quantified on a Bioanalyzer (Agilent, USA) prior to sequencing.

Capture library sequencing

Capture libraries were sequenced with 2x250 paired end reads on an Illumina MiSeq to a target depth of 100x coverage. SAMTOOLS (RRID: SCR_002105) v1.10 and GATK v3.6.0 were used to determine coverage at different levels of partitioning and aggregation.

Analysis of capture data

Aligned reads were visualized in Integrated Genome Viewer (IGV, version 2.15.4, Broad Institute) [22] with single base substitutions called at 5% VAF cutoff, just above the threshold for background artifactual noise. Metastases were visualized concurrently with captured normal and primary tumor sequencing for each case to determine the presence or absence of each PAM. All PAMs present in the primary tumor capture but absent in captured normal were considered truncal. “Percent truncal” for each metastasis was calculated as the number of truncal PAMs present in that sample divided by the number of truncal PAMs for the case. “Percent maintained” was calculated for each PAM by dividing the number of metastases in which the PAM was present by the total number of metastases for that case.

Statistical analysis

All statistical analysis was performed with GraphPad Prism (RRID: SCR_002798) software 8.3.0. Nonparametric comparisons of three or more groups were determined with Kruskal-Wallis test and nonparametric comparisons of two groups were performed with Mann-Whitney Rank Sum test. Results were considered significant at p < 0.05. Error is displayed as SEM throughout the study.

Tree construction

Hierarchical clustering analysis was performed on the combined primary and metastatic samples with either WGS or PAM probe capture. Variants were restricted to the PAM probe sites. The presence or absence of a mutation was considered for each sample as the set of sites with a non-zero variant allele fraction from either WGS or PAM probe capture. The resulting binary matrix was clustered and plotted using UPGMA agglomeration in R. When a primary tumor was available, each tree was re-rooted and plotted using the ‘ape’ R library (https://doi.org/10.1093/bioinformatics/bty633)). When multiple primary samples were available, the primary with the fewest unique PAMs, when compared to other samples from the primary tissue of origin, was selected as the root.

Sex as a biological variable

For the analysis of PAM maintenance in metastasis, 80% of patients were male and 20% were female. Patients A2, 6, 10, and 38 were male. Patient A32 was female. For the WGS analysis of LOH islands and mechanism of PAM loss, 2 total cases were analyzed, one of which was male (A38) and one of which was female (A32).

Ethical approval for human subjects research

The Gastrointestinal Cancer Rapid Medical Donation Program (GICRMDP) was initiated at the Johns Hopkins hospital by Dr. Christine Iacobuzio-Donahue, after obtaining Institutional Review Board (IRB) approval. The Recruitment period was from May 22, 2003 to September 5, 2006. Inclusion criteria required biopsy-proven or radiographic evidence of metastatic pancreatic cancer in adults. The patients provided written informed consent, in the presence of the next of kin. The next of kin was reconsented following death of the patient. A rapid autopsy was performed within 6 hours following death. Tissue was harvested for histology, xenografting into nude mice and cell line production. IRB approval for whole genome sequencing was subsequently obtained for retrospective analysis of de-identified samples on May 19, 2011.

Supporting information

S1 Fig. Quality control analysis plotting average read depth for each sample against the calculated percent truncal for that sample, broken down by each case, and then shown overall for all cases.

Histograms showing maintenance of PAMs for each case are also shown.

https://doi.org/10.1371/journal.pone.0298490.s001

(TIF)

S2 Fig. Sets of PAM overlap for each case.

Venn diagrams were created with ggvenn from the ggplot2 R package, and the A38 subsets were plotted with UpSetR.

https://doi.org/10.1371/journal.pone.0298490.s002

(TIF)

S3 Fig. Phylogenetic trees for rapid autopsy primary tumor and metastatic samples.

https://doi.org/10.1371/journal.pone.0298490.s003

(PDF)

S4 Fig. VAF and CN for truncal and private PAMs for WGS of A38 cell lines T2-4.

https://doi.org/10.1371/journal.pone.0298490.s004

(TIF)

S5 Fig. VAF and CN for truncal and private PAMs for WGS of A32 cell line T2.

https://doi.org/10.1371/journal.pone.0298490.s005

(TIF)

S6 Fig. Histograms and CN/VAF analysis for truncal PAMs in A38 WGS cell line data, by chromosome.

https://doi.org/10.1371/journal.pone.0298490.s006

(TIF)

S7 Fig. Histograms and CN/VAF analysis for truncal PAMs in A38 WGS cell line data, by chromosome.

https://doi.org/10.1371/journal.pone.0298490.s007

(TIF)

S8 Fig. Plots of PAM maintenance, VAF, and CN for A38 cell lines T2-T4 analyzed for each chromosome arm.

https://doi.org/10.1371/journal.pone.0298490.s008

(TIF)

S9 Fig. Plots of PAM maintenance, VAF, and CN for A32 cell lines T1-2 analyzed for each chromosome arm.

https://doi.org/10.1371/journal.pone.0298490.s009

(TIF)

S10 Fig. WGS was available for 2 cell lines and normal for case A32.

A) No significant difference was found between the two cell lines for LOH and PAM maintenance (Kruskal-Wallis, p = 0.3307) or for B) VAF between truncal PAMs maintained in both lines or lost in some (Mann-Whitney rank sum, p = 0.7705). n = 165 for maintained PAMs, n = 3 for PAMs lost in one cell line but not the other. C) Plot of truncal PAM VAF comparing PAMs maintained in all four cell lines (blue) and truncal PAMs lost in at least 1 cell line (red). A wide VAF distribution in present in PAMs present in all. For PAMs lost in some, VAFs cluster at 0.5 in samples where they are maintained, 1 when they are lost at a rate of 50%.

https://doi.org/10.1371/journal.pone.0298490.s010

(TIF)

Acknowledgments

The authors would like to thank the patients and their families who underwent rapid autopsy and made this study possible. We would also like to thank Jennifer Meyers, Drs. Kornel Schuebel, and Srinivasan Yegnasubramanian, Laura Wood, Suping Chen, Richard Burkhart, Chien-Fu Hung, Angelo DeMarzo, and Ming-Tseh Lin for helpful suggestions; Arnique Coutain, Julia Mychak, Lisa Haley for technical assistance; Kailee Calder, Fidel Cai, Jonathan Chen, and Kindness Nwaukwa for assistance with data analysis.

References

  1. 1. Gensler HL, Bernstein H. DNA damage as the primary cause of aging. Q Rev Biol. 1981;56(3):279–303. pmid:7031747
  2. 2. Tomasetti C, Vogelstein B, Parmigiani G. Half or more of the somatic mutations in cancers of self-renewing tissues originate prior to tumor initiation. Proc Natl Acad Sci U S A. 2013;110(6):1999–2004. pmid:23345422
  3. 3. Moore L, Cagan A, Coorens THH, Neville MDC, Sanghvi R, Sanders MA, et al. The mutational landscape of human somatic and germline cells. Nature. 2021;597(7876):381–6. pmid:34433962
  4. 4. Blokzijl F, de Ligt J, Jager M, Sasselli V, Roerink S, Sasaki N, et al. Tissue-specific mutation accumulation in human adult stem cells during life. Nature. 2016;538(7624):260–4. pmid:27698416
  5. 5. Nowell PC. The clonal evolution of tumor cell populations. Science. 1976;194(4260):23–8. pmid:959840
  6. 6. Greaves M, Maley CC. Clonal evolution in cancer. Nature. 2012;481(7381):306–13. pmid:22258609
  7. 7. Jiang F, Doudna JA. CRISPR-Cas9 Structures and Mechanisms. Annu Rev Biophys. 2017;46:505–29. pmid:28375731
  8. 8. Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339(6121):819–23. pmid:23287718
  9. 9. Mali P, Yang L, Esvelt KM, Aach J, Guell M, DiCarlo JE, et al. RNA-guided human genome engineering via Cas9. Science. 2013;339(6121):823–6. pmid:23287722
  10. 10. Doudna JA, Charpentier E. Genome editing. The new frontier of genome engineering with CRISPR-Cas9. Science. 2014;346(6213):1258096. pmid:25430774
  11. 11. Chen ZH, Yu YP, Zuo ZH, Nelson JB, Michalopoulos GK, Monga S, et al. Targeting genomic rearrangements in tumor cells through Cas9-mediated insertion of a suicide gene. Nat Biotechnol. 2017;35(6):543–50. pmid:28459452
  12. 12. Koo T, Yoon AR, Cho HY, Bae S, Yun CO, Kim JS. Selective disruption of an oncogenic mutant allele by CRISPR/Cas9 induces efficient tumor regression. Nucleic Acids Res. 2017;45(13):7897–908. pmid:28575452
  13. 13. Kim W, Lee S, Kim HS, Song M, Cha YH, Kim YH, et al. Targeting mutant KRAS with CRISPR-Cas9 controls tumor growth. Genome Res. 2018;28(3):374–82. pmid:29326299
  14. 14. Jubair L, Fallaha S, McMillan NAJ. Systemic Delivery of CRISPR/Cas9 Targeting HPV Oncogenes Is Effective at Eliminating Established Tumors. Mol Ther. 2019;27(12):2091–9. pmid:31537455
  15. 15. Martinez-Lage M, Torres-Ruiz R, Puig-Serra P, Moreno-Gaona P, Martin MC, Moya FJ, et al. In vivo CRISPR/Cas9 targeting of fusion oncogenes for selective elimination of cancer cells. Nat Commun. 2020;11(1):5060. pmid:33033246
  16. 16. Vuelta E, Ordonez JL, Sanz DJ, Ballesteros S, Hernandez-Rivas JM, Mendez-Sanchez L, et al. CRISPR/Cas9-Directed Gene Trap Constitutes a Selection System for Corrected BCR/ABL Leukemic Cells in CML. Int J Mol Sci. 2022;23(12). pmid:35742831
  17. 17. Teh SSK, Bowland K, Bennett A, Halper-Stromberg E, Skaist A, Tang J, et al. CRISPR-Cas9 for selective targeting of somatic mutations in pancreatic cancers. bioRxiv. 2023. pmid:37131822
  18. 18. Embuscado EE, Laheru D, Ricci F, Yun KJ, de Boom Witzel S, Seigel A, et al. Immortalizing the complexity of cancer metastasis: genetic features of lethal metastatic pancreatic cancer obtained from rapid autopsy. Cancer Biol Ther. 2005;4(5):548–54. pmid:15846069
  19. 19. Iacobuzio-Donahue CA, Fu B, Yachida S, Luo M, Abe H, Henderson CM, et al. DPC4 gene status of the primary carcinoma correlates with patterns of failure in patients with pancreatic cancer. J Clin Oncol. 2009;27(11):1806–13. pmid:19273710
  20. 20. Konstantakos V, Nentidis A, Krithara A, Paliouras G. CRISPR-Cas9 gRNA efficiency prediction: an overview of predictive tools and the role of deep learning. Nucleic Acids Res. 2022;50(7):3616–37. pmid:35349718
  21. 21. Kader M, Sun W, Ren BG, Yu YP, Tao J, Foley LM, et al. Therapeutic targeting at genome mutations of liver cancer by the insertion of HSV1 thymidine kinase through Cas9-mediated editing. Hepatol Commun. 2024;8(4). pmid:38497929
  22. 22. Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29(1):24–6. pmid:21221095
  23. 23. Zhao M, Sun J, Zhao Z. TSGene: a web resource for tumor suppressor genes. Nucleic Acids Res. 2013;41(Database issue):D970–6. pmid:23066107
  24. 24. Teh SSK, Halper-Stromberg E, Morsberger L, Bennett A, Bowland K, Skaist A, et al. Mechanism of delayed cell death following simultaneous CRISPR-Cas9 targeting in pancreatic cancers. bioRxiv. 2023. pmid:37066222
  25. 25. van den Berg J, A GM, Kielbassa K, Feringa FM, Freire R, Medema RH A limited number of double-strand DNA breaks is sufficient to delay cell cycle progression. Nucleic Acids Res. 2018;46(19):10132–44. pmid:30184135
  26. 26. Leibowitz ML, Papathanasiou S, Doerfler PA, Blaine LJ, Sun L, Yao Y, et al. Chromothripsis as an on-target consequence of CRISPR-Cas9 genome editing. Nat Genet. 2021;53(6):895–905. pmid:33846636
  27. 27. Cosper PF, Copeland SE, Tucker JB, Weaver BA. Chromosome Missegregation as a Modulator of Radiation Sensitivity. Semin Radiat Oncol. 2022;32(1):54–63. pmid:34861996
  28. 28. Gillmore JD, Gane E, Taubel J, Kao J, Fontana M, Maitland ML, et al. CRISPR-Cas9 In Vivo Gene Editing for Transthyretin Amyloidosis. N Engl J Med. 2021;385(6):493–502. pmid:34215024
  29. 29. Reardon S. FDA Approves First CRISPR Gene Editing Treatment for Sickle Cell Disease. Scientific American. 2023.
  30. 30. Park KS, Sun X, Aikins ME, Moon JJ. Non-viral COVID-19 vaccine delivery systems. Adv Drug Deliv Rev. 2021;169:137–51. pmid:33340620
  31. 31. Niknafs N, Balan A, Cherry C, Hummelink K, Monkhorst K, Shao XM, et al. Persistent mutation burden drives sustained anti-tumor immune responses. Nat Med. 2023;29(2):440–9. pmid:36702947
  32. 32. Maddipati R, Stanger BZ. Pancreatic Cancer Metastases Harbor Evidence of Polyclonality. Cancer Discov. 2015;5(10):1086–97. pmid:26209539
  33. 33. Opasic L, Zhou D, Werner B, Dingli D, Traulsen A. How many samples are needed to infer truly clonal mutations from heterogenous tumours? BMC Cancer. 2019;19(1):403. pmid:31035962
  34. 34. Litchfield K, Stanislaw S, Spain L, Gallegos LL, Rowan A, Schnidrig D, et al. Representative Sequencing: Unbiased Sampling of Solid Tumor Tissue. Cell Rep. 2020;31(5):107550. pmid:32375028
  35. 35. Yachida S, Jones S, Bozic I, Antal T, Leary R, Fu B, et al. Distant metastasis occurs late during the genetic evolution of pancreatic cancer. Nature. 2010;467(7319):1114–7. pmid:20981102
  36. 36. Rabbie R, Ansari-Pour N, Cast O, Lau D, Scott F, Welsh SJ, et al. Multi-site clonality analysis uncovers pervasive heterogeneity across melanoma metastases. Nat Commun. 2020;11(1):4306. pmid:32855398
  37. 37. Schep R, Brinkman EK, Leemans C, Vergara X, van der Weide RH, Morris B, et al. Impact of chromatin context on Cas9-induced DNA double-strand break repair pathway balance. Mol Cell. 2021;81(10):2216–30 e10. pmid:33848455
  38. 38. Davis A, Gao R, Navin N. Tumor evolution: Linear, branching, neutral or punctuated? Biochim Biophys Acta Rev Cancer. 2017;1867(2):151–61. pmid:28110020
  39. 39. Park G, Park JK, Shin SH, Jeon HJ, Kim NKD, Kim YJ, et al. Characterization of background noise in capture-based targeted sequencing data. Genome Biol. 2017;18(1):136. pmid:28732520
  40. 40. Zhang X, Sjoblom T. Targeting Loss of Heterozygosity: A Novel Paradigm for Cancer Therapy. Pharmaceuticals (Basel). 2021;14(1). pmid:33450833
  41. 41. Hwang MS, Mog BJ, Douglass J, Pearlman AH, Hsiue EH, Paul S, et al. Targeting loss of heterozygosity for cancer-specific immunotherapy. Proc Natl Acad Sci U S A. 2021;118(12). pmid:33731480