Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Gibbs process distinguishes survival and reveals contact-inhibition genes in Glioblastoma multiforme

  • Afrooz Jahedi ,

    Contributed equally to this work with: Afrooz Jahedi, Gayatri Kumar

    Roles Writing – review & editing

    Affiliation Department of Translational Molecular Pathology, UT MD Anderson Cancer Center, Houston, TX, United States of America

  • Gayatri Kumar ,

    Contributed equally to this work with: Afrooz Jahedi, Gayatri Kumar

    Roles Formal analysis, Investigation, Writing – review & editing

    Affiliation Department of Translational Molecular Pathology, UT MD Anderson Cancer Center, Houston, TX, United States of America

  • Lavanya Kannan,

    Roles Project administration

    Affiliation Ernst & Young, Houston, TX, United States of America

  • Tarjani Agarwal,

    Roles Investigation

    Affiliation Point of Care, Livingston, NJ, United States of America

  • Jason Huse,

    Roles Investigation

    Affiliations Department of Translational Molecular Pathology, UT MD Anderson Cancer Center, Houston, TX, United States of America, Department of Pathology, UT MD Anderson Cancer Center, Houston, TX, United States of America

  • Krishna Bhat,

    Roles Formal analysis, Investigation, Supervision, Writing – review & editing

    Affiliations Department of Translational Molecular Pathology, UT MD Anderson Cancer Center, Houston, TX, United States of America, Department of Neurosurgery, UT MD Anderson Cancer Center, Houston, TX, United States of America

  • Kasthuri Kannan

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    kskannan@mdanderson.org

    Affiliations Department of Translational Molecular Pathology, UT MD Anderson Cancer Center, Houston, TX, United States of America, Department of Neurosurgery, UT MD Anderson Cancer Center, Houston, TX, United States of America

Abstract

Tumor growth is a spatiotemporal birth-and-death process with loss of heterotypic contact-inhibition of locomotion (CIL) of tumor cells promoting invasion and metastasis. Therefore, representing tumor cells as two-dimensional points, we can expect the tumor tissues in histology slides to reflect realizations of spatial birth-and-death process which can be mathematically modeled to reveal molecular mechanisms of CIL, provided the mathematics models the inhibitory interactions. Gibbs process as an inhibitory point process is a natural choice since it is an equilibrium process of the spatial birth-and-death process. That is if the tumor cells maintain homotypic contact inhibition, the spatial distributions of tumor cells will result in Gibbs hard core process over long time scales. In order to verify if this is the case, we applied the Gibbs process to 411 TCGA Glioblastoma multiforme patient images. Our imaging dataset included all cases for which diagnostic slide images were available. The model revealed two groups of patients, one of which - the “Gibbs group,” showed the convergence of the Gibbs process with significant survival difference. Further smoothing the discretized (and noisy) inhibition metric, for both increasing and randomized survival time, we found a significant association of the patients in the Gibbs group with increasing survival time. The mean inhibition metric also revealed the point at which the homotypic CIL establishes in tumor cells. Besides, RNAseq analysis between patients with loss of heterotypic CIL and intact homotypic CIL in the Gibbs group unveiled cell movement gene signatures and differences in Actin cytoskeleton and RhoA signaling pathways as key molecular alterations. These genes and pathways have established roles in CIL. Taken together, our integrated analysis of patient images and RNAseq data provides for the first time a mathematical basis for CIL in tumors, explains survival as well as uncovers the underlying molecular landscape for this key tumor invasion and metastatic phenomenon.

Introduction and preliminaries

Contact-inhibition of locomotion (CIL) is a fundamental mechanism that immobilizes cells in healthy tissues [1] and has long been implicated in the invasion of cancer cells [2]. When collisions happen between the same cell type, this mechanism is referred to as homotypic CIL, and heterotypic CIL occurs when the collisions are between different cell types. Several studies have demonstrated a loss of heterotypic CIL between tumor and normal cells and have hypothesized this as a phenomenon behind tumor invasion and metastasis [35]. At the same time, it is known that homotypic CIL is maintained between the tumor cells [6, 7]. Hence, we can expect the spatial distribution of cells in the tumor tissue to reflect one or both of these phenotypes that would distinguish the survival of the patients, with loss of heterotypic CIL patients evidencing low survival due to tumor invasion and metastasis.

Although CIL is a dynamic mechanism that happens in real-time, by examining the spatial distribution of cells in tumor tissues, we can infer homotypic CIL and/or loss of heterotypic CIL. After all, if homotypic inhibition is maintained, the tumor cells should not be too clustered. Similarly, if the loss of heterotypic inhibition occurs, we can expect a clustering phenotype. Even though one cannot infer the mechanisms of homotypic or loss of heterotypic CIL based on dispersion or clustering phenotypes alone, we can deduce these processes if cell movement gene expression is also altered. Therefore, by examining the spatial statistics and applying appropriate inhibitory or cluster models, homotypic CIL or loss of heterotypic CIL can be inferred by integrating pathology images with gene expression data. Even though CIL as a collective migration of tissues has been a subject of extensive mathematical modeling [8], application of the stochastic process to understand these mechanisms in tumors through radiomics approaches has not been undertaken, and the mathematical description of CIL as a spatial point process remains elusive.

From a mathematical standpoint, several stochastic models for tumor evolution has been proposed since 1950’s. Five dominant paradigms have emerged [9], out of which Two-Stage Clonal Expansion (TSCE) and Moran models have been predominantly applied to study tumor growth [1012]. TSCE posits that tumor growth follows two stages, an initiating stage which is modeled as an inhomogeneous Poisson process (because tumor initiation is a rare event) and proliferation/malignancy stage that follows the birth-and-death process. Thus, the Moran model also being a birth- and-death process, allows us to infer tumor growth is a birth-and-death process with CIL as a critical mechanism for invasion and metastasis.

Therefore, representing tumor cells as two-dimensional points, we can expect the tumor tissues in histology slides to reflect realizations of spatial birth-and-death process where each point/cell would maintain a certain distance from its neighbors if homotypic CIL is maintained. These spatial points can be mathematically modeled using an appropriate spatial point process that considers inhibitory interactions, and Gibbs hard core process as an inhibitory process to which birth-and-death process converges is a natural choice. A thorough treatment of spatial point processes and in particular Gibbs process is mathematically sophisticated, and beyond the scope of this article. However, we will illustrate the central ideas behind Gibbs process and related statistics transliterated in the language of cells and CIL. These are adapted (sometimes ad verbatim) from excellent introductory point process resources such as [13] and transliterated for our purposes.

Overview of the Gibbs process and L-function (transliterated to reflect biological context)

We will use the word intensity to refer to the number of cells per unit area in a randomly chosen small region of a histology slide.

Gibbs hard core process.

A spatial distribution of a finite collection of cells with intensity β can be assigned a probability density with respect to a completely random collection of cells with unit intensity. This probability density will change depending on the interaction between the cells. In the case of intact homotypic CIL in tumors, the cells are forbidden to come too close to each other, and when they maintain a mean distance h, the locations of the centers of the cells form a point process to which each pair of neighboring points is approximately h units apart. Such inhibition is called a hard-core constraint, and the Gibbs hard core process is defined by the probability density when it satisfies the hard-core constraint.

It is noteworthy to mention that there is a deep relationship between the spatial birth-and-death process and the Gibbs hard core process. Preston [14] proves, under the condition that there is a certain minimal distance maintained between the cells, the spatial birth-and-death process will converge to a realization of the Gibbs hard core process. In particular, we have the following result:

Suppose that, in each small interval of time Δt, each existing cell has probability d(t)Δt of undergoing cell death, where d(t) is the apoptosis rate per cell per unit time. In the same time interval, in any small region of area Δa, let a cell division take place with probability m(t)ΔaΔt, where m(t) is the mitotic rate of the cell per unit time. Provided the dividing cell lies h units away from its nearest neighbor, no matter what the initial state of the cells are, over long time scales, this spatial birth-and-death process would reach an equilibrium in which any snapshot of the cells will be a realization of the Gibbs hard core process with Gibbs process intensity β=m(t)/d(t) and hard-core constraint h.

We can now see that when homotypic CIL is maintained in tumors, the hard-core constraint is naturally satisfied and therefore if we represent the cells as two-dimensional points (segmented from histology slides), the Gibbs process is instinctive to apply. However, we do not know apriori if homotypic CIL is maintained for all tumors and we can expect some of the tumors to exhibit loss of heterotypic CIL, and some homotypic CIL. The validity of the Gibbs model as an inhibitory process can be applied to test and distinguish the maintenance of homotypic CIL versus the loss of heterotypic CIL. Moreover, the validity of the model can be determined by the statistical theory that suggests the inference for the Gibbs models should be based on summary statistics such as the L-function [13].

L-function.

The L-function is the commonly used transformation of the K-function. K-function is one of the very popular statistics for comparing the spatial correlations within point patterns [13] proposed by Ripley [15]. The K-function, Kr, of a randomly chosen small region of a histology slide image is the expected number of neighbors within radius r of a typical cell at location u, divided by the intensity β. Note that if we do not divide by the intensity of β, the expected number of neighbors within radius r will vary depending on the scale of the region. That is, if the slides are zoomed out/in, the expected number of r-neighbors will change. Dividing by the intensity β assures us the K-function is independent of scaling operations.

Mathematically, for any r ≥ 0 and any location u. The theoretical K-function of a homogeneous distribution of cells (aka homogeneous Poisson process) is π r2 and this quantity can be used to determine if the cells are clustered or inhibited. For such a comparison, the transformed K-function, namely the L-function, can be compared to the L-function of the homogeneous Poisson process, namely, . If the cells would exhibit a clustering phenotype and , the cells will be contact-inhibited.

As noted above, if the cells are contact-inhibited, the validity of the Gibbs process can be determined by the summary statistics, the L-function. That is, if the cells are contact-inhibited, we can expect the fitted optimal value of Gibbs hard-core constraint, optrGr to be approximately the same as the optimal value of |Lr−r|, indicating that the statistics is consistent with the model as determined by the L-function. Therefore, we will use these metrics for our results, that is Lr ≡ optr|Lr−r| and Gr = optrGr. Thus, it is natural to study the metric M(r) = (Lr, Gr) to determine the convergence of the Gibbs process. We will refer M(r) as the inhibition metric. Further, for our analysis, we will use the smoothed one-dimensional version of the metric, which we call the mean inhibition metric, MIM(r), is defined as: which we will use to analyze overall survival of the patients.

Image processing

Fig 1 shows the image processing protocol employed to extract the point pattern to fit the Gibbs process for the TCGA images (image TCGA-02-0339 shown for illustration). We processed all images for which diagnostic slides were available (389 images, see ‘S1 File’). We also added additional 22 cases to test the association of MGMT-promotor methylation found to be associated with one of the groups (eventually a false association) in the initial 389 cohort, resulting in 411 patient images. These images were added since there were only a few MGMT-promoter methylated cases in the original 389 cohort that showed a positive association with one of the groups. Three slides, namely, TCGA-19-1388, TCGA-19- 0963 and TCGA-19-1389 had bad diagnostics images, and so we used tissue slides. Each image was resolved until 50 μm and a region of 300 μm2 was randomly chosen and extracted for segmenting the cells.

We note that the downstream analysis is independent of the resolution and the area since Ripley’s K-function (and hence the estimated L-function) gives a scale-free description of the expected number of cells within a neighborhood of an arbitrary cell in any region of interest [16]. The cells in the isolated regions were segmented using k-means segmentation algorithm; the center of masses was identified in the binary mask to generate a spatial point pattern corresponding to the images. The standard Gibbs hard core process allows two major generalizations: the Strauss process and the Geyer saturation process. Unlike a Gibbs process, in Strauss process, close pairs of points are not impossible, but are unlikely to occur as a probability. Geyer saturation process is a modification of the Strauss process in which the total number of points from each point is trimmed to never exceed a threshold, also called the saturation parameter (please refer to Chapter 13 in [13]). We fitted the Geyer saturation process, to the point pattern for downstream analysis (refer to ‘S2 File’ for the type of Gibbs model that was applied). The model parameters and the L-function statistics are presented in S1 Data.

Results

Here we discuss the results after we obtained the Gr and Lr values by processing all the images using the image processing protocol described above. Both Lr and Gr values were scaled using the standardized z-score for the rest of the analysis.

Convergence of the Gibbs process

As noted above, if the Gibbs model fits the data, we would expect the Lr and Gr values to be close and hence the metric M(r) = (Gr, Lr) is likely to be grouped for some patients. We determined the optimal number of groups for this metric. Fig 2(A) shows the result of applying 30 indices for determining the optimal number of groups (see Methods section for the algorithm used). A majority of the indices reported the optimal number of groups as 2 for this metric. One of the groups, we call the Invasion group, had a smaller range of values for this metric than the other group which we refer as the “Gibbs group.”

Fig 2(B) shows the metric M (r) for the Invasion group (red) and the Gibbs group (blue) where 145 patients were in the Invasion group and 266 belonged to the Gibbs group (see S2 Data for group association). We note that the variance of M (r) in the Gibbs group is lesser than in the Invasion group, indicating that Gr and Lr values converge at least for the Gibbs group.

Therefore, in order to determine if this is a convergence behavior in mathematical terms, we plotted the k-means centroids as we included more M (r) values. Fig 2(C) shows the k-means centroids starting with 20 patients and increasing to 411 patients (see S4 Data for the centroids of 392 patients). We see the two subgroups of patients, one with “clustering” phenotype (Invasion group) and the other “inhibitory” phenotype (Gibbs group). Moreover, calculating the overall survival time for these two groups of patients from the TCGA database, revealed the patients with the clustering phenotype survive significantly shorter than the patients with inhibitory phenotype. This is illustrated in Fig 2(D) (also see S5 Data for the actual numbers). One can expect this result since the patients with loss of heterotypic CIL between tumor and normal cells that result in tumor cell invasion and metastasis are likely to have clustering phenotype and poor survival. On the other hand, patients with homotypic CIL between tumor cells, where cells repel each other, are less prone to tumor cell invasion leading to high survival.

Gibbs process distinguishes survival time

Although the difference in the overall survival of the patients is significant between the two groups, we found several patients in the Gibbs group with poor survival and patients in the Invasion group with high survival. This finding is contrary to the expectation that if the loss of heterotypic CIL and homotypic CIL distinguishes these patient types, theoretically, the number of patients in the Gibbs group who do worse should be very small and correspondingly, the number of patients in the Invasion group who survive high should be negligible as well.

In other words, the difference in mean survival between the groups must be much higher (with much lower statistical significance) than what it is currently presented. We hypothesized that this could be due to the noise in the metric M(r) and smoothing the metric according to increasing survival time in each group will likely delineate and identify the mark where actual homotypic CIL is established in patients. Fig 3(A) shows the one dimensional version of the smoothed metric M (r), i.e., MIM(r), using the Loess regression fitting for the increasing survival time (dark blue), and randomized survival time (dashed-gray) patients, where increasing survival time and randomized survival time are determined in each group (the codes and the data for generating this are also provided). It also shows the randomized k-means centroids for the two groups as horizontal dots, red and blue.

We note that MIM(r) with increasing survival time is significantly associated (Fig 3(C), Fisher’s exact test p-value = 0.002) with the randomized k-means centroids of the Gibbs group than MIM(r) with randomized survival time, demonstrating that it is precisely the noise in the metric M(r) that included many patients with poor survival time in the Gibbs group and high survival time in the Invasion group. This association was measured up to 4 standard deviations from the randomized k-means centroids of the Gibbs group. Moreover, the association of MIM(r) with increasing survival time also revealed the point at which the homotypic CIL establishes in the tumor cells - around 179 days, corresponding to index 213 in the Gibbs group (see S1 Fig and S6 Data). Using this time-point to distinguish loss of heterotypic CIL (LH-CIL) and homotypic CIL (H-CIL), we see that patients with LH-CIL have an extremely poor outcome than H-CIL patients as shown in Fig 3(B).

Gene expression signatures between loss of heterotypic CIL and homotypic CIL subgroups

In order to determine if there are expression differences between LH-CIL and H-CIL subgroups, we performed RNA-seq analysis from the TCGA data that was available in the GDC database for several of the patients. There were 22 patients in the Invasion group and 47 patients in the Gibbs group who had the corresponding RNA-seq data (see S7 and S8 Data). The analysis revealed 447 differentially expressed molecules several of which were significantly implicated in cell movement, cell adhesion, and binding (see S9 Data). More specifically, 48 out of the 86 genes involved in cell movement were found to have low enrichment in H-CIL group (green) as compared to LH-CIL group (red) and exhibited decreased cell movement consistent with several findings [17, 18] that show these genes increase cell migration and invasion (see S2S4 Figs and S10 Data). This can be directly attributed to homotypic CIL that results in inhibition between the tumor cells. Fig 4(A) shows the differential gene expression for cell movement genes with an absolute value of the average gene expression difference between LH-CIL and H-CIL groups greater than 0.2. S7 Fig and S11 Data shows the heatmap and gene expression values (original and transformed) for the signature derived in Fig 4(A). Correspondingly, the increased expression of these 48 cell movement genes in LH-CIL patients demonstrate loss of heterotypic CIL leading to tumor invasion, and hence poor survival.

Further, pathway analysis using the differentially expressed genes revealed genes in Actin cytoskeleton and RhoA signaling as key molecular alterations (Fig 4(B) and 4(C)). S12S15 Data provides the values of these gene expression signatures and l S5 and S6 Figs shows the implicated signaling pathways.

The actin cytoskeleton plays a critical role in cell movement as they link to extracellular matrix proteins (through adhesions) when cell protrusions stabilize during movement [19]. Also, it is known that the three Rho isoforms – RhoA, RhoB, and RhoC – can induce stress fiber formation [7, 20]. Since directional cell migration involves the polarized formation of cell protrusions in the front and contraction of stress fibers at the edge, our results suggest that unstable protrusions and/or deformed stress fibers in LH-CIL patients are likely to induce tumor cell invasion. On the other hand, over-stabilized protrusions and/or formation of additional stress fibers may result in an enhanced CIL in homotypic patients.

Discussion

The importance of CIL in maintaining normal physiological function has been studied extensively [1, 2] and the loss of CIL is an important contributing factor towards metastasis [1, 4, 5, 21]. CIL is mediated by changes in the cytoskeleton using molecular mechanisms consisting of various process [2224]. The formation of a physical contact in the form of cell-cell adhesions between the interacting cells is a requirement for CIL [25]. This is mediated by Cadherins, a transmembrane glycoprotein which forms the cell-cell junctions and is a key regulator of the actin cytoskeleton [26]. The Eph receptor which belongs to the tyrosine kinase family also plays a role in cell-cell contact by binding the ephrin ligand on the neighboring cell and coupling the them [27]. The Rho family of GTPases controls cytoskeleton arrangement and dynamics [28] which is crucial in inhibiting protrusion of the leading edge upon cell-cell contact [29]. The Rac1 activity at the leading edge promotes actin polymerization which drives the protrusion [30]. RhoA dependent ROCK activity drives contraction of lamellae and ensures normal CIL function [3133]. Essentially, RhoA and Rac1 function are reciprocal with one inhibiting the other [34].

Molecular inhibition of N-cadherins also results in protrusive activity by Rac1 at the contact site in neural crest cells leading to loss of CIL [35]. Also, the overexpression of E-cadherins in neural crest cells leads to increase in Rac1 and an eventual loss of CIL [36]. There is evidence that shows that the repolarization of cells by activation of Rac1 in the free edge can lead to separation of the colliding cells. Several events in concert can promote cell separation upon collision. The disassembly of the adhesion complex formed at the contact junction can separate the cells through uncoupling [23]. The buildup in tension upon cell contact leading to an increase in microtubule dynamics mediated by RhoA/ROCK activity can promote separation [22]. The linking of actin retrograde flow in the coupled cells during contact can also lead to tension which is sufficient to pull the cells apart [23, 24, 37]. Together these events can mediate disassembly of cell-cell adhesions and promote cell separation restoring CIL.

CIL in cancers have been extensively studied, especially from a biophysical and molecular perspective [36]. CIL has been shown to be equally important for developmental processes as well as in cancer disease pathology. Previous studies have outlined several stages of CIL including cell-cell contact, protrusion inhibition and cycles of repolarization and contraction [38]. These processes are regulated by epithelial to mesenchymal transition (a key cancer initiation mechanism) and cadherin switching from dependence on E-cadherin to N-cadherin [39, 40]. These adhesion proteins then signal toward small GTPases to promote or oppose migration [41, 42]. Cell surface receptors such as Ephrin B or Robo are also actively involved in crosstalk with cadherin junction proteins in regulating CIL [43, 44]. Thus, CIL requires coordinated mechanotransduction and is linked to key molecular pathways that regulate cancer progression. In this study, we used mathematical modeling on histology images of glioblastoma patients to link spatial point process to CIL. We provide further validation of this approach by identifying key signaling pathways involved in CIL such as actin cytoskeleton and RhoA signaling as being differential between loss of heterotypic CIL and intact homotypic CIL. In [7], Mayor and Carmona-Fontaine proposes “the invasive behavior of tumors is facilitated by the absence of heterotypic CIL with normal cells, whereas homotypic CIL between cancer cells helps collective migration and/or dispersion of the tumor.” Our model suggests this hypothesis is likely to be true.

Spatial point processes are powerful mathematical frameworks for studying point patterns. Cells invariably move, and by representing them as points, it is appropriate to study these movement behaviors using point processes. Gibbs process as an inhibition model is a natural choice for studying inhibitory interactions that drive CIL in tumors. Although there have been efforts to learn tumor’s spatial architecture purely from a stochastic process perspective and point processes has been applied in various cancer studies [4548], to the best of our knowledge, a point process description of CIL has not been attempted. The power of our approach is that these mechanisms can be determined only using microscopy images along with a robust mathematical model. In particular, the fact that tumor cells invade, and this invasion can be deciphered through images presents a dominant approach for cancer diagnostics and treatment options. It opens a window for a plethora of investigations, both clinical and biological, that can be followed when integrated with the existing knowledge on the molecular landscape of Glioblastoma multiforme. Such integration can be made using the marked spatial point processes [49] such as marked Gibbs or marked Geyer saturation process.

This methodology that demonstrates the effectiveness of spatial point pattern analysis in Glioblastoma multiforme, can be applied to any cancer types. Gibbs and Strauss’s models are applicable where cell-to-cell ‘repulsion’ interactions define heterogeneity like in diffuse gliomas. In tumors of epithelial origin, tumor microenvironment support clustering of cells in a process called “nesting.” For these tumors or even if there is clustering in diffuse gliomas, probability models such as Cox, Neyman-Scott, Matérn, Thomas, Gauss-Poisson, Cauchy and several other processes can be applied and compared. Moreover, cluster point processes are natural to apply and compare in the setting of clonal evolution since these processes model parent offspring associations that mimic the natural process of cell division. In short, spatial point process models help establishing a strong association between biology and contemporary radiomic approaches in medicine where the primary goal is computing the correlations between computational features and clinical outcomes. Without establishing such associations, translating the results of radiomics into a clinical setting is likely to be unachievable [50], and by integrating spatial point process models, effective radiomic systems can be developed and implemented to translate biological findings to clinical care.

Methods

Diagnostic slide images

We queried the GDC portal (https://portal.gdc.cancer.gov/) for GBM cases for which diagnostic cases were available. The query is presented in Supplemental Information.

Image and data processing

Microscopy H&E images were resolved until 50 μm and a region of 300 μm2 was randomly chosen using QuPath [51], an open source software for digital pathology image analysis. Image segmentation was done using color-based segmentation algorithm that uses k-means clustering in Matlab [52]. Spatial statistics such as the L-function and the parameters of the Geyer saturation process (a Gibbs model) were obtained using spatstat [13], an open source software for spatial statistics.

For determining the optimal number of groups, we used NbClust [53] algorithm that applies 30 indices to estimate the optimum. The metrics for our data set are given in S3 Data. Also, for the k-means clustering, we used the standard package k-means as implemented in the R programming language.

Gene expression and pathway analysis

RNA-seq analysis was performed using TCGAbiolinks [54] queries which facilitate the GDC open-access data retrieval to perform standard reproducible differential expression analysis. TCGAbiolinks uses EdgeR differential expression package [55] under the query TCGAanalyze_DEA. We used false discovery rate cutoff of 0.001 and log-fold change cutoff of 6 with the generalized linear model glmLRT as parameters to TCGAanalyze_DEA. This resulted in 447 differentially expressed genes (see Supplemental Information). This differential gene expression signature was used in Ingenuity Pathway Analysis (IPA) tool [56] as well as the open-source software ToppGene suite [57], both of which revealed cell movement and adhesion signatures as top Gene Ontology process. Furthermore, IPA analysis revealed actin cytoskeleton and RhoA signaling as critical pathways.

For deriving the cell movement, Actin cytoskeleton, and RhoA signaling signatures, we extracted the differentially expressed genes in these processes and pathways from IPA and used the raw RNA-seq counts to get the average expression. The values were RMS normalized across LH-CIL and H-CIL values and log2 transformed and plotted using the MeV software [58].

Supporting information

S1 File. Contains the original query used in the GDC portal to download the images.

https://doi.org/10.1371/journal.pone.0277176.s001

(PNG)

S2 File. Contains a brief description of the Gibbs processes.

https://doi.org/10.1371/journal.pone.0277176.s002

(PDF)

S2 Data. Invasion group and Gibbs group association for the patients.

https://doi.org/10.1371/journal.pone.0277176.s004

(XLSX)

S3 Data. Optimal number of groups indices by NbClust.

https://doi.org/10.1371/journal.pone.0277176.s005

(XLSX)

S4 Data. Convergence of the Gibbs process - centroids information.

https://doi.org/10.1371/journal.pone.0277176.s006

(XLSX)

S6 Data. Information on LH-CIL and H-CIL division.

https://doi.org/10.1371/journal.pone.0277176.s008

(XLSX)

S7 Data. Original RNA-seq counts for H-CIL patients.

https://doi.org/10.1371/journal.pone.0277176.s009

(XLSX)

S8 Data. Original RNA-seq counts for LH-CIL patients.

https://doi.org/10.1371/journal.pone.0277176.s010

(XLSX)

S9 Data. Differentially expressed genes between H-CIL and LH-CIL patients.

https://doi.org/10.1371/journal.pone.0277176.s011

(XLSX)

S11 Data. Cell movement genes RNA-seq values (average and transformed).

https://doi.org/10.1371/journal.pone.0277176.s013

(XLSX)

S13 Data. Actin pathway RNA-seq values (average and transformed).

https://doi.org/10.1371/journal.pone.0277176.s015

(XLSX)

S15 Data. RhoA pathway RNA-seq values (average and transformed).

https://doi.org/10.1371/journal.pone.0277176.s017

(XLSX)

S1 Fig. Shows time point/indices at which homotypic CIL is established in the Gibbs group.

https://doi.org/10.1371/journal.pone.0277176.s018

(PNG)

S2 Fig. Decrease in cell movement as revealed by IPA.

https://doi.org/10.1371/journal.pone.0277176.s019

(PNG)

S3 Fig. Shows 48/86 genes have differential expression consistent with the direction of decrease in cell movement.

https://doi.org/10.1371/journal.pone.0277176.s020

(PNG)

S4 Fig. Cell movement as the top GO process as revealed by Toppgene, an open source pathway analysis tool.

https://doi.org/10.1371/journal.pone.0277176.s021

(PNG)

S6 Fig. Altered actin cytoskeleton pathway.

https://doi.org/10.1371/journal.pone.0277176.s023

(PNG)

S7 Fig. 86 cell movement gene signature heatmap.

https://doi.org/10.1371/journal.pone.0277176.s024

(TIFF)

References

  1. 1. Karthauser H.M. Abercrombie M. Heaysman J.E.M. “Social behaviour of cells in tissue culture. III. Mutual influence of sarcoma cells and fibroblasts”. In: Experimental Cell Research 13(2) (1957), pp. 276–291. pmid:13480295
  2. 2. Mayor R. Roycroft A. “Molecular basis of contact inhibition of locomotion.” In: Cell Mol Life Sci. 73(6) (2015), pp. 1119–1130. pmid:26585026
  3. 3. Abercrombie M., Heaysman J.E.M., Karthauser H.M. “Social behaviour of cells in tissue culture. III. Mutual influence of sarcoma cells and fibroblasts.” In: Exp. Cell Res. 13(2) (1957), pp. 276–291. pmid:13480295
  4. 4. Abercrombie. M. “Contact inhibition and malignancy”. In: Nature 281(5729) (1979), pp. 259–262. pmid:551275
  5. 5. Parish Roger W., Schmidhauser Christian, Schmidt Thomas, et al. “Mechanisms of tumour cell metastasis”. In: J Cell Sci Suppl. 8 (1987), pp. 181–197. pmid:3332660
  6. 6. Abercrombie M., Ambrose E. J. “The Surface Properties of Cancer Cells: A Review”. In: Cancer Research 22(5) (1962), pp. 525–548. pmid:13858936
  7. 7. Mayor R, Carmona-Fontaine C. “Keeping in touch with contact inhibition of locomotion”. In: Trends Cell Biol. 20(6) (2010), pp. 319–328. pmid:20399659
  8. 8. Zimmermann Juliane and Camley Brian A. and Rappel , Wouter-Jan and Herbert Levine. “Contact inhibition of locomotion determines cell-cell and cell-substrate forces in tissues”. In: Proceedings of the National Academy of Sciences 113(10) (2016), pp. 2660–2665. pmid:26903658
  9. 9. Vineis Paolo, Schatzkin Arthur and Potter John D. “Models of carcinogenesis: an overview.” In: Carcinogenesis 31(10) (2010), pp. 1703– 1709. pmid:20430846
  10. 10. Brenner David J., Shuryak Igor and Sachs Rainer K. Radiotherapy-Induced Carcinogenesis and Leukemogenesis: Mechanisms and Quantitative Modeling. Springer Berlin Heidelberg, 2014, pp. 205–226.
  11. 11. Novozhilov Artem S., Karev Georgy P., and Koonin Eugene V. “Biological applications of the theory of birth-and-death processes”. In: Briefings in Bioinformatics 7(1) (2006), pp. 70–85. pmid:16761366
  12. 12. Ohtsuki Hisashi and Innan Hideki. “Allele Frequency Spectrum in a Cancer Cell Population”. In: bioRxiv (2017).
  13. 13. Rubak A E. Baddeley and R.Turner. Spatial Point Patterns: Methodology and Applications with R. London: Chapman and Hall/CRC Press, 2015. ISBN: 9781482210200.
  14. 14. Preston C.J. “Spatial birth-and-death processes”. In: Bulletin of the International Statistical Institute 46 (1977), pp. 371–391.
  15. 15. Ripley B. D. “Modelling Spatial Patterns”. In: Journal of the Royal Statistical Society. Series B 39(2) (1977), pp. 172–212.
  16. 16. Leininger T. J. “Bayesian analysis of spatial point patterns”. In: PhD dissertation (2014).
  17. 17. Tuck AB, Hota C, Wilson SM, Chambers AF. “Osteopontin-induced migration of human mammary epithelial cells involves activation of EGF receptor and multiple signal transduction pathways.” In: Oncogene. 22(8) (2003), pp. 1198–205. pmid:12606946
  18. 18. van den Berg YW, van den Hengel LG, Myers HR, Ayachi O, Jordanova E, Ruf W, et al. “Alternatively, spliced tissue factor induces angiogenesis through integrin ligation.” In: Proc Natl Acad. Sci USA. 106(46) (2009), pp. 19497–502. pmid:19875693
  19. 19. Fife CM, McCarroll JA, Kavallaris M. “Movers and shakers: cell cytoskeleton in cancer metastasis”. In: Br. J. Pharmacol. 171(24) (2014), pp. 5507–5523. pmid:24665826
  20. 20. Wheeler Ann P. and Ridley Anne J. “Why three Rho proteins? RhoA, RhoB, RhoC, and cell motility”. In: Experimental Cell Research 301.1 (2004), pp. 43–49. pmid:15501444
  21. 21. Paddock SW, Dunn GA. Analysing collisions between fibroblasts and fibrosarcoma cells: fibrosarcoma cells show an active invasionary response. J Cell Sci. 1986;81:163–187. pmid:3733895
  22. 22. Kadir S, et al. Microtubule remodelling is required for the front-rear polarity switch during contact inhibition of locomotion. J Cell Sci. 2011;124(Pt 15):2642–2653. pmid:21750190
  23. 23. Davis JR, et al. Inter-cellular forces orchestrate contact inhibition of locomotion. Cell. 2015;161(2):361–373. pmid:25799385
  24. 24. Roycroft A, Mayor R. Forcing contact inhibition of locomotion. Trends Cell Biol. 2015;25(7):373–375. pmid:25981318
  25. 25. Abercrombie M, Ambrose EJ. Interference microscope studies of cell contacts in tissue culture. Exp Cell Res. 1958;15(2):332–345. pmid:13597891
  26. 26. Takeichi M. The cadherins: cell-cell adhesion molecules controlling animal morphogenesis. Development. 1988;102(4):639–655. pmid:3048970
  27. 27. Kullander K, Klein R. Mechanisms and functions of Eph and ephrin signalling. Nat Rev Mol Cell Biol. 2002;3(7):475–486. pmid:12094214
  28. 28. Hall A, Nobes CD. Rho GTPases: molecular switches that control the organization and dynamics of the actin cytoskeleton. Philos Trans R Soc Lond B Biol Sci. 2000;355(1399):965–970. pmid:11128990
  29. 29. Chrzanowska-Wodnicka M, Burridge K. Rho-stimulated contractility drives the formation of stress fibers and focal adhesions. J Cell Biol. 1996;133(6):1403–1415. pmid:8682874
  30. 30. Ridley AJ, et al. The small GTP-binding protein rac regulates growth factor-induced membrane ruffling. Cell. 1992;70(3):401–410. pmid:1643658
  31. 31. Carmona-Fontaine C, et al. Contact inhibition of locomotion in vivo controls neural crest directional migration. Nature. 2008;456(7224):957–961. pmid:19078960
  32. 32. Theveneau E, et al. Chase-and-run between adjacent cell populations promotes directional collective migration. Nat Cell Biol. 2013;15(7):763–772. pmid:23770678
  33. 33. Matthews HK, et al. Directional migration of neural crest cells in vivo is regulated by Syndecan-4/Rac1 and non-canonical Wnt signaling/RhoA. Development. 2008;135(10):1771–1780. pmid:18403410
  34. 34. Shoval I, Kalcheim C. Antagonistic activities of Rho and Rac GTPases underlie the transition from neural crest delamination to migration. Dev Dyn. 2012;241(7):1155–1168. pmid:22553120
  35. 35. Theveneau E, et al. Collective chemotaxis requires contact-dependent cell polarity. Dev Cell. 2010;19(1):39–53. pmid:20643349
  36. 36. Scarpa E. Cadherin switch during EMT in neural crest cells leads to contact inhibition of locomotion via repolarization of forces. Dev Cell. 2015;34(4):421–434. pmid:26235046
  37. 37. Gardel ML, et al. Traction stress in focal adhesions correlates biphasically with actin retrograde flow speed. J Cell Biol. 2008;183(6):999 1005. pmid:19075110
  38. 38. Stramer B, Mayor R. Mechanisms and in vivo functions of contact inhibition of locomotion. Nat Rev Mol Cell Biol. 2017 Jan;18(1):43–55. pmid:27677859
  39. 39. Scarpa E, Szabó A, Bibonne A, Theveneau E, Parsons M, Mayor R. Cadherin Switch during EMT in Neural Crest Cells Leads to Contact Inhibition of Locomotion via Repolarization of Forces. Dev Cell. 2015 Aug 24;34(4):421–34. pmid:26235046
  40. 40. Bracke ME, Depypere H, Labit C, Van Marck V, Vennekens K, Vermeulen SJ, et al. Functional downregulation of the E-cadherin/catenin complex leads to loss of contact inhibition of motility and of mitochondrial activity, but not of growth in confluent epithelial cell cultures. Eur J Cell Biol. 1997 Dec;74(4):342–9. pmid:9438130
  41. 41. Anear E, Parish RW. The effects of modifying RhoA and Rac1 activities on heterotypic contact inhibition of locomotion. FEBS Lett. 2012 May 7;586(9):1330–5. pmid:22498500
  42. 42. Theveneau E, Mayor R. Integrating chemotaxis and contact-inhibition during collective cell migration: Small GTPases at work. Small GTPases. 2010 Sep;1(2):113–117. pmid:21686264
  43. 43. Solanas G, Cortina C, Sevillano M, Batlle E. Cleavage of E-cadherin by ADAM10 mediates epithelial cell sorting downstream of EphB signalling. Nat Cell Biol. 2011 Jul 31;13(9):1100–7. pmid:21804545
  44. 44. Rhee J, Mahfooz NS, Arregui C, Lilien J, Balsamo J, Van Berkum MF. Activation of the repulsive receptor Roundabout inhibits N-cadherin-mediated cell adhesion. Nat Cell Biol. 2002 Oct;4(10):798–805. pmid:12360290
  45. 45. King Benjamin A. et al. “Spatiotemporal Patterns of Tumor Occurrence in Children with Intraocular Retinoblastoma.” In: PLoS ONE, 10(7) (2015). pmid:26230335
  46. 46. Jing H., Yang Y., and Nishikawa R. M. “Detection of clustered microcalcifications using spatial point process modeling.” In: Physics in Medicine and Biology 56(1) (2011), pp. 1–17. pmid:21119233
  47. 47. Saribudak A., Kucharavy H., Hubbard K., and Uyar M. U. “Spatial Heterogeneity Analysis in Evaluation of Cell Viability and Apoptosis for Colorectal Cancer Cells.” In: IEEE Journal of Translational Engineering in Health and Medicine 4 (2016), p. 4300209. pmid:27574578
  48. 48. Jones-Todd Charlotte M., Caie Peter, Illian Janine, Stevenson Ben C., Savage Anne, Harrison David J., Bown James L. “Unusual structures inherent in point pattern data predict colon cancer patient survival”. In: arXiv:1705.05938v1 [stat.ME] (2017).
  49. 49. Baddeley Adrian and Turner Rolf. “Spatstat: an R package for analyzing spatial point patterns.” In: Journal of Statistical Software 12.6 (2005). ISSN 1548-7660, pp. 1–42. URL: www.jstatsoft.org.
  50. 50. Zhou M. et al. “Radiomics in Brain Tumor: Image Assessment, Quantitative Feature Descriptors, and Machine-Learning Approaches”. In: American Journal of Neuroradiology (2017). ISSN: 0195-6108. pmid:28982791
  51. 51. Bankhead Peter, Loughrey Maurice B., Fernández José A., Dombrowski Yvonne, McArt Darragh G., Dunne , et al. Coleman, Jacqueline A. James, Manuel Salto-Tellez and Peter W. Hamilton. “QuPath: Open source software for digital pathology image analysis.” In: Sci. Rep. 7(16878) (2017).
  52. 52. Thompson C.M. and Shure L. Image Processing Toolbox: For Use with MATLAB;[user’s Guide]. MathWorks, 1995.
  53. 53. Charrad Malika et al. “NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set”. In: Journal of Statistical Software 61.6 (2014), pp. 1–36.
  54. 54. Colaprico A, Silva TC, Olsen C, Garofano L, Cava C, Garolini D, et al. “TCGAbiolinks: An R/Bioconductor package for integrative analysis of TCGA data.” In: Nucleic Acids Research (2015).
  55. 55. McCarthy J. D, Chen , Yunshun , Smyth K. G. “Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation.” In: Nucleic Acids Research 40(10) (2012), pp. 4288–4297. pmid:22287627
  56. 56. Krämer A, Green J, Pollard J Jr, Tugendreich S. “Causal analysis approaches in Ingenuity Pathway Analysis.” In: Bioinformatics. 30(4) (2013), pp. 523–530. pmid:24336805
  57. 57. Chen Jing, Bardes Eric E., Aronow Bruce J., Jegga Anil G. “ToppGene Suite for gene list enrichment analysis and candidate gene prioritization”. In: Nucleic Acids Research 37(2) (2009), W305–W311. pmid:19465376
  58. 58. Howe Eleanor et al. “MeV: MultiExperiment viewer”. In: Mar. 2010, pp. 267–277.