Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

An integrated method for the identification of novel genes related to oral cancer

  • Lei Chen ,

    Contributed equally to this work with: Lei Chen, Jing Yang

    Affiliations School of Life Sciences, Shanghai University, Shanghai, People’s Republic of China, College of Information Engineering, Shanghai Maritime University, Shanghai, People’s Republic of China

  • Jing Yang ,

    Contributed equally to this work with: Lei Chen, Jing Yang

    Affiliation School of Life Sciences, Shanghai University, Shanghai, People’s Republic of China

  • Zhihao Xing,

    Affiliation Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People’s Republic of China

  • Fei Yuan,

    Affiliation Department of Science & Technology, Binzhou Medical University Hospital, Binzhou, Shandong, People’s Republic of China

  • Yang Shu,

    Affiliation Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People’s Republic of China

  • YunHua Zhang,

    Affiliation School of Resources and Environment, Anhui Agricultural University, Hefei, Anhui, People’s Republic of China

  • XiangYin Kong,

    Affiliation Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People’s Republic of China

  • Tao Huang ,

    tohuangtao@126.com (TH); lihaipeng@picb.ac.cn (HPL); cai_yud@126.com (YDC)

    Affiliation Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People’s Republic of China

  • HaiPeng Li ,

    tohuangtao@126.com (TH); lihaipeng@picb.ac.cn (HPL); cai_yud@126.com (YDC)

    Affiliation CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People’s Republic of China

  • Yu-Dong Cai

    tohuangtao@126.com (TH); lihaipeng@picb.ac.cn (HPL); cai_yud@126.com (YDC)

    Affiliation School of Life Sciences, Shanghai University, Shanghai, People’s Republic of China

Abstract

Cancer is a significant public health problem worldwide. Complete identification of genes related to one type of cancer facilitates earlier diagnosis and effective treatments. In this study, two widely used algorithms, the random walk with restart algorithm and the shortest path algorithm, were adopted to construct two parameterized computational methods, namely, an RWR-based method and an SP-based method; based on these methods, an integrated method was constructed for identifying novel disease genes. To validate the utility of the integrated method, data for oral cancer were used, on which the RWR-based and SP-based methods were trained, thereby building two optimal methods. The integrated method combining these optimal methods was further adopted to identify the novel genes of oral cancer. As a result, 85 novel genes were inferred, among which eleven genes (e.g., MYD88, FGFR2, NF-κBIA) were identified by both the RWR-based and SP-based methods, 70 genes (e.g., BMP4, IFNG, KITLG) were discovered only by the RWR-based method and four genes (L1R1, MCM6, NOG and CXCR3) were predicted only by the SP-based method. Extensive analyses indicate that several novel genes have strong associations with cancers, indicating the effectiveness of the integrated method for identifying disease genes.

Introduction

Cancer is a significant public health problem worldwide. Oral cancer (OC) is a subgroup of head and neck cancer; it develops on the lips, tongue, salivary glands, gingiva, oropharynx, and on buccal surfaces [1]. Oral squamous cell carcinoma (OSCC) accounts for more than 90% of all OC [2]. OC is estimated by the WHO (World Health Organization) to be the eleventh most common cancer in the world, accounting for 300,000 new cases and 145,000 deaths in 2012 [3, 4]. The incidence of OC exhibits significant local variation and continues to be high in India, East Asia, Eastern Europe, and parts of South America [5]. Tobacco and alcohol are the most important risk factors for OC [6]. Poor nutrition, genetic factors and viral infection may be potential risk factors for OC [7, 8].

Carcinogenesis is a multi-step process, and a variety of alterations accumulate, driving and gradually increasing tumorigenesis [9]. During the past decade, a large variety of genomic variations has been implicated in OC. EGFR (epidermal growth factor receptor) amplification and over-expression were found in a large proportion of oral tumors [1012]. ErbB2 amplification and overexpression appear to occur frequently in OC specimens, and high levels of ErbB2 may be related to a worse prognosis of patients [12]. A strong correlation has been detected between c-erbB-2 overexpression and overall survival of patients with oral squamous cell carcinoma [13, 14]. The cyclin family plays a critical role in cell cycle progression. Aberrant up-expression of cyclin D accounts for 36–66% of OC. Amplification and an SNP of cyclin D may be associated with a worse prognosis and susceptibility to OC, respectively [15, 16]. Several lines of evidence suggest that the p53 tumor suppression network is altered in OC [17, 18]. In addition, many other target genes have been reported, such as ras, VEGF (vascular endothelial growth factor), and the MMP (matrix metalloproteinases) family [1921]. To the best of our knowledge, the mechanism that underlies OC is still unclear. A search for new genes related to OC may facilitate earlier diagnosis and effective treatment.

In general, many standard methods have been used for detection of virulence genes. In the hospital and laboratory, cancer samples are sliced into pathological sections and stained to determine the disease pathology type. Quantitative reverse transcription-PCR is typically used to detect the mRNA level of genes in cancer. IHC (immunohistochemical) and western blotting measure the expression of related proteins in tissues and cells, respectively. However, it is difficult to analyze genes synthetically and comprehensively. In addition, some large-scale experiments have been exploited to screen virulence genes, such as microarray, GWAS (genome-wide association study) and NGS (next-generation sequencing). The most common techniques are time-consuming and costly, and thus new methods must be explored to identify tumor genes. In recent years, with the development of computer techniques [2228], some of them can be applied to tackle this problem. Up to now, several computational methods have been proposed to identify disease genes. Many of them are based on guilt-by-association [29], i.e., the assumption that genes are similar to their neighbors in a gene network. Thus, the neighbors of the disease genes are more likely to be disease genes. However, these types of methods are local methods that use only part of the network. Thus, they do not always yield good performance. Many other methods employ the Random Walk with Restart (RWR) algorithm to identify disease genes [3032]. This algorithm simulates a walker starting from a seed node or a set of seed nodes that represent disease genes and random walking on the network. The probability of a node being a disease gene is updated until the probabilities of all nodes become stable. The genes corresponding to nodes with high probabilities are selected as novel candidate disease genes. Recently, computational methods have adopted the shortest path (SP) algorithm to address the problem [3339]. This algorithm assumes that genes lying in the shortest paths connecting any two disease genes may also be disease genes. Clearly, methods based on RWR or SP algorithms take full advantage of the network compared with those based on guilt-by-association. Thus, they can yield clues for the discovery of novel disease genes.

In this study, we used the RWR and SP algorithms to construct two novel computational methods, the RWR-based method and the SP-based method, respectively, for the identification of novel disease genes. Additionally, an integrated method was constructed by combining these two methods. To indicate the effectiveness of the integrated method, disease genes of OC were employed. It has been reported in some studies that using only the RWR algorithm and SP algorithm consistently produces several false discoveries [33, 40], which may be caused by the structure of the network or lack of consideration of the essential properties of genes. Thus, further rules, produced by the permutation test, associations between candidates and the validation of genes according to their properties, were integrated into the RWR-based and SP-based methods. These methods were executed using the corresponding large network that was built using protein-protein interaction (PPI) information and that was trained on validated OC genes to determine the optimal parameters. The obtained optimal methods were used to infer novel genes related to OC, and the integrated method combined the predicted genes using these two optimal methods to yield 85 novel genes. Among them, eleven genes were obtained by both methods, 70 genes were obtained only by the RWR-based method and four genes were obtained only by the SP-based method. According to the analyses, several genes show stimulative or suppressive effects on cancers by experiments or have a certain relationship with cancers reported in published papers, indicating the utility of the integrated method. It is also clear that the integrated method can provide more comprehensive analysis of various diseases because it is capable of producing more possible disease genes than the RWR-based method or the SP-based method.

Materials and methods

Genes related to oral cancer

The OC-related genes were collected from the following three sources: (1) 44 genes were retrieved from UniProt (http://www.uniprot.org/, accessed in July, 2015) [41] after ‘human oral cancer reviewed’ was input as a keyword; (2) seven genes were chosen from the catalogue of oral cancer from the TSGene (https://bioinfo.uth.edu/TSGene/, accessed in July, 2015) database; and (3) 156 genes were retrieved from the NCI (National Cancer Institute, https://gforge.nci.nih.gov, accessed in July, 2015) database using ‘Homo sapiens’ as a keyword. After combining the OC-related genes mentioned above, 202 OC-related genes were finally obtained; these genes are provided in S1 Table. Because our methods are based on the network constructed from the PPI information retrieved from Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) [42], these 202 OC-related genes were mapped into their Ensembl IDs, and those not occurring in the network were discarded. One hundred eighty-four Ensembl IDs of OC-related genes were accessed and composed the set Soc.

Protein-protein interactions

PPIs play important roles in several intracellular and intercellular biochemical processes. The known PPI information has been widely used to investigate several protein-related problems, such as protein function prediction [43, 44] and disease gene identification [33, 34, 36, 39]. Based on the results reported in these previous studies, it can be concluded that two proteins that can interact with each other always have a functional relationship. In this study, we attempted to discover novel candidate OC-related genes using validated OC-related genes. Thus, we can utilize PPIs to search for proteins that have a functional relationship with proteins encoded by validated OC-related genes, thereby mining novel OC-related genes.

The PPI information was retrieved from STRING (http://string-db.org/, version 9.1) [42], a well-known online public database that contains known and predicted protein interactions. The interactions reported in STRING are derived from the following four types of sources: (I) Genomic Context; (II) High-throughput Experiments; (III) (Conserved) Coexpression; (IV) Previous Knowledge, which include direct (physical) and indirect (functional) associations between proteins and thus offer more chances to mine hidden protein information. From the database, a file called ‘protein.links.v9.1.txt.gz‘ was retrieved, which contained 2,425,314 human PPIs. Each PPI was represented by two Ensembl IDs and a score ranging between 150 and 999, which indicates the strength of the interaction; i.e., proteins in interactions with higher scores are regarded as more likely to interact with each other. Let us denote the score of a PPI between proteins p1 and p2 by S(p1, p2).

RWR-based method

RWR is a ranking algorithm [30]. In a network, this algorithm simulates a walker starting from a seed node or a set of seed nodes and random walking on the network. In each step, the probability of the walker walking to each node is updated, and it stabilizes after several steps. Nodes in the network are ranked by the final probabilities assigned to them. Based on the validated disease genes (used as seed nodes), the RWR algorithm can be used to discover novel disease genes by investigating novel genes with high ranks. This algorithm has therefore been used to mine novel disease genes in recent years [3032]. However, the RWR algorithm consistently provides false discoveries [40], supporting the need for additional screening rules. In this study, we used the RWR algorithm as the basic algorithm and added some rules to build the present RWR-based learning method for the identification of novel OC-related genes. Furthermore, to obtain a better RWR-based method, some parameters were employed that are determined by training the method.

Network construction for RWR algorithm.

An accurate network is important for the identification of novel OC-related genes. Here, we adopted the PPI information mentioned in the Section “Protein-protein interactions” to build the network. The constructed network defined the 20,770 proteins occurring in the 2,425,314 human PPIs as nodes, and two nodes were adjacent if and only if the corresponding proteins could interact with each other. It can be observed that each edge represented a PPI. To employ the interaction score in the network, each edge was assigned a weight that was defined as the score of the corresponding PPI. For convenience, let us denote the constructed network by NRWR.

Searching for new candidate genes using the RWR algorithm.

As mentioned in the Section “Genes related to oral cancer”, a set Soc, consisting of 184 Ensembl IDs of OC-related genes was used in this study. The RWR algorithm simulated a random walker starting from Soc. Before executing the RWR algorithm on NRWR, each node in NRWR was assigned a probability, with 1/184 set for the Ensembl IDs in Soc and 0 set for the remaining nodes. The initial probability of each node constituted the probability vector P0. The RWR algorithm updated the probability vector in each step. Let Pt denote the probability vector after performing the t-th step, which can be updated by (1) where r is set to 0.8 and A is the column-wise normalized adjacency matrix of NRWR. The update procedure was repeated until the change between Pt and Pt+1, measured by L1 norm, was less than 1e-6. Ensembl IDs with high probabilities were considered to be encoded by OC-related genes. Because we did not know which probability was a suitable threshold for selecting candidate genes, we established a parameter, , for the threshold, which can be determined by training the RWR-based method.

SP-based method

The SP algorithm is a classic graph algorithm. In recent years, some investigators have applied the SP algorithm for the identification of disease genes [3339]. The new candidate genes were extracted from the shortest paths connecting any two validated disease genes. Here, we built an SP-based machine learning method to identify novel OC-related genes. Like the RWR-based method, further screening rules were established to discard non-essential candidate genes and select important ones.

Network construction for SP algorithm.

Similar to NRWR for the RWR-based method, we also constructed the network NSP. As mentioned in the Section “Network construction for RWR-based method”, each edge in NRWR represented a PPI. In addition, each PPI had an interaction score as described in the Section “Protein-protein interactions”. This score was used to define the weight of the edge. The range of the interaction score was between 150 and 999, and an edge with a low weight indicated strong correlations between its endpoints in the SP-based model. Thus, for an edge e with endpoints n1 and n2, weight was defined in the following manner: (2) where p1 and p2 were two corresponding proteins of nodes n1 and n2. The constructed network NSP consisted of 20,770 nodes and 2,425,314 edges.

Searching for new candidate genes using the SP algorithm.

The SP algorithm, Dijkstra’s algorithm [45], was applied to the network NSP to search for all of the shortest paths connecting any two Ensembl IDs in Soc. From the obtained shortest paths, we extracted the inner nodes that did not represent validated OC-related genes. Genes corresponding to the extracted nodes were considered to be related to OC and denoted candidate genes. In addition, each candidate gene was assigned a measurement, called betweenness, which was defined as the number of shortest paths within which it was contained.

Screening rules

The RWR and SP algorithms can produce a number of candidate genes for OC when the probability threshold is provided for the RWR algorithm. However, several false discoveries are inevitable, as mentioned in previous studies [33, 40]. To screen out these false discoveries, a series of screening rules were added to this section and integrated into the RWR-based and SP-based methods. Some parameters were employed to build the rules. Their optimal values in RWR-based and SP-based methods are determined by training these two methods, respectively.

The probability that a candidate gene produced by the RWR algorithm was influenced by the structure of the network NRWR and the betweenness for a candidate gene produced by the SP algorithm was also clearly influenced by the structure of the NSP network. Some candidate genes receiving high probabilities (betweenness) were not specific to OC. Thus, a permutation test was designed that first randomly constructed 1,000 sets of Ensembl IDs such that each set had the same size of Soc, denoted by S1, S2,⋯, S1000. For each set, the RWR algorithm and SP algorithm were executed on NRWR and NSP, respectively, by setting the Ensembl IDs in the set as the input, yielding a probability for each candidate gene produced by the RWR algorithm and a betweenness for each candidate gene produced by the SP algorithm. Finally, for each candidate gene, there was one probability (betweenness) for Soc and 1,000 probabilities (betweenness) for S1, S2,⋯,S1000. If the candidate gene was specific to OC, its probability (betweenness) for Soc should clearly be larger than most probabilities (betweenness) for S1, S2,⋯,S1000. Thus, we calculated the p-value for each candidate gene g, which was defined by (3) where δi = 1 if the probability (betweenness) for Si was larger than that of Soc; δi = 0 otherwise. Because 0.05 was always used as an important cut-off for the significance level of the test, it was set to be the threshold of the p-value; i.e., candidate genes with p-values less than 0.05 were selected.

The network-based method is useful for complicated problems. However, this type of method seldom considers the essential properties of nodes in the network, leading to several false discoveries. Rules utilizing the essential properties of candidate and validated genes are needed to exclude false discoveries and select important ones.

As mentioned in the Section “Protein-protein interactions”, two proteins that can interact with each other always have a functional relationship. Furthermore, considering the interaction scores, proteins in an interaction with a high score are more likely to have a strong functional relationship than those in an interaction with a low score. Thus, the interaction score can be used to measure the associations between candidate genes and OC-related genes. For each candidate gene g, we calculated the maximum interaction score (MIS), which was defined by (4)

Clearly, a candidate gene with a high MIS is more likely to be a novel OC-related gene and thus should be selected. However, the threshold of MIS was not easy to determine. Thus, it was set as a parameter, pMIS. In the RWR-based method, the optimal value of pMIS is determined by training this method. In addition, the optimal value of pMIS in the SP-based method is also accessed by training the SP-based method.

The MIS measures the associations between candidate genes and OC-related genes. The following measurement evaluates the associations between them in another way. It is known that OC-related genes must be highly related to some gene ontology (GO) terms or biological pathways. Thus, we can use the annotated GO term and KEGG pathway information for candidate genes and OC-related genes to evaluate their correlations. To achieve this goal, each candidate gene or OC-related gene was encoded by its GO enrichment scores and KEGG enrichment scores. For a candidate gene g and an OC-related gene g′, their associations can be measured by (5) where ES(g) (ES(g′), respectively) is a vector consisting of the GO enrichment scores and KEGG enrichment scores of g (g′, respectively). A high value for Eq 5 indicated a strong association. Similar to the definition of MIS, we calculated the maximum function score (MFS) of each candidate gene g by (6)

Similarly, a candidate gene with a high MFS might be a novel OC-related gene with a high probability and should be selected. Additionally, it was difficult to determine the threshold of MFS. The parameter pMFS was also established for this threshold. Similar to parameter pMIS mentioned above, the optimal value of pMFS in the RWR-based and SP-based methods is determined by training these two methods, respectively.

Integrated method

The RWR-based and SP-based methods, together with the screen rules mentioned in the Section “Screening rules”, were constructed. The pseudo-codes of these two methods are listed in Tables 1 and 2, respectively. The integrated method encompassed these two methods by combining their results.

Evaluation methods

The RWR-based and SP-based methods were applied to identify novel OC-related genes. However, there are some parameters in these two methods that must be determined before they are used to identify novel OC-related genes. Thus, these two methods were trained on the validated OC-related gene set Soc, through which the optimal parameters can be determined. We used the jackknife test [46, 47], which is one of the classic cross-validation methods [48, 49], to evaluate the performance of these two methods, i.e., each OC-related gene in Soc was singled out sequentially, and the remaining genes in Soc were used to generate predictions under various combinations of parameters. When training the methods, we supposed that all genes in the network other than the validated OC-related genes were negative; i.e., they were not OC-related genes. The performance of the method can be measured according to the following two features: (1) whether the selected OC-related gene can be recovered by executing the method on the remaining OC-related genes; (2) the predicted genes other than the selected OC-related gene should be as low as possible. Thus, we considered the following two measurements: precision and recall, which are always used to evaluate the performance of the methods on a binary classification problem in the fields of pattern recognition and information retrieval. Recall is defined as the proportion of retrieved OC-related genes among all OC-related genes, and precision represents the proportion of retrieved OC-related genes among all retrieved genes. Furthermore, another measurement, the F1-measure, is often used to evaluate overall performance, and it can be calculated by (7)

Eq 7 shows that recall and precision have the same role. However, in this study, we considered recall to be more important than precision because the method with low recall and high precision could not reliably produce the predicted results. Thus, we simply revised Eq 7 and defined a new measurement, namely, F1-measure-R, which can be computed by (8)

Because the retrieved genes differ when the selected OC-related gene was not the same, the recall, precision and F1-measure-R must be considered each time for the predicted results. Thus, under a combination of parameters, the RWR-based and SP-based methods can produce a series of recall, precision and F1-measure-R values. We calculated the average values to indicate the performance of the method under this combination of parameters. For convenience, the recall, precision and F1-measure-R presented in the rest of this paper represent the average values.

Results

Optimized parameters for the RWR-based and SP-based methods

As mentioned in the Section “RWR-based method”, “SP-based method” and “Screening rule”, some parameters should be optimized for the RWR-based method and the SP-base method. To extract an optimal combination of parameters for each method, the RWR-based method and the SP-base method were trained on Soc, and their performance was evaluated by the Jackknife test.

For the RWR-based method, three parameters, the threshold of probability , the threshold of MIS pMIS, and the threshold of MFS pMFS, should be optimized. For , we tried various values ranging from 1E-05 to 1E-04; for pMIS, we tried three values, 400, 700, 900, which are reported in STRING for thresholds of medium confidence, high confidence and highest confidence, respectively; for pMFS, we tried various values ranging from 0 to 0.9. The measurements mentioned in the Section “Evaluation methods” for the RWR-based method with different combinations of parameters are listed in S2 Table. For ease of observation, the values of F1-measure-R obtained using the RWR-based method with different parameters are illustrated in Fig 1; we can see the same level of performance when the parameters and pMFS are equivalent. The maximum F1-measure-R was 1.677E-03 when the parameters were set to be , pMFS = 0.8 and pMIS = 400 or 700. Because this combination of parameters yielded the best performance, they were used to build the optimal RWR-based method, which is adopted to identify novel OC-related genes.

thumbnail
Fig 1. The performance of the RWR-based method under different combinations of parameters.

(a) The performance of the RWR-based method setting pMIS = 400. (b) The performance of the RWR-based method setting pMIS = 700. (c) The performance of the RWR-based method setting pMIS = 900.

https://doi.org/10.1371/journal.pone.0175185.g001

The SP-based method was also trained to extract the optimal parameters for pMIS and pMFS. We tested three values for pMIS, as mentioned in the above paragraph, and various values ranging from 0 to 0.9 for pMFS. For the results obtained using the SP-based method with different combinations of parameters, the measurements mentioned in the Section “Evaluation methods” were counted and are provided in S3 Table. Additionally, three curves are plotted in Fig 2 to show the values of F1-measure-R obtained using the SP-based method with different values of pMFS and a fixed value of pMIS. When the values of pMFS were small, the values of F1-measure-R were proportional to the value of pMIS. However, the values of F1-measure-R were almost the same when pMFS were large. The maximum F1-measure-R was 2.693E-04 when the parameters were set to be pMFS = 0.8 and pMIS = 400 or 700 or 900. Similarly, we used these values to build the optimal SP-based method for the identification of novel OC-related genes.

thumbnail
Fig 2. The performance of the SP-based method under different combinations of parameters.

There are three lines in this figure, which represent the performance of the SP-based method with different thresholds of maximum interaction score. In detail, the full line represents the performance of the SP-based method with the threshold of maximum interaction score 900, the dot line represents the performance of the SP-based method with the threshold of maximum interaction score 700, the dash line represents the performance of the SP-based method with the threshold of maximum interaction score 400.

https://doi.org/10.1371/journal.pone.0175185.g002

Inferred results of the RWR-based method

As mentioned in the Section “Optimized parameters for the RWR-based and SP-based methods”, the optimal RWR-based method was built using , pMFS = 0.8 and pMIS = 400 or 700. This method was further used to identify novel OC-related genes based on all validated OC-related genes mentioned in the Section “Genes related to oral cancer”. Because the optimal RWR-based method contains two options for the parameter pMIS, this method was executed twice. First, pMIS was set to 400, while in the second iteration, it was set to 700. The genes identified using this method with a different parameter, pMIS, are listed in S4 Table. A careful review of the results showed that the identified genes produced using these two optimal RWR-based methods were equivalent. They all identified 81 novel genes. Because all of these genes were viewed as high probability by the RWR method and had strong associations with validated OC-related genes, we believe that they are highly related to OC, and they are termed putative OC-related genes.

Inferred results of the SP-based method

The optimal SP-based method was built in the Section “Optimized parameters for the RWR-based and SP-based methods”, in which pMFS was set to 0.8 and pMIS was set to 400, 700 or 900. This method was also adopted to identify novel OC-related genes. By setting pMIS to 400, 700 or 900, we can build three optimal SP-based methods. They were executed sequentially to identify novel OC-related genes. Similarly, they all yielded fifteen identical genes, which are listed in Table 3. It is believed that these genes are closely related to OC, and they are termed putative OC-related genes.

thumbnail
Table 3. Genes identified by the optimal SP-based method.

https://doi.org/10.1371/journal.pone.0175185.t003

Inferred results of the integrated method

As mentioned in the Sections “Inferred results of the RWR-based method” and “Inferred results of the SP-based method”, the RWR-based method yielded 81 putative genes, and the SP-based method yielded fifteen putative genes. The union of these two putative gene sets provided the results of the integrated method, in which 85 putative genes were obtained. Their distribution is illustrated in Fig 3; we can see that eleven putative genes were identified by both the RWR-based and the SP-based methods. Among the remaining putative genes, 70 were identified using the RWR-based method, and four putative genes were identified using the SP-based method. Because the principles of the RWR-based and SP-based methods are very different, the identified novel OC-related genes were not the same. By considering the identified genes produced by either of them, we can obtain more putative genes and have an opportunity to extensively study disease genes in OC. To indicate the obtained 85 putative genes are highly related to OC, we extracted a sub-network containing these putative genes and OC-related genes from NRWR and NSP as shown in Fig 4.

thumbnail
Fig 3. The distribution of the 85 putative OC-related genes obtained in this study.

The blue part represents the set consisting of 70 putative genes obtained using only the RWR-based method. The red part represents the set consisting of 11 putative genes obtained using both the RWR-based and SP-based methods. The green part represents the set consisting of 4 putative genes obtained using only the SP-based method.

https://doi.org/10.1371/journal.pone.0175185.g003

thumbnail
Fig 4. The sub-network containing the putative genes and OC-related genes that was extracted from the network for RWR-based and SP-based methods.

The blue nodes represent putative genes and red nodes represent OC-related genes.

https://doi.org/10.1371/journal.pone.0175185.g004

Discussion

Using the RWR-based method, 81 genes were obtained. Using the SP-based method, fifteen genes were accessed. All of these genes were deemed to be significantly associated with OC. Furthermore, eleven genes were identified by both methods, which may be more important than the others. In this section, some important putative genes (listed in Tables 46) were extensively analyzed to confirm their associations with OC.

thumbnail
Table 4. Eleven putative genes identified using both RWR-based and SP-based methods.

https://doi.org/10.1371/journal.pone.0175185.t004

thumbnail
Table 5. Important genes among the seventy putative genes identified using the RWR-based method.

https://doi.org/10.1371/journal.pone.0175185.t005

thumbnail
Table 6. Four putative genes identified using the SP-based method.

https://doi.org/10.1371/journal.pone.0175185.t006

Putative genes identified using both RWR-based and SP-based methods

The following eleven proteins were identified using both RWR-based and SP-based methods: CYP3A43, FGF4, NFKBIA, THBS1, IL1B, CXCL13, CYP2A6, SELE, STAT1, MYD88 and FGFR2, as listed in Table 4.

MYD88 (myeloid differentiation primary response 88) encodes a cytosolic adapter protein that functions as a key signal transducer in the interleukin-1 and Toll-like receptor (TLRs) signaling pathways. TLRs and their ligands play a crucial role in inflammation and host defense [50]. In a broad variety of tumor tissues and cell lines, aberrant expression of TLRs plays a role in tumor immune escape or resistance to apoptosis [51, 52]. The expression of TLR4 and MyD88 is also aberrant and affects the down-stream signaling pathway [53]. In this study, MyD88 was predicted using both RWR-based and SP-based methods, and thus it may be a potential target for OC.

FGFR2 (fibroblast growth factor receptor 2) belongs to the FGFR tyrosine kinase family, which is one of the most frequently altered kinase families in some types of cancer [54]. Point mutations have been observed in 12% of endometrial carcinomas [55]. These studies suggest that FGFR2 may act together with the regulation of the PI3K/AKT/mTOR pathway to drive endometrial cancer growth in a subset of patients [56, 57]. FGFR2 mutations also occur in oral squamous cell carcinoma, and a patient with OSCC was found to respond to pazopanib, a multiple tyrosine kinase inhibitor [58]. These findings support the potential diagnostic and therapeutic value of FGFR2 as an effective strategy for the treatment of patients with OC with defined molecular characteristics. In addition, our results showed that FGF4 had a significant probability in both methods. Some large-scale experiments have suggested that aberrant amplification of FGF4 occurs in several types of cancer, including lymph node metastasis and urinary bladder cancer [59, 60].

NF-κBIA (nuclear factor-kappa B inhibitor alpha) inhibits NF-κB, which is involved in inflammatory responses and is a hallmark linking inflammation to tumor development and progression [61, 62]. The polymorphic variations in NFKBIA were associated with the risk of various cancers, including gastric cancer, prostate cancer and melanoma [6365]. NFKBIA polymorphisms have significant associations with OSCC [66]. In our study, NFKBIA had a close relationship with OC using both methods.

SELE (selectin E), which was found in cytokine-stimulated endothelial cells, functions in the accumulation of blood leukocytes at sites of inflammation [67]. The level of SELE-mediated adhesion of colon cancer and head and neck squamous cell cancer cells to the endothelium has been implied to be involved in metastasis [68, 69]. Our analysis revealed that SELE might be a putative marker for tumorigenesis in OC. A study in Taiwan supports the idea that SELE-related inflammation plays a crucial role in the pathogenesis process of OSCC [70]. Future investigations of the function of SELE may clarify the mechanism of tumorigenesis and metastasis.

THBS1 (thrombospondin 1) is a subunit of a disulfide-linked homotrimeric protein, which mediates cell-cell and cell-matrix interactions [71]. The significant relationship between THBS1 and OC was detected using both RWR-based and SP-based methods. Several studies have indicated that THBS1 has crucial functions in oral tumorigenesis [72, 73]. THBS1 might be a potential diagnostic and therapeutic target for OC.

STAT1 (signal transducer and activator of transcription 1), which belongs to the STAT protein family, mediates the expression of a variety of genes and cell viability in response to stimuli and pathogens [74]. Aberrant activation of STAT1 has frequently been found in various cancers, such as head and neck cancer [75]. A similar STAT1 activation status was detected in patients with OSCC [76]. In this study, STAT1 displayed a significant association with OC using both the RWR-based and SP-based methods, and this molecular marker could help in guiding diagnostic and therapeutic decisions in patients with OC.

Among these eleven putative genes, including genes such as MYD88, FGFR2 and THBS1, stimulative or suppressive effects on cancers have been shown by the experiments. We speculate that these three genes have certain functions in OC. In addition, it has been reported that other genes, including NFKBIA, SELE, STAT1, IL1B, CYP2A6, CYP3A43, FGF4 and CXCL13 [7780], have a certain relationship with various forms cancer. However, the mechanism has not been thoroughly studied. In our analysis, they have a significant relationship with OC, and the mechanism should be explored.

Putative genes identified only by the RWR-based method

Seventy proteins were identified only by the RWR-based method. The important genes are BMP4, CCL2, CCL20, CCL5, CCR7, IFNG, KITLG, TNFRSF1A, PIK3CB, PIK3CD, PIK3CG, CYP2B6, CYP2C19, CYP2J2, CYP3A4, CYP4X1 and RAC3; these genes are listed in Table 5. Some of them participate in tumorigenesis by regulating cell growth or apoptosis, such as BMP4 and RAC3. Some genes are important factors in the immune system of cancer patients, such as CCL2, CCL20, CCL5, CCR7, IFNG and P450 family. Detailed function analyses of candidate genes are shown in S1 File.

We identified several novel putative genes using only the RWR-based method, which have strong associations with the tumorigenesis of OC. The functions of some genes have been explored in experiments, such as BMP4, IFNG, KITLG, CCL5, CCR5, CCR7, CYP2J2, PIK3CB, PIK3CG and RAC3; others revealed mutations or aberrant expression in cancers, but the mechanism is not clear. Future research is required to replicate and validate the effects of these genes.

Putative genes identified only by the SP-based method

Four proteins were predicted to be closely associated with OC using the SP-based method but not the RWR-based method. These genes were IL1R1, MCM6, NOG and CXCR3; they are listed in Table 6. It has been reported that CXCR3 can promote metastasis in some types of cancer [8186]. Other genes showed aberrant expression or mutations in cancers, which merits attention. Detailed function analyses of candidate genes are shown in S1 File.

Based on the analyses of the novel genes obtained by the integrated method, we found that several genes have shown stimulative or suppressive effects on cancers by experiments or aberrant expression or mutations in cancers reported in published papers, implying the effectiveness of the integrated method. In addition, the integrated method can provide more potential disease genes for investigating OC because it combines the results yielded by the RWR-based and SP-based methods; i.e., simultaneous usage of the RWR-based and SP-based method can help us mine more information about disease genes in oral cancer. We believe that it is helpful to investigate various diseases using both the RWR-based and the SP-based methods.

GO-term and pathway function enrichment analysis of putative genes using DAVID

By the integrated method, 85 putative genes were obtained. To uncover the biological meaning behind these genes, the functional annotation tool, the Database for Annotation, Visualization and Integrated Discovery (DAVID) [87], was adopted to analyze them. The obtained results are provided in S5 Table.

For the GO enrichment results yielded by DAVID (see S5 Table), we can see that seven-two biological process (BP) GO terms, fifteen molecular function (MF) GO terms and nine cellular component (CC) GO terms are with statistically significance (FDR<0.05). The biological process mostly focused on cell proliferation, apoptosis, transcription and signal transduction. Malignant proliferation and anti-apoptosis are the major characteristics of many cancers. In this study, it has been found many proliferation related genes such as EGF, IGF1 and CDKN1B. EGF (epidermal growth factor) and its receptor EGFR affect cellular processes like proliferation, motility and adhesion. EGF has a higher expression and is correlated with the progression of cancer such as breast cancer [88]. The circulating IGF1 (insulin-like growth factor-1) level is associated with the risk to develop breast cancer [89, 90]. CDKN1B functions as a key cell cycle gatekeeper to prevent or slow down cell division [91, 92]. Aberrant downregulating CDKN1B may promote the proliferation of multiple myeloma cells [93]. Un-controlled expanded number of cancers is determined not only by cell proliferation but also by the cell evading apoptosis. In our study, many genes are enriched in apoptosis process such as TP53, CASP3 and cmyc. The mutations of TP53 are the most common alteration in cancer and are related to cell apoptosis, cell cycle and malignancy [9497]. TP53 could arrest cell cycle and induced cell apoptosis in response to DNA damage [98]. It has been reported that over-expression of c-Myc drives the level of BAX and other apoptosis-related genes and is involved in cell apoptosis [99103]. CASPs is a kind of cysteine-dependent aspartate-specific proteases, and is associated with the initiation and execution of apoptosis. As an activate effector, CASP3 receives the apoptotic signals to perform the cell death process [104]. In our result, a series of genes were enriched in the GO term related apoptosis, which proved process of apoptosis is a key process in oral cancer. In addition, several other biological process GO terms were shown such as transcription and signal transduction. In proliferation, apoptosis and other processes, functional genes were selectively transcribed and expressed. For example, c-Myc is a transcription factor which could mediate a series of down-stream genes express to drive cellular proliferation and apoptosis [105]. It can be seen from S5 Table that putative genes were enriched in several CC GO terms and MF GO terms such as growth factor activity (MF), cytokine activity (MF), enzyme or protein binding (MF), extracellular region (CC), extracellular space (CC) and plasma membrane (CC). These results suggest that the functional activity and localization of the protein are directly or indirectly related to oral cancer.

The DAVID also produced the KEGG pathway enrichment on putative genes (see S5 Table), several pathways were highlighted such as TNF signaling pathway, PI3K-Akt pathway, Pap1 pathway, MAPK pathway, TLR pathway, NF-kB pathway, JaK-STAT, Ras pathway and cytokine-cytokine receptor interaction. TNF pathway could be activated by various signals to affect the immunity, cell growth, apoptosis and other biological behaviors of tumor cells [106110]. Protein-protein interaction and pathways is a cross-talk network. TNF can also activates the NF-kB pathway. The NF-kB pathway is activated in various cancers [111, 112]. Activation of NF-kB can be inhibited by the blockade of PI3K/Akt and ERK pathway to suppress tumor metastasis [113]. It was reported that PI3K/AKT-NF-kB is an axis which promotes bone metastasis in prostate cancer [114]. The Jak/STAT pathway is critical in normal tissues and tumors and the Jak kinase family includes JAK1, JAK2, JAK3 and TYK2 [115]. Jak mediated STAT phosphorylation leads to their nuclear translocation. STAT molecules bind specific promoter of genes and result in the transcription in nucleus, which regulate the cell proliferation, differentiation and apoptosis [115, 116]. It was reported that over-activation of the JAK/STAT pathway is related to subsets of patients with certain solid tumors and chronic myeloid leukemia [117, 118]. MAPK pathway is induced by activation of TLR (toll like receptors) and NOD (nucleotide-binding oligomerization domain receptors), which initiates of inflammation and are involved in cancer proliferation and control [119]. Ras pathway is a highly conserved pathway in cell function, including cell proliferation, differentiation and signaling transduction. This pathway is commonly deregulated in cancers, making the components in the pathway as targets for therapeutic interventions [120]. In these pathways, several putative genes were enriched, which indicates these pathway and gene need more attention in tumorigenesis of oral cancer.

Conclusions

In this study, we investigated genes related to oral cancer. Two popular algorithms, the random walk with a restart algorithm and the shortest path algorithm, which are often used to identify novel disease genes, were integrated with some further rules to build two computational methods. An integrated method was further built by combining these two methods. To access an optimal prediction method for the identification of genes related to oral cancer using the integrated method, these two methods were trained on validated genes. The optimal prediction method was further adopted to identify novel genes related to oral cancer. The results indicated the following facts: (1) The integrated method is effective for identification of disease genes of oral cancer; (2) Candidate genes produced by the integrated method provide an opportunity to achieve a more extensive investigation of oral cancer. We hope that the integrated method can be useful for identifying novel disease genes. In view of the utility of the RWR-based and SP-based methods for identification of disease genes, it is hopeful that these methods can be applied to investigate other problems, such as DNA-binding protein prediction [121], protein fold recognition [122, 123], detection of tubule boundary [124], etc.

Supporting information

S1 File. Detailed analysis of the putative genes obtained by RWR-based method or SP-based method.

https://doi.org/10.1371/journal.pone.0175185.s001

(DOCX)

S1 Table. 202 genes related to oral cancer and their sources.

https://doi.org/10.1371/journal.pone.0175185.s002

(DOCX)

S2 Table. The performance of the RWR-based method with different combinations of parameters.

https://doi.org/10.1371/journal.pone.0175185.s003

(DOCX)

S3 Table. The performance of the SP-based method with different combinations of parameters.

https://doi.org/10.1371/journal.pone.0175185.s004

(DOCX)

S4 Table. Genes identified by the optimal RWR-based method.

https://doi.org/10.1371/journal.pone.0175185.s005

(DOCX)

S5 Table. The enrichment analysis of putative genes yielded by DAVID.

https://doi.org/10.1371/journal.pone.0175185.s006

(XLSX)

Acknowledgments

This study was supported by the National Natural Science Foundation of China (31371335), Natural Science Foundation of Shanghai (17ZR1412500), Shanghai Sailing Program and The Youth Innovation Promotion Association of Chinese Academy of Sciences (CAS) (2016245).

Author Contributions

  1. Conceptualization: TH HL YDC.
  2. Data curation: ZX FY YS.
  3. Formal analysis: LC JY YZ XK.
  4. Funding acquisition: TH YDC.
  5. Methodology: LC YDC.
  6. Supervision: LC YDC.
  7. Validation: JY ZX FY TH.
  8. Writing – original draft: LC JY.
  9. Writing – review & editing: TH YDC.

References

  1. 1. Tsantoulis PK, Kastrinakis NG, Tourvas AD, Laskaris G, Gorgoulis VG. Advances in the biology of oral cancer. Oral Oncol. 2007;43(6):523–34. pmid:17258495
  2. 2. Shridhar K, Walia GK, Aggarwal A, Gulati S, Geetha AV, Prabhakaran D, et al. DNA methylation markers for oral pre-cancer progression: A critical review. Oral Oncol. 2016;53:1–9. pmid:26690652
  3. 3. Bray F, Ren JS, Masuyer E, Ferlay J. Global estimates of cancer prevalence for 27 sites in the adult population in 2008. International journal of cancer Journal international du cancer. 2013;132(5):1133–45. pmid:22752881
  4. 4. Noda M. International Agency for Research on Cancer (http://www.iarc.fr/). Japanese journal of clinical oncology. 1999;29(11):592. pmid:10678568
  5. 5. Cancer incidence in five continents. Volume IX. IARC Sci Publ. 2008;(160):1–837. pmid:19388204
  6. 6. Warnakulasuriya S, Sutherland G, Scully C. Tobacco, oral cancer, and treatment of dependence. Oral Oncol. 2005;41(3):244–60. pmid:15743687
  7. 7. Lucenteforte E, Garavello W, Bosetti C, La Vecchia C. Dietary factors and oral and pharyngeal cancer risk. Oral Oncol. 2009;45(6):461–7. pmid:18990606
  8. 8. Pavia M, Pileggi C, Nobile CG, Angelillo IF. Association between fruit and vegetable consumption and oral cancer: a meta-analysis of observational studies. The American journal of clinical nutrition. 2006;83(5):1126–34. pmid:16685056
  9. 9. Fearon ER, Vogelstein B. A genetic model for colorectal tumorigenesis. Cell. 1990;61(5):759–67. pmid:2188735
  10. 10. Nagatsuka H, Ishiwari Y, Tsujigiwa H, Nakano K, Nagai N. Quantitation of epidermal growth factor receptor gene amplification by competitive polymerase chain reaction in pre-malignant and malignant oral epithelial lesions. Oral Oncol. 2001;37(7):599–604. pmid:11564582
  11. 11. Ishitoya J, Toriyama M, Oguchi N, Kitamura K, Ohshima M, Asano K, et al. Gene amplification and overexpression of EGF receptor in squamous cell carcinomas of the head and neck. British journal of cancer. 1989;59(4):559–62. pmid:2713242
  12. 12. Shiraki M, Odajima T, Ikeda T, Sasaki A, Satoh M, Yamaguchi A, et al. Combined expression of p53, cyclin D1 and epidermal growth factor receptor improves estimation of prognosis in curatively resected oral cancer. Modern pathology: an official journal of the United States and Canadian Academy of Pathology, Inc. 2005;18(11):1482–9.
  13. 13. Xia W, Lau YK, Zhang HZ, Liu AR, Li L, Kiyokawa N, et al. Strong correlation between c-erbB-2 overexpression and overall survival of patients with oral squamous cell carcinoma. Clin Cancer Res. 1997;3(1):3–9. pmid:9815530
  14. 14. Werkmeister R, Brandt B, Joos U. Clinical relevance of erbB-1 and -2 oncogenes in oral carcinomas. Oral Oncol. 2000;36(1):100–5. pmid:10889928
  15. 15. Miyamoto R, Uzawa N, Nagaoka S, Hirata Y, Amagasa T. Prognostic significance of cyclin D1 amplification and overexpression in oral squamous cell carcinomas. Oral Oncol. 2003;39(6):610–8. pmid:12798405
  16. 16. Koontongkaew S, Chareonkitkajorn L, Chanvitan A, Leelakriangsak M, Amornphimoltham P. Alterations of p53, pRb, cyclin D(1) and cdk4 in human oral and pharyngeal squamous cell carcinomas. Oral Oncol. 2000;36(4):334–9. pmid:10899671
  17. 17. Huang MF, Chang YC, Liao PS, Huang TH, Tsay CH, Chou MY. Loss of heterozygosity of p53 gene of oral cancer detected by exfoliative cytology. Oral Oncol. 1999;35(3):296–301. pmid:10621851
  18. 18. Largey JS, Meltzer SJ, Yin J, Norris K, Sauk JJ, Archibald DW. Loss of heterozygosity of p53 in oral cancers demonstrated by the polymerase chain reaction. Cancer. 1993;71(6):1933–7. pmid:8443743
  19. 19. Sakata K. Alterations of tumor suppressor genes and the H-ras oncogene in oral squamous cell carcinoma. Journal of oral pathology & medicine: official publication of the International Association of Oral Pathologists and the American Academy of Oral Pathology. 1996;25(6):302–7.
  20. 20. Shang ZJ, Li ZB, Li JR. VEGF is up-regulated by hypoxic stimulation and related to tumour angiogenesis and severity of disease in oral squamous cell carcinoma: in vitro and in vivo studies. Int J Oral Maxillofac Surg. 2006;35(6):533–8. pmid:16388929
  21. 21. Patel BP, Shah PM, Rawal UM, Desai AA, Shah SV, Rawal RM, et al. Activation of MMP-2 and MMP-9 in patients with oral squamous cell carcinoma. J Surg Oncol. 2005;90(2):81–8. pmid:15844188
  22. 22. Zeng X, Liao Y, Liu Y, Zou Q. Prediction and validation of disease genes using HeteSim Scores. IEEE/ACM Trans Comput Biol Bioinform. 2016.
  23. 23. Chen X, Perez-Jimenez MJ, Valencia-Cabrera L, Wang BZ, Zeng XX. Computing with viruses. Theoretical Computer Science. 2016;623:146–59.
  24. 24. Zou Q, Wan S, Ju Y, Tang J, Zeng X. Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy. BMC Syst Biol. 2016;10(Suppl 4):114. pmid:28155714
  25. 25. Gu B, Sheng VS, Tay KY, Romano W, Li S. Incremental Support Vector Learning for Ordinal Regression. Ieee T Neur Net Lear. 2015;26(7):1403–16.
  26. 26. Gu B, Sheng VS, Li S. Bi-parameter space partition for cost-sensitive SVM. Proceedings of the 24th International Conference on Artificial Intelligence; Buenos Aires, Argentina. 2832741: AAAI Press; 2015. p. 3532–9.
  27. 27. Tang W, Liao Z, Zou Q. Which statistical significance test best detects oncomiRNAs in cancer tissues? An exploratory analysis. Oncotarget. 2016;7(51):85613–23. pmid:27784000
  28. 28. Zhang JR, Ju Y, Lu HJ, Xuan P, Zou Q. Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology. Int J Genomics. 2016.
  29. 29. Oliver S. Guilt-by-association goes global. Nature. 2000;403(6770):601–3. pmid:10688178
  30. 30. Kohler S, Bauer S, Horn D, Robinson PN. Walking the interactome for prioritization of candidate disease genes. The Amerian Journal of Human Genetics. 2008;82(4):949–58.
  31. 31. Navlakha S, Kingsford C. The power of protein interaction networks for associating genes with diseases. Bioinformatics. 2010;26(8):1057–63. pmid:20185403
  32. 32. Care MA, Bradford JR, Needham CJ, Bulpitt AJ, Westhead DR. Combining the interactome and deleterious SNP predictions to improve disease gene identification. Hum Mutat. 2009;30(3):485–92. pmid:19156842
  33. 33. Chen L, Xing Z, Huang T, Shu Y, Huang G, Li H-P. Application of the shortest path algorithm for the discovery of breast cancer related genes. Current Bioinformatics. 2016;11(1):51–8.
  34. 34. Gao Y-F, Shu Y, Yang L, He Y-C, Li L-P, Huang G, et al. A Graphic Method for Identification of Novel Glioma Related Genes. BioMed Research International. 2014;2014:891945. pmid:25050377
  35. 35. Zhang J, Yang J, Huang T, Shu Y, Chen L. Identification of novel proliferative diabetic retinopathy related genes on protein-protein interaction network. Neurocomputing. 2016;217:63–72.
  36. 36. Chen L, Wang B, Wang S, Yang J, Hu J, Xie Z, et al. OPMSP: A computational method integrating protein interaction and sequence information for the identification of novel putative oncogenes. Protein Pept Lett. 2016;23(12):1081–94. pmid:27774893
  37. 37. Chen L, Huang T, Zhang YH, Jiang Y, Zheng M, Cai YD. Identification of novel candidate drivers connecting different dysfunctional levels for lung adenocarcinoma using protein-protein interactions and a shortest path approach. Sci Rep. 2016;6:29849. Epub 2016/07/15. pmid:27412431
  38. 38. Chen L, Yang J, Huang T, Kong X, Lu L, Cai Y-D. Mining for novel tumor suppressor genes using a shortest path approach. Journal of Biomolecular Structure and Dynamics. 2016;34(3):664–75. pmid:26209080
  39. 39. Cai Y-D, Zhang Q, Zhang Y-H, Chen L, Huang T. Identification of genes associated with breast cancer metastasis to bone on a protein-protein interaction network with a shortest path algorithm. J Proteome Res. 2017;16(2):1027–38. pmid:28076954
  40. 40. Chen L, Zhang YH, Huang T, Cai YD. Identifying novel protein phenotype annotations by hybridizing protein-protein interactions and protein sequence similarities. Molecular Genetics and Genomics. 2016;291(2):913–34. pmid:26728152
  41. 41. The UniProt Consortium. Update on activities at the Universal Protein Resource (UniProt) in 2013. Nucleic Acids Res. 2013;41(Database issue):D43–7. Epub 2012/11/20. pmid:23161681
  42. 42. Franceschini A, Szklarczyk D, Frankild S, Kuhn M, Simonovic M, Roth A, et al. STRING v9. 1: protein-protein interaction networks, with increased coverage and integration. Nucleic acids research. 2013;41(D1):D808–D15.
  43. 43. Ng KL, Ciou JS, Huang CH. Prediction of protein functions based on function-function correlation relations. Comput Biol Med. 2010;40(3):300–5. pmid:20089249
  44. 44. Hu LL, Huang T, Shi X, Lu WC, Cai YD, Chou KC. Predicting functions of proteins in mouse based on weighted protein-protein interaction network and protein hybrid properties. PLoS ONE. 2011;6(1):e14556. pmid:21283518
  45. 45. Gormen TH, Leiserson CE, Rivest RL, Stein C, editors. Introduction to algorithms: MIT press Cambridge, MA; 1990.
  46. 46. Chen L, Zeng W-M, Cai Y-D, Feng K-Y, Chou K-C. Predicting Anatomical Therapeutic Chemical (ATC) Classification of Drugs by Integrating Chemical-Chemical Interactions and Similarities. PLoS ONE. 2012;7(4):e35254. pmid:22514724
  47. 47. Wei L, Xing P, Shi G, Ji ZL, Zou Q. Fast prediction of protein methylation sites using a sequence-based feature selection technique. IEEE/ACM Trans Comput Biol Bioinform. 2017.
  48. 48. Wei L, Xing P, Tang J, Zou Q. PhosPred-RF: a novel sequence-based predictor for phosphorylation sites using sequential information only. IEEE Trans Nanobioscience. 2017.
  49. 49. Kohavi R, editor A study of cross-validation and bootstrap for accuracy estimation and model selection. International joint Conference on artificial intelligence; 1995: Lawrence Erlbaum Associates Ltd.
  50. 50. Akira S, Uematsu S, Takeuchi O. Pathogen recognition and innate immunity. Cell. 2006;124(4):783–801. pmid:16497588
  51. 51. Szajnik M, Szczepanski MJ, Czystowska M, Elishaev E, Mandapathil M, Nowak-Markwitz E, et al. TLR4 signaling induced by lipopolysaccharide or paclitaxel regulates tumor survival and chemoresistance in ovarian cancer. Oncogene. 2009;28(49):4353–63. pmid:19826413
  52. 52. Szczepanski MJ, Czystowska M, Szajnik M, Harasymczuk M, Boyiadzis M, Kruk-Zagajewska A, et al. Triggering of Toll-like receptor 4 expressed on human head and neck squamous cell carcinoma promotes tumor development and protects the tumor from immune attack. Cancer Res. 2009;69(7):3105–13. pmid:19318560
  53. 53. Tucci M, Stucci S, Passarelli A, Giudice G, Dammacco F, Silvestris F. The immune escape in melanoma: role of the impaired dendritic cell function. Expert review of clinical immunology. 2014;10(10):1395–404. pmid:25225774
  54. 54. Cancer Genome Atlas Research N. Comprehensive genomic characterization of squamous cell lung cancers. Nature. 2012;489(7417):519–25. pmid:22960745
  55. 55. Dutt A, Salvesen HB, Chen TH, Ramos AH, Onofrio RC, Hatton C, et al. Drug-sensitive FGFR2 mutations in endometrial carcinoma. Proc Natl Acad Sci U S A. 2008;105(25):8713–7. pmid:18552176
  56. 56. Byron SA, Gartside MG, Wellens CL, Mallon MA, Keenan JB, Powell MA, et al. Inhibition of activated fibroblast growth factor receptor 2 in endometrial cancer cells induces cell death despite PTEN abrogation. Cancer Res. 2008;68(17):6902–7. pmid:18757403
  57. 57. Gozgit JM, Squillace RM, Wongchenko MJ, Miller D, Wardwell S, Mohemmad Q, et al. Combined targeting of FGFR2 and mTOR by ponatinib and ridaforolimus results in synergistic antitumor activity in FGFR2 mutant endometrial cancer models. Cancer Chemother Pharmacol. 2013;71(5):1315–23. pmid:23468082
  58. 58. Liao RG, Jung J, Tchaicha J, Wilkerson MD, Sivachenko A, Beauchamp EM, et al. Inhibitor-sensitive FGFR2 and FGFR3 mutations in lung squamous cell carcinoma. Cancer Res. 2013;73(16):5195–205. pmid:23786770
  59. 59. Zaharieva BM, Simon R, Diener PA, Ackermann D, Maurer R, Alund G, et al. High-throughput tissue microarray analysis of 11q13 gene amplification (CCND1, FGF3, FGF4, EMS1) in urinary bladder cancer. The Journal of pathology. 2003;201(4):603–8. pmid:14648664
  60. 60. Sugahara K, Michikawa Y, Ishikawa K, Shoji Y, Iwakawa M, Shibahara T, et al. Combination effects of distinct cores in 11q13 amplification region on cervical lymph node metastasis of oral squamous cell carcinoma. International journal of oncology. 2011;39(4):761–9. pmid:21701773
  61. 61. Karin M, Greten FR. NF-kappaB: linking inflammation and immunity to cancer development and progression. Nat Rev Immunol. 2005;5(10):749–59. Epub 2005/09/22. pmid:16175180
  62. 62. Pikarsky E, Porat RM, Stein I, Abramovitch R, Amit S, Kasem S, et al. NF-kappaB functions as a tumour promoter in inflammation-associated cancer. Nature. 2004;431(7007):461–6. pmid:15329734
  63. 63. Bu H, Rosdahl I, Sun XF, Zhang H. Importance of polymorphisms in NF-kappaB1 and NF-kappaBIalpha genes for melanoma risk, clinicopathological features and tumor progression in Swedish melanoma patients. J Cancer Res Clin Oncol. 2007;133(11):859–66. pmid:17492467
  64. 64. Zhang P, Wei Q, Li X, Wang K, Zeng H, Bu H, et al. A functional insertion/deletion polymorphism in the promoter region of the NFKB1 gene increases susceptibility for prostate cancer. Cancer Genet Cytogenet. 2009;191(2):73–7. pmid:19446741
  65. 65. Lo SS, Chen JH, Wu CW, Lui WY. Functional polymorphism of NFKB1 promoter may correlate to the susceptibility of gastric cancer in aged patients. Surgery. 2009;145(3):280–5. pmid:19231580
  66. 66. Lin CW, Hsieh YS, Hsin CH, Su CW, Lin CH, Wei LH, et al. Effects of NFKB1 and NFKBIA gene polymorphisms on susceptibility to environmental factors and the clinicopathologic development of oral cancer. PLoS One. 2012;7(4):e35078. pmid:22509384
  67. 67. Bevilacqua MP, Nelson RM. Selectins. J Clin Invest. 1993;91(2):379–87. pmid:7679406
  68. 68. Lauri D, Needham L, Martin-Padura I, Dejana E. Tumor cell adhesion to endothelial cells: endothelial leukocyte adhesion molecule-1 as an inducible adhesive receptor specific for colon carcinoma cells. Journal of the National Cancer Institute. 1991;83(18):1321–4. pmid:1715924
  69. 69. Wenzel CT, Scher RL, Richtsmeier WJ. Adhesion of head and neck squamous cell carcinoma to endothelial cells. The missing links. Archives of otolaryngology—head & neck surgery. 1995;121(11):1279–86.
  70. 70. Chang PY, Kuo YB, Wu TL, Liao CT, Sun YC, Yen TC, et al. Association and prognostic value of serum inflammation markers in patients with leukoplakia and oral cavity cancer. Clinical chemistry and laboratory medicine: CCLM / FESCC. 2013;51(6):1291–300.
  71. 71. Lahav J. The functions of thrombospondin and its involvement in physiology and pathophysiology. Biochim Biophys Acta. 1993;1182(1):1–14. pmid:8347680
  72. 72. Yao L, Zhao YL, Itoh S, Wada S, Yue L, Furuta I. Thrombospondin-1 expression in oral squamous cell carcinomas: correlations with tumor vascularity, clinicopathological features and survival. Oral Oncol. 2000;36(6):539–44. pmid:11036248
  73. 73. Pal SK, Nguyen CT, Morita KI, Miki Y, Kayamori K, Yamaguchi A, et al. THBS1 is induced by TGFB1 in the cancer stroma and promotes invasion of oral squamous cell carcinoma. Journal of oral pathology & medicine: official publication of the International Association of Oral Pathologists and the American Academy of Oral Pathology. 2016.
  74. 74. Durbin JE, Hackenmiller R, Simon MC, Levy DE. Targeted disruption of the mouse Stat1 gene results in compromised innate immunity to viral disease. Cell. 1996;84(3):443–50. pmid:8608598
  75. 75. Xi S, Dyer KF, Kimak M, Zhang Q, Gooding WE, Chaillet JR, et al. Decreased STAT1 expression by promoter methylation in squamous cell carcinogenesis. Journal of the National Cancer Institute. 2006;98(3):181–9. pmid:16449678
  76. 76. Laimer K, Spizzo G, Obrist P, Gastl G, Brunhuber T, Schafer G, et al. STAT1 activation in squamous cell cancer of the oral cavity: a potential predictive marker of response to adjuvant chemotherapy. Cancer. 2007;110(2):326–33. pmid:17559122
  77. 77. Haroun F, Al-Shaar L, Habib RH, El-Saghir N, Tfayli A, Bazarbachi A, et al. Effects of CYP2B6 genetic polymorphisms in patients receiving cyclophosphamide combination chemotherapy for breast cancer. Cancer Chemother Pharmacol. 2015;75(1):207–14. pmid:25428516
  78. 78. Li Y, Zhao W, Zhao Z, Wu J, Chen L, Ma Y, et al. IL1B gene polymorphisms, age and the risk of non-small cell lung cancer in a Chinese population. Lung Cancer. 2015;89(3):232–7. pmid:26141218
  79. 79. Zeigler-Johnson C, Friebel T, Walker AH, Wang Y, Spangler E, Panossian S, et al. CYP3A4, CYP3A5, and CYP3A43 genotypes and haplotypes in the etiology and severity of prostate cancer. Cancer Res. 2004;64(22):8461–7. pmid:15548719
  80. 80. Chen L, Huang Z, Yao G, Lyu X, Li J, Hu X, et al. The expression of CXCL13 and its relation to unfavorable clinical characteristics in young breast cancer. J Transl Med. 2015;13:168. pmid:25990390
  81. 81. Walser TC, Ma X, Kundu N, Dorsey R, Goloubeva O, Fulton AM. Immune-mediated modulation of breast cancer growth and metastasis by the chemokine Mig (CXCL9) in a murine model. Journal of immunotherapy. 2007;30(5):490–8. pmid:17589289
  82. 82. Walser TC, Rifat S, Ma X, Kundu N, Ward C, Goloubeva O, et al. Antagonism of CXCR3 inhibits lung metastasis in a murine model of metastatic breast cancer. Cancer Res. 2006;66(15):7701–7. pmid:16885372
  83. 83. Kawada K, Hosogi H, Sonoshita M, Sakashita H, Manabe T, Shimahara Y, et al. Chemokine receptor CXCR3 promotes colon cancer metastasis to lymph nodes. Oncogene. 2007;26(32):4679–88. pmid:17297455
  84. 84. Ma X, Norsworthy K, Kundu N, Rodgers WH, Gimotty PA, Goloubeva O, et al. CXCR3 expression is associated with poor survival in breast cancer and promotes metastasis in a murine model. Mol Cancer Ther. 2009;8(3):490–8. pmid:19276169
  85. 85. Cambien B, Karimdjee BF, Richard-Fiardo P, Bziouech H, Barthel R, Millet MA, et al. Organ-specific inhibition of metastatic colon carcinoma by CXCR3 antagonism. British journal of cancer. 2009;100(11):1755–64. pmid:19436305
  86. 86. Pradelli E, Karimdjee-Soilihi B, Michiels JF, Ricci JE, Millet MA, Vandenbos F, et al. Antagonism of chemokine receptor CXCR3 inhibits osteosarcoma metastasis to lungs. International journal of cancer Journal international du cancer. 2009;125(11):2586–94. pmid:19544560
  87. 87. Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57. Epub 2009/01/10. pmid:19131956
  88. 88. Kim BK, Lee JW, Park PJ, Shin YS, Lee WY, Lee KA, et al. The multiplex bead array approach to identifying serum biomarkers associated with breast cancer. Breast cancer research: BCR. 2009;11(2):R22. pmid:19400944
  89. 89. Key TJ, Appleby PN, Reeves GK, Roddam AW, Breast TEH. Insulin-like growth factor 1 (IGF1), IGF binding protein 3 (IGFBP3), and breast cancer risk: pooled individual data analysis of 17 prospective studies. Lancet Oncology. 2010;11(6):530–42. pmid:20472501
  90. 90. Karamouzis MV, Papavassiliou AG. Targeting insulin-like growth factor in breast cancer therapeutics. Crit Rev Oncol Hematol. 2012;84(1):8–17. pmid:22424863
  91. 91. Polyak K, Lee MH, Erdjumentbromage H, Koff A, Roberts JM, Tempst P, et al. Cloning of P27(Kip1), a Cyclin-Dependent Kinase Inhibitor and a Potential Mediator of Extracellular Antimitogenic Signals. Cell. 1994;78(1):59–66. pmid:8033212
  92. 92. Chang BL, Zheng SQL, Isaacs SD, Wiley KE, Turner A, Li G, et al. A polymorphism in the CDKN1B gene is associated with increased risk of hereditary prostate cancer. Cancer Research. 2004;64(6):1997–9. pmid:15026335
  93. 93. Lang T, Nie YL. MiR-148a participates in the growth of RPMI8226 multiple myeloma cells by regulating CDKN1B. Biomedicine & Pharmacotherapy. 2016;84:1967–71.
  94. 94. Nanda MS, Shah ZA, Sameer AS, Syeed N, Murtaza I, Siddiqi MA, et al. TP53-Molecular Soldier's Mutations in Bladder Cancer in the Kashmiri Population. Asian Pacific Journal of Cancer Prevention. 2011;12(1):67–72. pmid:21517233
  95. 95. Cheng LA, Zhang SB, MacLennan GT, Williamson SR, Lopez-Beltran A, Montironi R. Bladder cancer: translating molecular genetic insights into clinical practice. Human Pathology. 2011;42(4):455–81. pmid:21106220
  96. 96. Levine AJ, Oren M. The first 30 years of p53: growing ever more complex. Nat Rev Cancer. 2009;9(10):749–58. pmid:19776744
  97. 97. Nishiyama H, Watanabe J, Ogawa O. p53 and chemosensitivity in bladder cancer. Int J Clin Oncol. 2008;13(4):282–6. pmid:18704627
  98. 98. Wang SJ, Gu W. To be, or not to be: functional dilemma of p53 metabolic regulation. Curr Opin Oncol. 2014;26(1):78–85. pmid:24240177
  99. 99. Evan GI, Wyllie AH, Gilbert CS, Littlewood TD, Land H, Brooks M, et al. Induction of Apoptosis in Fibroblasts by C-Myc Protein. Cell. 1992;69(1):119–28. pmid:1555236
  100. 100. Bissonnette RP, Echeverri F, Mahboubi A, Green DR. Apoptotic cell death induced by c-myc is inhibited by bcl-2. Nature. 1992;359(6395):552–4. pmid:1406975
  101. 101. Egle A, Harris AW, Bouillet P, Cory S. Bim is a suppressor of Myc-induced mouse B cell leukemia. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(16):6164–9. pmid:15079075
  102. 102. Murphy DJ, Junttila MR, Pouyet L, Karnezis A, Shchors K, Bui DA, et al. Distinct thresholds govern Myc's biological output in vivo. Cancer Cell. 2008;14(6):447–57. pmid:19061836
  103. 103. Mitchell KO, Ricci MS, Miyashita T, Dicker DT, Jin ZY, Reed JC, et al. Bax is a transcriptional target and mediator of c-Myc-induced apoptosis. Cancer Research. 2000;60(22):6318–25. pmid:11103792
  104. 104. Nicholson DW, Thornberry NA. Caspases: killer proteases. Trends Biochem Sci. 1997;22(8):299–306. pmid:9270303
  105. 105. Dang CV, O'Donnell KA, Zeller KI, Nguyen T, Osthus RC, Li F. The c-Myc target gene network. Seminars in cancer biology. 2006;16(4):253–64. pmid:16904903
  106. 106. Zhao X, Mohaupt M, Jiang J, Liu S, Li B, Qin Z. Tumor necrosis factor receptor 2-mediated tumor suppression is nitric oxide dependent and involves angiostasis. Cancer Res. 2007;67(9):4443–50. pmid:17483359
  107. 107. Qin Z, Kruger-Krasagakes S, Kunzendorf U, Hock H, Diamantstein T, Blankenstein T. Expression of tumor necrosis factor by different tumor cell lines results either in tumor suppression or augmented metastasis. J Exp Med. 1993;178(1):355–60. pmid:8315391
  108. 108. Blankenstein T, Qin ZH, Uberla K, Muller W, Rosen H, Volk HD, et al. Tumor Suppression after Tumor-Cell Targeted Tumor-Necrosis-Factor-Alpha Gene-Transfer. Journal of Experimental Medicine. 1991;173(5):1047–52. pmid:2022919
  109. 109. Oliff A, Defeo-Jones D, Boyer M, Martinez D, Kiefer D, Vuocolo G, et al. Tumors secreting human TNF/cachectin induce cachexia in mice. Cell. 1987;50(4):555–63. pmid:3607879
  110. 110. Kulbe H, Thompson R, Wilson JL, Robinson S, Hagemann T, Fatah R, et al. The inflammatory cytokine tumor necrosis factor-alpha generates an autocrine tumor-promoting network in epithelial ovarian cancer cells. Cancer Res. 2007;67(2):585–92. pmid:17234767
  111. 111. Ben-Neriah Y, Karin M. Inflammation meets cancer, with NF-kappaB as the matchmaker. Nat Immunol. 2011;12(8):715–23. pmid:21772280
  112. 112. DiDonato JA, Mercurio F, Karin M. NF-kappa B and the link between inflammation and cancer. Immunol Rev. 2012;246:379–400. pmid:22435567
  113. 113. Kang MH, Oh SC, Lee HJ, Kang HN, Kim JL, Kim JS, et al. Metastatic function of BMP-2 in gastric cancer cells: the role of PI3K/AKT, MAPK, the NF-kappaB pathway, and MMP-9 expression. Exp Cell Res. 2011;317(12):1746–62. pmid:21570392
  114. 114. Graham TR, Odero-Marah VA, Chung LW, Agrawal KC, Davis R, Abdel-Mageed AB. PI3K/Akt-dependent transcriptional regulation and activation of BMP-2-Smad signaling by NF-kappaB in metastatic prostate cancer cells. The Prostate. 2009;69(2):168–80. pmid:18942118
  115. 115. Levine RL, Pardanani A, Tefferi A, Gilliland DG. Role of JAK2 in the pathogenesis and therapy of myeloproliferative disorders. Nat Rev Cancer. 2007;7(9):673–83. pmid:17721432
  116. 116. Silva M, Richard C, Benito A, Sanz C, Olalla I, Fernandez-Luna JL. Expression of Bcl-x in erythroid precursors from patients with polycythemia vera. N Engl J Med. 1998;338(9):564–71. pmid:9475763
  117. 117. Sansone P, Bromberg J. Targeting the Interleukin-6/Jak/Stat Pathway in Human Malignancies. Journal of Clinical Oncology. 2012;30(9):1005–14. pmid:22355058
  118. 118. Traer E, MacKenzie R, Snead J, Agarwal A, Eiring AM, O'Hare T, et al. Blockade of JAK2-mediated extrinsic survival signals restores sensitivity of CML cells to ABL inhibitors. Leukemia. 2012;26(5):1140–3. pmid:22094585
  119. 119. Groeger S, Jarzina F, Domann E, Meyle J. Porphyromonas gingivalis activates NFkappaB and MAPK pathways in human oral epithelial cells. BMC Immunol. 2017;18(1):1. pmid:28056810
  120. 120. McCubrey JA, Steelman LS, Chappell WH, Abrams SL, Franklin RA, Montalto G, et al. Ras/Raf/MEK/ERK and PI3K/PTEN/Akt/mTOR Cascade Inhibitors: How Mutations Can Result in Therapy Resistance and How to Overcome Resistance. Oncotarget. 2012;3(10):1068–111. pmid:23085539
  121. 121. Wei L, Tang J, Zou Q. Local-DPP: An improved DNA-binding protein prediction method by exploring local evolutionary information. Inform Sciences. 2017;384:135–44.
  122. 122. Wei L, Zou Q. Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition. International journal of molecular sciences. 2016;17(12).
  123. 123. Wei L, Liao M, Gao X, Zou Q. Enhanced Protein Fold Prediction Method Through a Novel Feature Extraction Technique. IEEE Trans Nanobioscience. 2015;14(6):649–59. pmid:26335556
  124. 124. Su R, Zhang C, Pham TD, Davey R, Bischof L, Vallotton P, et al. Detection of tubule boundaries based on circular shortest path and polar-transformation of arbitrary shapes. Journal of microscopy. 2016;264(2):127–42. pmid:27172164
  125. 125. Bromberg J. Stat proteins and oncogenesis. J Clin Invest. 2002;109(9):1139–42. pmid:11994401
  126. 126. Wu YW, Tang W, Zuo JP. Toll-like receptors: potential targets for lupus treatment. Acta pharmacologica Sinica. 2015;36(12):1395–407. pmid:26592511
  127. 127. Xiang W, Chao ZY, Feng DY. Role of Toll-like receptor/MYD88 signaling in neurodegenerative diseases. Reviews in the neurosciences. 2015;26(4):407–14. pmid:25870959
  128. 128. Muller M, Herrath J, Malmstrom V. IL-1R1 is expressed on both Helios(+) and Helios(-)FoxP3(+)CD4(+) T cells in the rheumatic joint. Clinical and Experimental Immunology. 2015;182(1):90–100. pmid:26076982
  129. 129. Danielson PB. The cytochrome P450 superfamily: biochemistry, evolution and drug metabolism in humans. Curr Drug Metab. 2002;3(6):561–97. pmid:12369887
  130. 130. Haudenschild DR, Palmer SM, Moseley TA, You Z, Reddi AH. Bone morphogenetic protein (BMP)-6 signaling and BMP antagonist noggin in prostate cancer. Cancer Res. 2004;64(22):8276–84. pmid:15548695
  131. 131. Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694–700. pmid:16597617
  132. 132. Zhou H, Wu J, Wang T, Zhang X, Liu D. CXCL10/CXCR3 axis promotes the invasion of gastric cancer via PI3K/AKT pathway-dependent MMPs production. Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie. 2016;82:479–88.
  133. 133. Pranzatelli MR, Tate ED, McGee NR, Travelstead AL, Verhulst SJ, Ransohoff RM. Expression of CXCR3 and its ligands CXCL9, -10 and -11 in paediatric opsoclonus-myoclonus syndrome. Clin Exp Immunol. 2013;172(3):427–36. pmid:23600831
  134. 134. Baenziger NL, Brodie GN, Majerus PW. Isolation and properties of a thrombin-sensitive protein of human platelets. J Biol Chem. 1972;247(9):2723–31. pmid:4260214
  135. 135. Bornstein P. Diversity of function is inherent in matricellular proteins: an appraisal of thrombospondin 1. The Journal of cell biology. 1995;130(3):503–6. pmid:7542656
  136. 136. Labib K, Tercero JA, Diffley JF. Uninterrupted MCM2-7 function required for DNA replication fork progression. Science. 2000;288(5471):1643–7. pmid:10834843
  137. 137. Elcin AE, Parmaksiz M, Dogan A, Seker S, Durkut S, Dalva K, et al. Differential gene expression profiling of human adipose stem cells differentiating into smooth muscle-like cells by TGFbeta1/BMP4. Exp Cell Res. 2017.
  138. 138. Vallinoto AC, Graca ES, Araujo MS, Azevedo VN, Cayres-Vallinoto I, Machado LF, et al. IFNG +874T/A polymorphism and cytokine plasma levels are associated with susceptibility to Mycobacterium tuberculosis infection and clinical manifestation of tuberculosis. Human immunology. 2010;71(7):692–6. pmid:20353805
  139. 139. Elgizouli M, Lowe DM, Speckmann C, Schubert D, Hulsdunker J, Eskandarian Z, et al. Activating PI3Kdelta mutations in a cohort of 669 patients with primary immunodeficiency. Clin Exp Immunol. 2016;183(2):221–9. pmid:26437962
  140. 140. Wang G, Wang H, Zhang C, Liu T, Li Q, Lin X, et al. Rac3 regulates cell proliferation through cell cycle pathway and predicts prognosis in lung adenocarcinoma. Tumour Biol. 2016;37(9):12597–607. pmid:27402308
  141. 141. Haataja L, Kaartinen V, Groffen J, Heisterkamp N. The small GTPase Rac3 interacts with the integrin-binding protein CIB and promotes integrin alpha(IIb)beta(3)-mediated adhesion and spreading. J Biol Chem. 2002;277(10):8321–8. pmid:11756406
  142. 142. Mira JP, Benard V, Groffen J, Sanders LC, Knaus UG. Endogenous, hyperactive Rac3 controls proliferation of breast cancer cells by a p21-activated kinase-dependent pathway. Proc Natl Acad Sci U S A. 2000;97(1):185–9. pmid:10618392