Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Fusion of KATZ measure and space projection to fast probe potential lncRNA-disease associations in bipartite graphs

  • Yi Zhang ,

    Contributed equally to this work with: Yi Zhang, Min Chen

    Roles Conceptualization, Data curation, Writing – original draft, Writing – review & editing

    Affiliations School of Information Science and Engineering, Guilin University of Technology, Guilin, China, Guangxi Key Laboratory of Embedded Technology and Intelligent System, Guilin University of Technology, Guilin, China

  • Min Chen ,

    Contributed equally to this work with: Yi Zhang, Min Chen

    Roles Conceptualization, Formal analysis, Methodology

    chenmin@hnit.edu.cn

    Affiliation School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, China

  • Li Huang,

    Roles Validation

    Affiliations Academy of Arts and Design, Tsinghua University, Beijing, China, The Future Laboratory, Tsinghua University, Beijing, China

  • Xiaolan Xie,

    Roles Funding acquisition

    Affiliation School of Information Science and Engineering, Guilin University of Technology, Guilin, China

  • Xin Li,

    Roles Software

    Affiliation School of Information Science and Engineering, Guilin University of Technology, Guilin, China

  • Hong Jin,

    Roles Validation

    Affiliation School of Information Science and Engineering, Guilin University of Technology, Guilin, China

  • Xiaohua Wang,

    Roles Resources

    Affiliation Pharmacy School, Guilin Medical University, Guilin, China

  • Hanyan Wei

    Roles Resources

    Affiliation Pharmacy School, Guilin Medical University, Guilin, China

Abstract

It is well known that numerous long noncoding RNAs (lncRNAs) closely relate to the physiological and pathological processes of human diseases and can serves as potential biomarkers. Therefore, lncRNA-disease associations that are identified by computational methods as the targeted candidates reduce the cost of biological experiments focusing on deep study furtherly. However, inaccurate construction of similarity networks and inadequate numbers of observed known lncRNA–disease associations, such inherent problems make many mature computational methods that have been developed for many years still exit some limitations. It motivates us to explore a new computational method that was fused with KATZ measure and space projection to fast probing potential lncRNA-disease associations (namely KATZSP). KATZSP is comprised of following key steps: combining all the global information with which to change Boolean network of known lncRNA–disease associations into the weighted networks; changing the similarities calculation into counting the number of walks that connect lncRNA nodes and disease nodes in bipartite graphs; obtaining the space projection scores to refine the primary prediction scores. The process to fuse KATZ measure and space projection was simplified and uncomplicated with needing only one attenuation factor. The leave-one-out cross validation (LOOCV) experimental results showed that, compared with other state-of-the-art methods (NCPLDA, LDAI-ISPS and IIRWR), KATZSP had a higher predictive accuracy shown with area-under-the-curve (AUC) value on the three datasets built, while KATZSP well worked on inferring potential associations related to new lncRNAs (or isolated diseases). The results from real cases study (such as pancreas cancer, lung cancer and colorectal cancer) further confirmed that KATZSP is capable of superior predictive ability to be applied as a guide for traditional biological experiments.

Introduction

Long non-coding RNAs (lncRNAs) whose length are longer than 200 nucleotides (nt) have crucial roles in gene expression control during developmental and differentiational processes [1]. Therefore, there is no surprise that mutation and dysregulation of lncRNAs could contribute to the development of various human complex diseases [2], such as HOTAIR in breast cancer [3] and MALAT1 in early-stage non-small cell lung cancer [4]. LncRNAs can also drive many important cancer phenotypes through their interactions with other cellular macromolecules including DNA, protein, and RNA [58]. There is urgent need to discern potential functional roles of lncRNAs to further study the pathology, diagnosis, therapy, prognosis, prevention of human complex diseases, and detect disease biomarkers at lncRNA level [9, 10]. With strong data support from lncRNA related databases (such as LncRNAdb [11], LncRNADisease [12], NRED [13], and NONCODE [14]) and similarity calculation based on miRNA information [1520], the computational prediction models that were built to infer lncRNA–disease associations could supply more accurate targeted candidates [21]: 1) saving cost and time for biological experiments; 2) making bio-experiments focus on deeper study of targets; 3) speeding up understanding the pathogenesis of complex diseases.

The computational models used for inferring lncRNA–disease associations have been divided into three main categories: 1) Machine learning-based inferring models use naive Bayesian classifier model [22, 23], support vector machine (SVM) [24, 25], matrix completion [26, 27], matrix factorization [2830] to infer potential lncRNA–disease associations. However, the models categorized to this category are not able to achieve high predictive accuracy. 2) Network-based inferring models, based on the biological premise that lncRNAs with similar functions tend to be associated with similar diseases [31, 32], use random walk [3335], KATZ measure [36, 37], hyper geometric distribution [15], label propagation algorithm [38], propagating information streams [39], lncRNA-miRNA interaction [15, 30] to identify potential lncRNA–disease associations. Nevertheless, the models categorized to this category rely heavily on the information integrated from diverse biological data sources, and it is difficult to integrate heterogeneous data from multiple sources deeply. 3) Convolutional neural network (CNN) based inferring models [4043], are at the early research stage, with consuming relatively high time complexity and relying on the quality of multiple sources biological data as well. Therefore, those above models still have different limitations, such as, needing negative samples, not being able to infer associations related to isolated diseases and new lncRNAs directly, not high accuracy with singular methodology. Addressing these limitations, we explored a novel prediction method based on the fusion of KATZ Measure and Space Projection to infer potential lncRNA-disease associations in bipartite graphs, namely KATZSP.

KATZ measure such a graph-based computational method could be used to transform the problem of calculating similarities between nodes to link prediction in bipartite graph. In the context of lncRNA-disease association prediction, the heterogeneous networks are represented by matrices (also called bipartite graph). Therefore, calculating similarities between the nodes of lncRNAs and diseases is further transformed into the problem of counting the number of walks that connect the interactive lncRNA-disease pairs in bipartite graph. Furthermore, the number of walks as the lengths decided the potential association probability of this lncRNA-disease pair [36, 44]. Space projection method [45, 46] could improve the lncRNA-disease association predictive ability easily with few regulation parameters, even though the known lncRNA-disease associations exist inherent data sparsity. After simplified and uncomplicated fusion process, KATZ measure and space projection method were fused to form an integrated computational model KATZSP with needing only one attenuation factor, while dropping above limitations.

Experimental evaluation and discussion

Evaluation metrics

Leave One Out Cross Validation (LOOCV) experiments were implemented for evaluating the predictive performance of KATZSP. We divided the dataset of known associations into two parts: the testing subset and the training subset. In the testing subset, each known association was used as a test data in turn, and the remaining known associations formed the training subset. Under the framework of LOOCV, we compared the prediction results on some specific threshold to obtain the following four metrics: true positive (TP), false positive (FP), false negative (FN), true negative (TN). Furthermore, according to some specified thresholds, we calculated the true positive rate () against false positive rate () with which to plot out the receiver operating characteristic curve (ROC). The area under the ROC curve (AUC) was finally calculated to numerically evaluate the overall predictive performance of KATZSP.

Impact with parameter selection

Coefficient β plays as an attenuation factor of weight to control the contribution of lengths coming from walks on calculating the similarities between any two interactive nodes. According to the convergence properties of sequences required by KATZ method, the value of β should be less than the reciprocal of the max-eigenvalue of the adjacency matrix A. In order to obtain the optimal value of β, we set β = 1/max(eig(A))*K where max(eig(A)) denotes the max-eigenvalue of adjacency matrix A. Then the value of K was increased from 0.1 to 0.9 with step size of 0.1. With changing the value of K, LOOCV was implemented on all the three datasets built (dataset 1, dataset 2 and dataset 3). The results in Fig 1 showed that AUC could achieve the maximum value on all the three datasets when K = 0.1.

thumbnail
Fig 1. Impact with parameter variation on model prediction accuracy.

https://doi.org/10.1371/journal.pone.0260329.g001

Compare predictive abilities under different solutions

To demonstrate how our technical solution selected performed better than others, LOOCV experiments were implemented under following four technical solutions: only using space projection (SP), only using KATZ (KATZ), using space project first and then KATZ (SPKATZ), using KATZ first and then space projection (KATZSP). The results compared on three datasets (dataset 1, dataset 2 and dataset 3) were shown in Figs 24, respectively.

thumbnail
Fig 2. Predictive abilities with different technical solutions on dataset 1.

https://doi.org/10.1371/journal.pone.0260329.g002

thumbnail
Fig 3. Predictive abilities with different technical solutions on dataset 2.

https://doi.org/10.1371/journal.pone.0260329.g003

thumbnail
Fig 4. Predictive abilities with different technical solutions on dataset 3.

https://doi.org/10.1371/journal.pone.0260329.g004

From the comparison results shown in Figs 24, we easily found the solution used in our model (KATZSP) achieved AUC values of 0.9324, 0.9403 and 0.9472 on dataset 1, dataset 2 and dataset 3, respectively. Among above four solutions, our KATZSP which performed the best predictive ability on all three datasets with distinct advantage than other three solutions.

Compare performance with other models

To further demonstrate the reliable predictive ability of our model, we chose some the-state-of-art computational models in similar type (NCPLDA [47], LDAI-ISPS [48] and IIRWR [49]) to compare with our model in the framework of LOOCV. To make comparison fairly, we configured the same experimental environment and condition for all models on dataset 1, dataset 2 and dataset 3. From the comparison results shown in Figs 57, our KATZSP achieved the highest AUC values on all three datasets with detail analysis shown in Table 1.

thumbnail
Fig 5. Predictive abilities of KATZSP and other models on dataset 1.

https://doi.org/10.1371/journal.pone.0260329.g005

thumbnail
Fig 6. Predictive abilities of KATZSP and other models on dataset 2.

https://doi.org/10.1371/journal.pone.0260329.g006

thumbnail
Fig 7. Predictive abilities of KATZSP and other models on dataset 3.

https://doi.org/10.1371/journal.pone.0260329.g007

thumbnail
Table 1. AUCs of KATZSP and other models on all three datasets.

https://doi.org/10.1371/journal.pone.0260329.t001

Verify predictive ability for new lncRNAs and isolated diseases

To implement the verification in this section, we simulated each lncRNA in the known lncRNA-disease associations dataset to be a new lncRNA by removing all known associations relating to it. Similarly, we simulated each disease in the known lncRNA-disease associations dataset to be an isolated disease by removing all known associations relating to it. Each new lncRNA (or isolated disease) simulated was specified to be the test sample for model evaluation and the rest lncRNAs (or diseases) in the known lncRNA-disease associations dataset worked as the training samples for model learning. Until the associations between each new lncRNA and diseases or the associations between lncRNAs and each isolated disease were inferred by our KATZSP, the inferred results on dataset 1, dataset 2 and dataset 3 were shown in Fig 8.

thumbnail
Fig 8. Predictive ability of KATZSP for new lncRNAs and isolated diseases.

https://doi.org/10.1371/journal.pone.0260329.g008

With the AUC values in Fig 8, it demonstrated that our KATZSP could be effectively applied to infer associations related to new lncRNAs and associations related to isolated diseases.

Cases study

Case study for three specific diseases

To further demonstrate the predictive performance of our KATZSP on real cases study, we selected three specific diseases (pancreas cancer, lung cancer and colorectal cancer) as the cases to examine. With using the training samples composed of the known associations in dataset 2 and the testing samples composed of the unknown associations, our KATZSP focused on inferring the potential lncRNAs relating to above three cases. The lncRNAs with the top five highest prediction scores of each case were listed in Table 2. If the same associations predicted by KATZSP were also found in some literatures or the newest databases, such as LncRNADisease 2.0 (http://www.rnanut.net/lncrnadisease) and Lnc2Cancer 3.0 (http://www.biobigdata.net/lnc2cancer), it could further validate with the supporting evidences that our KATZSP was capable of the reliable predictive ability and practicability.

thumbnail
Table 2. Top 5 specific diseases-related candidate lncRNAs.

https://doi.org/10.1371/journal.pone.0260329.t002

Case study for isolated diseases

In recent years, many new diseases without any known association r lncRNAs have been gradually discovered, namely isolated diseases. It is important to verify if our KATZSP could be applied to infer the potential lncRNAs associated to such kind of isolated diseases. Above three cases (pancreas cancer, lung cancer and colon cancer) were simulated as the isolated diseases by removing all known associations relating to them in dataset 2. Our KATZSP only used other information to infer the potential lncRNAs associated with these three isolated diseases simulated. The top five lncRNAs with highest prediction scores of each disease were listed in Table 3 where only two prediction results (TC0101441 and KRASP1) couldn’t be found supporting evidence from any databases or published literatures.

thumbnail
Table 3. Top 5 specific isolated diseases-related candidate lncRNAs.

https://doi.org/10.1371/journal.pone.0260329.t003

In Tables 2 and 3, all predicted results except two were confirmed with extra evidences, which validated our KATZSP could be effectively applied in real life with supplying calculated candidates to guide biological experiments.

Materials and methods

Obtain data source

Known lncRNA-disease associations.

From a publicly accessible address at http://www.cuilab.cn/lncrnadisease, three versions of the databases which consist of associations between lncRNAs and human diseases were obtained for our work. With processing of the database in version 2013, we built a new dataset (namely dataset 1) with 352 known lncRNA–disease associations involved in 156 lncRNAs and 190 diseases. With processing of the database in version 2016, a new-built dataset (namely dataset 2) consists of 621 known lncRNA–disease associations involved in 285 lncRNAs and 226 diseases. With processing of the database in version 2017, a similar new-built dataset (namely dataset 3) consists of 1695 known lncRNA–disease associations involved in 828 lncRNAs and 314 diseases. The observed lncRNA–disease associations with lncRNA nodes and disease nodes form the bipartite graph denoted by the Boolean matrix LD = (ldij)nl×nd, whose element ldij is 1 when lncRNA li relates to disease dj. Otherwise, the value of element ldij is 0. The number of lncRNAs and the number of diseases in matrix LD are denoted by nl and nd, respectively.

Disease–disease semantic similarity.

Referring to the description by Wang et al. [51], in DAG (Directed Acyclic Graph), the contribution of a disease dt to the semantics of disease di has following definition with denotation of : (1) where Δ was set to be the most suitable value of 0.5.

Based on both the addresses of diseases in DAG graphs and the semantic relations with ancestor diseases, the element ddij in matrix DD = (ddij)nd×nd denotes the semantic similarity between diseases di and dj with definition as follows: (2) where is the set of all ancestor nodes relating to disease di, including node di itself in DAG.

LncRNA–lncRNA functional similarity.

How to accurately measure the functional similarity between two lncRNAs was detailly descripted in many literatures [4749, 52]. A group of diseases which have associations with lncRNA li were denoted by , and the similarity between any disease dt in and the whole set has following definition: (3)

Similarly, set denotes a group of diseases associate with lncRNA lj. The similarity between any disease dt in and the whole set has following definition: (4)

Functional similarities between the lncRNAs were denoted by LL = (llij)nl×nl whose element llij represents the functional similarity between li and lj with calculation as follows: (5)

Central similarity of the Gaussian interaction profile.

Compared to the number of unknown lncRNA–disease associations, the number of known lncRNA–disease associations is very small, which leads the bipartite graph represented by Boolean matrix of known lncRNA–disease associations to have sparsity. In order to reduce the influence from sparsity on prediction precision, the central similarities of Gaussian interaction profile were calculated in accordance with the description in Laarhoven’s work [53]. Therefore, the central similarities of Gaussian interaction profile between the diseases were denoted by whose element represents the central similarity of Gaussian interaction profile between disease di and dj with following definition: (6) where the ith column of matrix LD was denoted by LD(:,i) which represents all the known associations relating to disease di; The Gaussian kernel bandwidth here was denoted by γd with following definition in accordance to the previous study [54]: (7)

Similarly, the central similarities of Gaussian interaction profile between the lncRNAs were denoted by whose element represents the central similarity of Gaussian interaction profile between lncRNA li and lj with definition as follows: (8) where the ith row of matrix LD was denoted by LD(i,:) which represents all the known associations relating to lncRNA li; The Gaussian kernel bandwidth here was denoted by γl with following definition: (9)

Integrated similarity of lncRNAs and diseases.

The final similarity matrix of diseases denoted by comes from an integration of DD and DD(g), and the final similarity matrix of lncRNAs denoted by comes from an similar integration of LL and LL(g). When the original semantic similarity between disease di and dj was 0, the value of element in matrix DD(f) was set as the central similarity of the Gaussian interaction profile, otherwise it was set as the original semantic similarity between disease di and dj. The value of element in matrix LL(f) has a similar setting process as above. For clarity, the formalized acquirement for element values was defined as follows: (10) (11)

Obtain primary prediction scores

Construct adjacency matrix.

Based on KATZ measurement, the number of walks that connect lncRNA nodes and disease nodes in the original bipartite graph were calculated to measure the similarities between these nodes as the potential association probabilities. The different lengths of walks between lncRNA nodes and disease nodes contributed differently to the similarities between these two kinds of nodes. The shorter length of walks contributed more to the similarities than the longer one. To make full use of the heterogeneous network constructed above, matrix DD(f), LL(f) and LD were integrated into a new heterogeneous network A(nl+nd)×(nl+nd) as the adjacency matrix with definition as follows: (12)

Calculate primary prediction score on KAZT measurement.

By applying KATZ measurement, potential association probabilities between node li and node dj could be calculated as follows with denotation of : (13) where β is a non-negative coefficient to control the contribution of lengths coming from walks on the similarities between any two nodes, such as li and dj, βw raised to the power of w, denotes the number of paths whose length of walks equals w between corresponding nodes pair, such as li and dj, m denotes the maximum value of the length of walks.

Because bigger value of the length of walks contributes less to the similarities between two nodes, the above formula for similarity calculation could be approximately described in matrix when the value of m tends to be infinity (m→∞): (14) where the value of coefficient β was set in range of (0,min{1,1/‖A2}), matrix SKATZ has the same size as adjacency matrix A.

Submatrix SKATZ[1:nl,nl+1:nl+nd] denotes the elements that located at the rows 1 to nl and the columns nl+1 to nl+nd in matrix SKATZ, which has the same location as matrix LD in adjacency matrix A. In order to express in a consistent way, submatrix SKATZ[1:nl,nl+1:nl+nd] was denoted by matrix to represent the primary prediction results in the first stage.

Refine primary prediction scores

In order to improve the prediction performance of the proposed model, matrix space projection was used to refine the primary prediction scores obtained in the first stage ().

Project on lncRNA space.

Project the final similarity matrix of lncRNAs (LL(f)) on the matrix of primary prediction scores (LD(p)) to obtain the projection scores on the lncRNA space, which were denoted by with detailed definition as follows: (15) where denotes the predicted score of the association between lncRNA li and disease dj with lncRNA space projection, ‖LD(p)(:,j)‖ is the 2-norm of vector LD(p)(:,j).

Project on disease space.

Similarly, project the final similarity matrix of diseases (DD(f)) on the matrix of primary prediction scores (LD(p)) to obtain the projection scores on the disease space, which were denoted by with detailed definition as follows: (16) where (LD(p)(i,:))T denotes the transpose of vector LD(p)(i,:), and ‖LD(p)(i,:)‖ is the 2-norm of vector LD(p)(i,:).

Integrate space projection scores.

In order to fully capture the information of disease similarity, lncRNA similarity, and known lncRNA–disease associations, we integrated the projection scores on lncRNA space () and the projection scores on disease space () to obtain the final prediction scores () with detailed definition as follows: (17)

Represent workflow model

With the related data preparation, the inferring process with each key step of KATZSP for lncRNA-disease associations was graphically reprensented in Fig 9.

Conclusions

In recent years, even though many computational models for inferring lncRNA–disease associations have emerged, those computational methods still have some limitations that motivated us to propose a new model (KATZSP) to infer lncRNA–disease associations. The main contribution of KATZSP is composed of: only needing one attenuation factor β to control the contribution of walk lengths between any two nodes in bipartite graphs; making up the sparsity with simply integrating KATZ measurement and space projection; no needing negative samples; being able to be applied to isolated diseases and new lncRNAs directly. Compared with some state-of-the-art methods in similar type (NCPLDA, LDAI-ISPS and IIRWR), our model KATZSP achieved higher prediction accuracy on all three datasets (dataset 1, dataset 2 and dataset 3). The results from case study further confirmed the stronger predictive performance of KATZSP to be applied for real cases. Our KATZSP still has following limitations that need to be improved in future: further reducing the biases that the predicted results prefer the data with more known associations; the prediction accuracy needing to be enhanced further with fusion of more heterogeneous data.

Supporting information

S1 File. We have released our code publicly at the address of https://github.com/zywait/KATZSP.

In the public repository released includes our minimal underlying datasets (data352.mat, data621.mat, data1695.mat).

https://doi.org/10.1371/journal.pone.0260329.s001

(ZIP)

Acknowledgments

The authors thank the anonymous reviewers for suggestions that helped improve the paper substantially.

References

  1. 1. Fatica A., Bozzoni I., Long non-coding RNAs: new players in cell differentiation and development, Nature Reviews Genetics, 15 (2014) 7–21. pmid:24296535
  2. 2. Chen X., Yan C.C., Zhang X., You Z.-H., Long non-coding RNAs and complex diseases: from experimental results to computational models, Brief Bioinform, 18 (2017) 558–576. pmid:27345524
  3. 3. Xue X., Yang Y.A., Zhang A., Fong K., Kim J., Song B., et al., LncRNA HOTAIR enhances ER signaling and confers tamoxifen resistance in breast cancer, Oncogene, 35 (2016) 2746–2755. pmid:26364613
  4. 4. Gutschner T., Hämmerle M., Eißmann M., Hsu J., Kim Y., Hung G., et al., The noncoding RNA MALAT1 is a critical regulator of the metastasis phenotype of lung cancer cells, Cancer Research, 73 (2013) 1180–1189. pmid:23243023
  5. 5. Dai X., Zhang S., Zaleta-Rivera K., RNA: interactions drive functionalities, Mol Biol Rep, 47 (2020) 1413–1434. pmid:31838657
  6. 6. Hu H., Zhang L., Ai H., Zhang H., Fan Y., Zhao Q., et al., HLPI-ensemble: prediction of human lncRNA-protein interactions based on ensemble strategy, Rna Biology, 15 (2018) 797–806. pmid:29583068
  7. 7. Wang C.-C., Han C.-D., Zhao Q., Chen X., Circular RNAs and complex diseases: from experimental results to computational models, Brief Bioinform, (2021), pmid:34329377
  8. 8. Liu W., Jiang Y., Peng L., Sun X., Gan W., Zhao Q., et al., Inferring Gene Regulatory Networks Using the Improved Markov Blanket Discovery Algorithm, Interdisciplinary Sciences: Computational Life Sciences, (2021), pmid:34495484
  9. 9. Chen X., Sun Y.-Z., Guan N.-N., Qu J., Huang Z.-A., Zhu Z.-X., et al., Computational models for lncRNA function prediction and functional similarity calculation, Briefings in Functional Genomics, 18 (2019) 58–82. pmid:30247501
  10. 10. Zhang L., Yang P., Feng H., Zhao Q., Liu H., Using network distance analysis to predict lncRNA–miRNA interactions, Interdisciplinary Sciences: Computational Life Sciences, 13 (2021) 535–545. pmid:34232474
  11. 11. Quek X.C., Thomson D.W., Maag J.L., Bartonicek N., Signal B., Clark M.B., et al., lncRNAdb v2. 0: expanding the reference database for functional long noncoding RNAs, Nucleic Acids Res, 43 (2014) D168–D173. pmid:25332394
  12. 12. Geng C., Ziyun W., Dongqing W., Chengxiang Q., Mingxi L., Xing C., et al., LncRNADisease: a database for long-non-coding RNA-associated diseases, Nucleic Acids Res, 41 (2013) D983–D986. pmid:23175614
  13. 13. Dinger M.E., Pang K.C., Mercer T.R., Crowe M.L., Grimmond S.M., Mattick J.S., NRED: a database of long noncoding RNA expression, Nucleic Acids Res, 37 (2008) D122–D126. pmid:18829717
  14. 14. Bu D., Yu K., Sun S., Xie C., Skogerbø G., Miao R., et al., NONCODE v3. 0: integrative annotation of long noncoding RNAs, Nucleic Acids Res, 40 (2011) D210–D215. pmid:22135294
  15. 15. Chen X., Predicting lncRNA-disease associations and constructing lncRNA functional similarity network based on the information of miRNA, Sci Rep, 5 (2015) 13186. pmid:26278472
  16. 16. Chen X., Xie D., Zhao Q., You Z.-H., MicroRNAs and complex diseases: from experimental results to computational models, Brief Bioinform, 20 (2019) 515–539. pmid:29045685
  17. 17. Chen X., Wang L., Qu J., Guan N.-N., Li J.-Q., Predicting miRNA–disease association based on inductive matrix completion, Bioinformatics, 34 (2018) 4256–4265. pmid:29939227
  18. 18. Chen X., Zhu C.-C., Yin J., Ensemble of decision tree reveals potential miRNA-disease associations, PLoS Comput Biol, 15 (2019) e1007209. pmid:31329575
  19. 19. Chen X., Huang L., LRSSLMDA: Laplacian regularized sparse subspace learning for MiRNA-disease association prediction, PLoS Comput Biol, 13 (2017) e1005912. pmid:29253885
  20. 20. Chen X., Yin J., Qu J., Huang L., MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction, PLoS Comput Biol, 14 (2018) e1006418. pmid:30142158
  21. 21. Huang Y.-A., Chen X., You Z.-H., Huang D.-S., Chan K.C., ILNCSIM: improved lncRNA functional similarity calculation model, Oncotarget, 7 (2016) 25902–25914. pmid:27028993
  22. 22. Zhao T., Xu J., Liu L., Bai J., Xu C., Xiao Y., et al., Identification of cancer-related lncRNAs through integrating genome, regulome and transcriptome features, Mol Biosyst, 11 (2015) 126–136. pmid:25354589
  23. 23. Yu J., Ping P., Wang L., Kuang L., Li X., Wu Z., A Novel Probability Model for LncRNA–Disease Association Prediction Based on the Naïve Bayesian Classifier, Genes, 9 (2018) 345. pmid:29986541
  24. 24. Chen Q., Lai D., Lan W., Wu X., Chen B., Chen Y.-P.P., et al., ILDMSF: inferring associations between long non-coding RNA and disease based on multi-similarity fusion, IEEE/ACM Transactions on Computational Biology, 18 (2019) 1106–1112.
  25. 25. Lan W., Li M., Zhao K., Liu J., Wu F.-X., Pan Y., et al., LDAP: a web server for lncRNA-disease association prediction, Bioinformatics, 33 (2016) 458–460.
  26. 26. Li W., Wang S., Xu J., Mao G., Tian G., Yang J., Inferring latent disease-lncRNA associations by faster matrix completion on a heterogeneous network, Frontiers in genetics, 10 (2019) 769. pmid:31572428
  27. 27. Lu C., Yang M., Luo F., Wu F.-X., Li M., Pan Y., et al., Prediction of lncRNA–disease associations based on inductive matrix completion, Bioinformatics, 34 (2018) 3357–3364. pmid:29718113
  28. 28. Liu J.-X., Cui Z., Gao Y.-L., Kong X.-Z., WGRCMF: A weighted graph regularized collaborative matrix factorization method for predicting novel LncRNA-disease associations, IEEE journal of biomedical, 25 (2020) 257–265.
  29. 29. Fu G., Wang J., Yu G., Domeniconi C., Matrix factorization-based data fusion for the prediction of lncRNA–disease associations, Bioinformatics, 34 (2017) 1529–1537.
  30. 30. Liu H., Ren G., Chen H., Liu Q., Yang Y., Zhao Q., Predicting lncRNA–miRNA interactions based on logistic matrix factorization with neighborhood regularized, Knowledge-Based Systems, 191 (2020) 105261.
  31. 31. Zhao Q., Yu H., Ming Z., Hu H., Ren G., Liu H., The bipartite network projection-recommended algorithm for predicting long non-coding RNA-protein interactions, Molecular Therapy-Nucleic Acids, 13 (2018) 464–471. pmid:30388620
  32. 32. Ping P., Wang L., Kuang L., Ye S., Iqbal M.F.B., Pei T., A novel method for lncRNA-disease association prediction based on an lncRNA-disease association network, IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), 16 (2018) 688–693.
  33. 33. Zhou M., Wang X., Li J., Hao D., Wang Z., Shi H., et al., Prioritizing candidate disease-related long non-coding RNAs by walking on the heterogeneous lncRNA and disease network, Mol Biosyst, 11 (2015) 760–769. pmid:25502053
  34. 34. Yu G., Fu G., Lu C., Ren Y., Wang J., BRWLDA: bi-random walks for predicting lncRNA-disease associations, Oncotarget, 8 (2017) 60429. pmid:28947982
  35. 35. Chen X., You Z.-H., Yan G.-Y., Gong D.-W., IRWRLDA: improved random walk with restart for lncRNA-disease association prediction, Oncotarget, 7 (2016) 57919–57931. pmid:27517318
  36. 36. Chen X., KATZLDA: KATZ measure for the lncRNA-disease association prediction, Sci Rep, 5 (2015) 16840. pmid:26577439
  37. 37. Zhang Z., Zhang J., Fan C., Tang Y., Deng L., KATZLGO: large-scale prediction of LncRNA functions by using the KATZ measure based on multiple networks, IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), 16 (2017) 407–416. pmid:28534780
  38. 38. Liu Y., Feng X., Zhao H., Xuan Z., Wang L., A novel network-based computational model for prediction of potential LncRNA–disease association, International journal of molecular sciences, 20 (2019) 1549.
  39. 39. Zhang J., Zhang Z., Chen Z., Deng L., Integrating multiple heterogeneous networks for novel lncRNA-disease association inference, IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), 16 (2017) 396–406. pmid:28489543
  40. 40. Xuan P., Sheng N., Zhang T., Liu Y., Guo Y., CNNDLP: a method based on convolutional autoencoder and convolutional neural network with adjacent edge attention for predicting lncRNA–disease associations, International journal of molecular sciences, 20 (2019) 4260. pmid:31480319
  41. 41. Sheng N., Cui H., Zhang T., Xuan P., Attentional multi-level representation encoding based on convolutional and variance autoencoders for lncRNA–disease association prediction, Brief Bioinform, 22 (2021) bbaa067. pmid:32444875
  42. 42. Xuan P., Jia L., Zhang T., Sheng N., Li X., Li J., LDAPred: a method based on information flow propagation and a convolutional neural network for the prediction of disease-associated lncRNAs, International journal of molecular sciences, 20 (2019) 4458. pmid:31510011
  43. 43. Xuan P., Cao Y., Zhang T., Kong R., Zhang Z., Dual convolutional neural networks with attention mechanisms based method for predicting disease-related lncRNA genes, Frontiers in genetics, 10 (2019) 416. pmid:31130990
  44. 44. Zou Q., Li J., Hong Q., Lin Z., Wu Y., Shi H., et al., Prediction of MicroRNA-Disease Associations Based on Social Network Analysis Methods, Biomed Res Int, 2015 (2015) 810514. pmid:26273645
  45. 45. Zhang Y., Chen M., Li A., Cheng X., Jin H., Liu Y., LDAI-ISPS: LncRNA–disease associations inference based on integrated space projection scores, International journal of molecular sciences, 21 (2020) 1508. pmid:32098405
  46. 46. Chen M., Peng Y., Li A., Deng Y., Li Z., A novel lncRNA-disease association prediction model using Laplacian regularized least squares and space projection-federated method, IEEE Access, 8 (2020) 111614–111625.
  47. 47. Li G., Luo J., Liang C., Xiao Q., Ding P., Zhang Y., Prediction of LncRNA-Disease Associations Based on Network Consistency Projection, IEEE Access, 7 (2019) 58849–58856.
  48. 48. Zhang Y., Chen M., Li A., Cheng X., Jin H., Liu Y., LDAI-ISPS: LncRNA–Disease Associations Inference Based on Integrated Space Projection Scores, International journal of molecular sciences, 21 (2020) 1508. pmid:32098405
  49. 49. Wang L., Xiao Y., Li J., Feng X., Li Q., Yang J., IIRWR: Internal Inclined Random Walk With Restart for LncRNA-Disease Association Prediction, IEEE Access, 7 (2019) 54034–54041.
  50. 50. Bin J., Nie S., Tang Z., Kang A., Fu Z., Hu Y., et al., Long noncoding RNA EPB41L4A‐AS1 functions as an oncogene by regulating the Rho/ROCK pathway in colorectal cancer, Journal of cellular physiology, 236 (2021) 523–535. pmid:32557646
  51. 51. Wang D., Wang J., Lu M., Song F., Cui Q., Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases, Bioinformatics, 26 (2010) 1644–1650. pmid:20439255
  52. 52. Jie S., Hongbo S., Zhenzhen W., Changjian Z., Lin L., Letian W., et al., Inferring novel lncRNA-disease associations based on a random walk model of a lncRNA functional similarity network, Mol Biosyst, 10 (2014) 2074–2081. pmid:24850297
  53. 53. van Laarhoven T., Nabuurs S.B., Marchiori E., Gaussian interaction profile kernels for predicting drug–target interaction, Bioinformatics, 27 (2011) 3036–3043. pmid:21893517
  54. 54. Chen X., Huang Y.-A., Wang X.-S., You Z.-H., Chan K.C., FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model, Oncotarget, 7 (2016) 45948–45958. pmid:27322210