LLCLPLDA: a novel model for predicting lncRNA–disease associations
Long noncoding RNAs (lncRNAs) play a significant role in the occurrence of diseases, so predicting lncRNA–disease associations has attracted growing attention. Researchers hope to find effective treatments by revealing how diseases arise and develop at the molecular level. However, verifying lncRNA–disease associations through traditional biological experiments is time-consuming and expensive. We therefore developed a method called LLCLPLDA to predict potential lncRNA–disease associations. First, locality-constrained linear coding (LLC) is used to project the features of lncRNAs and diseases onto locality-constrained features. Then, a label propagation (LP) strategy combines the initial association matrix with the resulting lncRNA and disease features. To evaluate the method, we compared LLCLPLDA with five other methods under leave-one-out cross-validation and fivefold cross-validation; the experimental results show that LLCLPLDA outperforms all five. Additionally, we conducted case studies on three diseases: cervical cancer, gliomas, and breast cancer. The top five predicted lncRNAs for cervical cancer and for gliomas were verified, and four of the top five for breast cancer were also confirmed.
Keywords: Locality-constrained linear coding, Label propagation, lncRNA–disease associations, Prediction
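
The two-stage idea in the abstract (LLC features followed by label propagation) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so everything below is an assumption made for illustration. The sketch uses the widely known approximated LLC solution (code each sample over its k nearest neighbours under a sum-to-one constraint), clips negative codes and row-normalises them so that propagation stays bounded, and averages the lncRNA-side and disease-side propagation results. The function names (`llc_codes`, `row_stochastic`, `label_propagation`, `predict_scores`), the parameters `k`, `lam`, `alpha`, and `iters`, and the toy data are all hypothetical.

```python
import numpy as np

def llc_codes(X, k=3, lam=1e-4):
    """Approximated LLC: code each row of X over its k nearest other rows."""
    n = X.shape[0]
    W = np.zeros((n, n))
    for i in range(n):
        d = np.linalg.norm(X - X[i], axis=1)
        d[i] = np.inf                      # a sample may not code itself
        idx = np.argsort(d)[:k]            # k nearest neighbours as local bases
        Z = X[idx] - X[i]                  # shift the bases to the origin
        C = Z @ Z.T                        # local covariance, shape (k, k)
        C += (lam * np.trace(C) + 1e-12) * np.eye(k)  # regularise
        w = np.linalg.solve(C, np.ones(k))
        W[i, idx] = w / w.sum()            # sum-to-one constraint
    return W

def row_stochastic(W):
    """Assumption: clip negative codes and renormalise each row to sum to 1."""
    W = np.clip(W, 0.0, None)
    return W / W.sum(axis=1, keepdims=True)

def label_propagation(W, Y, alpha=0.5, iters=100):
    """Iterate F <- alpha * W F + (1 - alpha) * Y for a fixed number of steps."""
    F = Y.astype(float).copy()
    for _ in range(iters):
        F = alpha * (W @ F) + (1 - alpha) * Y
    return F

def predict_scores(A, X_lnc, X_dis, k=3, alpha=0.5):
    """Average lncRNA-side and disease-side propagation (an assumed fusion rule)."""
    W_l = row_stochastic(llc_codes(X_lnc, k))
    W_d = row_stochastic(llc_codes(X_dis, k))
    F_l = label_propagation(W_l, A, alpha)        # propagate over lncRNAs
    F_d = label_propagation(W_d, A.T, alpha).T    # propagate over diseases
    return (F_l + F_d) / 2

# Toy usage with random, purely illustrative data.
rng = np.random.default_rng(0)
X_lnc = rng.random((6, 4))                        # 6 lncRNAs, 4 features each
X_dis = rng.random((5, 4))                        # 5 diseases, 4 features each
A = (rng.random((6, 5)) > 0.7).astype(float)      # known associations (0/1)
scores = predict_scores(A, X_lnc, X_dis)
print(scores.shape)
```

Because the propagation matrices are non-negative with rows summing to one and `alpha` is below 1, the scores stay between 0 and 1 and the iteration converges. Candidate associations would then be ranked by score among the entries that are 0 in `A`.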
This work was supported by the National Natural Science Foundation of China (618002072, 61702112), the Natural Science Foundation of Guangdong Province (2018A030313389), the Science and Technology Plan Project of Guangdong Province (2017A040405050, 2016B030306004, 2016B030301008), and the Opening Project of the Guangdong Province Key Laboratory of Computational Science (2018012).
Compliance with ethical standards
Conflict of interest
All authors declare that they have no conflict of interest.
This article does not contain any studies with human participants or animals performed by any of the authors.