Discovering Emerging Research Topics Based on SPO Predications
With the rapid growth of scientific literatures, it is very important to discover the implicit knowledge from the vast information accurately and efficiently. To achieve this goal, we propose a percolation approach to discovering emerging research topics by combining text mining and scientometrics methods based on Subject-Predication-Object (SPO) predications, which consist of a subject argument, an object argument, and the relation that binds them. Firstly, SPO predications are extracted and cleaned from content of literatures to construct SPO semantic networks. Then, community detection is conducted in the SPO semantic networks. Afterwards, two indicators of Research Topic Age (RTA) and Research Topic Authors Number (RTAN) combined by hypervolume-based selection algorithm (HBS) are chosen to identify potential emerging research topics from communities. Finally, scientific literatures of stem cells are selected as a case study, and the result indicates that the approach can effectively and accurately discover the emerging research topics.
KeywordsEmerging research topics Subject-Predication-Object Community detection Hypervolume-based selection Stem cell
The work in this paper was supported by the Informationization Special Project of Chinese Academy of Sciences “E-Science Application for Knowledge Discovery in Stem Cells” (Grant No: XXH13506-203) and the Fundamental Research Funds for the Central Universities (Grant No. A0920502051815-69).
- 1.Swanson, D.R.: Medical literature as a potential source of new knowledge. Bull. Med. Libr. Assoc. 78(1), 29–37 (1990)Google Scholar
- 7.Gong, X., Jiang, J., Duan, Z., Lu, H.: A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology. BMC Bioinform. 19(4), 111–119 (2018)Google Scholar
- 10.Zhao, M., Zhang, S., Li, W., Chen, G.: Matching biomedical ontologies based on formal concept analysis. J. Biomed. Semant. 9(11), 1–27 (2018)Google Scholar
- 12.Fiszman, M., Rindflesch, T.C., Kilicoglu, H.: Abstraction summarization for managing the biomedical research literature. In: Proceedings of the HLT-NAACL Workshop on Computational Lexical Semantics, pp. 76–83 (2004)Google Scholar
- 13.Hu, Z.-Y., Zeng, R.-Q., Qin, X.-C., Wei, L., Zhang, Z.: A method of biomedical knowledge discovery by literature mining based on SPO predications: a case study of induced pluripotent stem cells. In: Perner, P. (ed.) MLDM 2018. LNCS (LNAI), vol. 10935, pp. 383–393. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-96133-0_29CrossRefGoogle Scholar
- 16.Xu, X.Y., Zheng, Y.N., Liu, Z.H.: Study on the method of identifying research fronts based on scientific papers and patents. Libr. Inf. Serv. 60(24), 97–106 (2016)Google Scholar
- 18.Wei, L., Hu, Z.Y., Pang, H.S., et al.: Study on knowledge discovery in biomedical literature based on SPO predications: a case study of induced pluripotent stem cells. Digit. Libr. Forum 9, 28–34 (2017)Google Scholar