Visualization analysis of big data research based on Citespace

  • Weihong Wang
  • Chang LuEmail author


In recent years, with the massive growth of data, the world today has entered the era of big data. Big data has brought tremendous value to all fields of today’s society, and it has also brought enormous challenges, which has attracted great attention from all walks of life. Analyze and forecast the research hotspots and future development trends in the field of big data, and understand the development changes and priorities in the field of big data research, which will play a significant role in promoting the development of social development and scientific research. In the era of big data, how to extract information from huge amounts of complex data and present complex information more clearly and clearly, the most effective way is to use visualization technology. The article uses the information visualization software Citespace to study the data related to big data in the Web of Science and CNKI database from 2008 to 2017 for 10 years, from macro to micro to the representative countries of the literature, keywords and co-cited documents. Through visualization analysis, the article clarifies the key research directions, key documents and hot spot frontiers in the field of big data research, forecasts the future development trends in this field, and compares the research situation at home and abroad, in order to provide readers and other researchers with certain reference and help.


Big data Citespace Visualization analysis Knowledge maps 



This paper is funded by the National Natural Science Special Fund Project (61340058) and the Zhejiang Provincial Natural Science Fund Key Project (LZ14F020001).

Compliance with ethical standards

Conflict of interest

All Authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Informed consent was obtained from all individual participants included in the study.


  1. Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z (2007) Dbpedia: a nucleus for a web of open data. The semantic web. Springer, Berlin, pp 722–735CrossRefGoogle Scholar
  2. Boyd D, Crawford K (2012) Critical questions for big data. Inf Commun Soc 15:1–18CrossRefGoogle Scholar
  3. Brandes U (2001) A faster algorithm for betweenness centrality. J Math Sociol 25(2):163–177CrossRefGoogle Scholar
  4. Chen CLP, Zhang CY (2014) Data-intensive applications, challenges, techniques and technologies: A survey on big data. Inf Sci 275(11):314–347CrossRefGoogle Scholar
  5. Chen K, Zheng W (2010) Cloud computing: system instances and current research: cloud computing: system instances and current research. J Softw 20:1337–1348CrossRefGoogle Scholar
  6. Chen Y, Liu Z, Chen J, Hou J (2008) History and theory of mapping knowledge domains. Stud Sci Sci 26(3):449–460Google Scholar
  7. Chen H, Chiang R, Storey V (2012) Business intelligence and analytics: from big data to big impact. MIS Q 36:1165–1188CrossRefGoogle Scholar
  8. Chen M, Mao S, Liu Y (2014) Big data: a survey. Mob Netw Appl 19(2):171–209CrossRefGoogle Scholar
  9. Chen X, Chen B, Zhang C, Hao T (2017a) Discovering the recent research in natural language processing field based on a statistical approach. Springer, Berlin, pp 507–517Google Scholar
  10. Chen X, Weng H, Hao T (2017b) A data-driven approach for discovering the recent research status of diabetes in china. Health information science. Springer International Publishing, Cham, pp 89–101CrossRefGoogle Scholar
  11. Cheng X, Jin X, Wang Y, Guo J, Zhang T, Li G (2014) Survey on big data system and analytic technology. J Softw 9:1889–1908CrossRefGoogle Scholar
  12. Dai S, Dong J, Xue J (2014) Visualization analysis and application of the big data in scientific computing. Eng Eng Interdiscip Perspect 6(3):275–281Google Scholar
  13. Danasingh AA, Tamizhpoonguil B, Epiphany JL (2016) A survey on big data and cloud computing. Int J Recent Innov Trends Comput Commun 4:273–277Google Scholar
  14. Dean J, Ghemawat S (2008) Mapreduce: simplified data processing on large clusters. Commun ACM 51(1):107–113CrossRefGoogle Scholar
  15. Fu H, Ho Y, Sui Y, Li Z (2010) A bibliometric analysis of solid waste research during the period 1993–2008. Waste Manag (New York) 30:2410–2417CrossRefGoogle Scholar
  16. Gantz J, Reinsei D (2011) Extracting value from chaos. IDC iview 1142(2011):1–12Google Scholar
  17. Guan S, Meng X, Li Z, Liu Y (2015) Big data study on the current situation, problems and countermeasures. J Intell 5:98–104Google Scholar
  18. Hao T, Chen X, Li G, Yan J (2018) A bibliometric analysis of text mining in medical research. Soft Comput 22(23):7875–7892CrossRefGoogle Scholar
  19. Hou J, Hu Z (2019) Review on the application of Citespace at home and Abroad. J Mod Inf 33:99–103Google Scholar
  20. Lazer D, Kennedy R, King G, Vespignani A (2014) The parable of Google Flu: traps in big data analysis. Science 343(6176):1203CrossRefGoogle Scholar
  21. Li G, Cheng X (2012) Research status and scientific thinking of big data. Bull Chin Acad Sci 6:647–657Google Scholar
  22. Li Z, Nie F, Chang X, Yang Y (2017) Beyond trace ratio: weighted harmonic mean of trace ratios for multiclass discriminant analysis. IEEE Trans Knowl Data Eng 29(10):2100–2110CrossRefGoogle Scholar
  23. Liu H, Morstatter F, Tang J, Zafarani R (2016a) The good, the bad, and the ugly: uncovering novel research opportunities in social media mining. Int J Data Sci Anal 1:137–143CrossRefGoogle Scholar
  24. Liu Q, Li Y, Duan H, Liu Y, Qin Z (2016b) Knowledge graph construction techniques. J Comput Res Dev 53(3):582–600Google Scholar
  25. Luo M, Chang X, Yang Y, Nie L, Hauptmann A, Zheng Q (2017) Simple to complex cross-modal learning to rank. Comput Vis Image Underst 163:67–77CrossRefGoogle Scholar
  26. Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Hung-Byers A (2011) Big data: the next frontier for innovation, competition, and productivity. McKinsey Global Institute, Available at:
  27. Marcos S, Garcia-Penalvo F (2018) Information retrieval methodology for aiding scientific database search. Soft Comput 10:1–10Google Scholar
  28. Mayerschonberger V, Cukier K (2014) Big data: a revolution that will transform how we live, work, and think. Math Comput Educ 47(17):181–183Google Scholar
  29. McAfee A, Brynjolfsson E (2012) Big data: the management revolution. Harv Bus Rev 90:60–68Google Scholar
  30. Meng X, Ci X (2013) Big data management: concepts, techniques and challenges. J Comput Res Dev 1:146–169Google Scholar
  31. Price D (1965) Networks of scientific papers. Science 149(3683):510–515CrossRefGoogle Scholar
  32. Qin C, Hou H (2009) Mapping knowledge domain—a new field of information management and knowledge management. J Acad Libr 1:30–37Google Scholar
  33. Science (2011) A special issue of science: dealing with data. Sci Technol Appl 2(1):93–94Google Scholar
  34. Shi Y, Meng X (2014) A survey of query techniques in cloud data management systems. Chin J Comput 36:209–225CrossRefGoogle Scholar
  35. Shneider AM (2009) Four stages of a scientific discipline; four types of scientist. Trends Biochem Sci 34(5):217–223CrossRefGoogle Scholar
  36. Tan H, Gao Y (2017) Regularized constraint subspace based method for image set classification. IEEE Access 5:15001–15012CrossRefGoogle Scholar
  37. Tan H, Gao Y, Ma Z (2018) Regularized constraint subspace based method for image set classification. Pattern Recogn 76:434–448CrossRefGoogle Scholar
  38. Taylor R (2010) An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics. BMC Bioinf 11(Suppl 12):S1CrossRefGoogle Scholar
  39. Wang S, Wang H, Tan X (2011) Architecting big data: challenges, studies and forecasts. Chin J Comput 34(10):1741–1752CrossRefGoogle Scholar
  40. Wang Y, Jin X, Cheng X (2013) Network big data: present and future. Chin J Comput 36(6):1125–1138CrossRefGoogle Scholar
  41. Wei L, Zhao Y (2015) Bibliometric analysis of global environmental assessment research in a 20-year period. Environ Impact Assess Rev 50:158–166CrossRefGoogle Scholar
  42. Wu X, Zhu X, Wu G, Wei D (2013) Data mining with big data. IEEE Trans Knowl Data Eng 26(1):97–107Google Scholar
  43. Wu Y, Yang F, Lai G, Lin K (2016) Research progress of knowledge graph learning and reasoning. J Chin Comput Syst 37(9):2007–2013Google Scholar
  44. Yang L, Wei X (2011) Visualization research in foreign social network analysis based on mapping knowledge domain. Inf Sci 29:1041–1048Google Scholar
  45. Zhang Y, Chen M, Liao X (2013) Big data applications: a survey. J Comput Res Dev 50(z2):216–233Google Scholar
  46. Zhang S, Yang Z, Xing X, Gao Y, Xie D, Wong HS (2017) Generalized pair-counting similarity measures for clustering and cluster ensembles. IEEE Access 5:16904–16918CrossRefGoogle Scholar
  47. Zheng L (2013) Stride into the era of “big data”. Inf Constr 1(2011):10–13Google Scholar

Copyright information

© Springer-Verlag GmbH Germany, part of Springer Nature 2019

Authors and Affiliations

  1. 1.School of Computer Science and TechnologyZhejiang University of TechnologyHangzhouChina

Personalised recommendations