Research on Auto-Generating Test-Paper Model Based on Spatial-Temporal Clustering Analysis

  • Yuling Fan
  • Likai Dong
  • Xuesong SunEmail author
  • Dong Wang
  • Wang Qin
  • Cao Aizeng
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10955)


In the process of auto-generating test-paper, the category and the difficulty of the title plays a key role in the quality of generating test-paper. It will produce low quality questions and hard to popularize when used the methods of artificial generating test-paper and random generating test-paper, because considering less on the knowledge point classification and difficulty in the subject. To improve the quality of auto-generating test-paper, this paper takes the evaluation data of ACM Online Judge system as the research object. After normalization, (1) we can get three different results by the K-means clustering analysis based on the temporal and spatial characteristics of time variance and average time; (2) On the basis of clustering, the difficulty index of each topic of all the categories is calculated by using the number of submissions and the number of submissions to solve the problem. The ratio of the two is proportional to the difficulty of the problem. In this paper, the ratio of the two to determine the degree of difficulty index; (3) The Gaussian stochastic process is used to extract numbers of questions of each knowledge point, and calculated the difficulty index which were extracted to make sure they are within range to complete the auto-generating test-paper. In the experiment, we try to train and test the automatic test paper model by the number of professional problems (about 50000 data) in the C language test question of the university OJ system. The average difficulty index of the test paper was 0.4663, which meet the requirements, and the difficulty index of the title fit in with the normal distribution. Compared with the traditional generating test-paper method, the automatic test paper model is based on the difficulty and discrimination of the subject, and it can evaluate the level of tester scientifically. The experimental results show that the proposed automatic test model is simple and effective.


Spatial-temporal feature Clustering algorithm Difficult coefficient Auto-generating test paper 



This research was supported by Shandong Provincial Natural Science Foundation (No. ZR2018LF005), Industry-University Cooperative Education Project of Ministry of Education (No. 201601023018), the Scientific Research Fund of Jinan University (No. XKY1711, No. XKY1622, No. XBS1653) and Teaching Research Project of Jinan University (No. J1638).


  1. 1.
    Guo, C., Tian, F., Jin, X.: Data Mining Tutorial, pp. 107–121. Tsinghua University Press (2005)Google Scholar
  2. 2.
    Duda, R.O., Hart, P.E., Stork, D.G.: Model Classification, 2nd edn., pp. 11–12. Machinery Industry Press, Beijing (2003). (Li hongdong, yao tianxiang)Google Scholar
  3. 3.
    Zhou, A., Chen, B., Wang, Y.: Research and improvement of k-means algorithm. J. Comput. Res. Dev. 22(10), 101–104 (2012)Google Scholar
  4. 4.
    Sun, J., Liu, J., Zhao, L.: Research on clustering algorithm. J. Softw. 19, 48–61 (2008)CrossRefGoogle Scholar
  5. 5.
    Zhijie, Li, Yuanxiang, Li, Feng, Wang, Li, Kuang: Accelerated multi-task online learning algorithm for big data stream. J. Comput. Res. Dev. 52(11), 2545–2554 (2015)Google Scholar
  6. 6.
    Yang, Y., Jin, F., Kamel, M.: Evaluation of clustering effectiveness. Appl. Res. Comput. 25(6), 1630–1632 (2008)Google Scholar
  7. 7.
    Sun, J., Li, X.: The algorithm and design of the difficulty coefficient of the test in the classical test theory. China Sci. Technol. Inf. (19), 44–45 (2009)Google Scholar
  8. 8.
    Asifa, R., Merceronb, A., Abbas Alic, S., Haidera, N.G.: Analyzing undergraduate students’ performance using educational data mining. Comput. Educ. 113, 177–194 (2017)CrossRefGoogle Scholar
  9. 9.
    Chen, C., Wang, Y., Li, C., Zhang, Y., Xing, C.: The research and application of big data in the field of online education. J. Comput. Res. Dev. 51, 67–74 (2014)Google Scholar
  10. 10.
    Li, T., Wang, T.: Study and analysis technology research and application status review. China Educ. Technol. 8, 129–133 (2012)Google Scholar
  11. 11.
    Zong, Y., Zheng, Q., Zhang, X., Chen, L.: The research about difficulty coefficient of MOOCs’ formative tests in the perspective of learning analytics. J. Distance Educ. 3, 96–103 (2016)Google Scholar
  12. 12.
    Zhang, R., Zhou, Q., Hu, B., Jiang, J., Wen, S.: Research and implemnetation of test paper intellectual difficulty adaptively generating. Comput. Mod. 3, 8–10 (2012)Google Scholar
  13. 13.
    Tang, X., Qiu, G., Zhuang, L.: Clustering method for irregular and uncompact data a clustering method for the distribution of non-dense spatial distribution data. Comput. Sci. 36(3), 167–169 (2009)Google Scholar
  14. 14.
    Korhonen, P., Salo, S., Steuer, R.E.: A heuristic for estimating nadir criterion values in multiple objective linear programming. Oper. Res. 45(5), 751–757 (1997)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • Yuling Fan
    • 1
  • Likai Dong
    • 1
  • Xuesong Sun
    • 1
    Email author
  • Dong Wang
    • 1
    • 2
  • Wang Qin
    • 1
  • Cao Aizeng
    • 1
  1. 1.School of Information Science and EngineeringUniversity of JinanJinanChina
  2. 2.Shandong Provincial Key Laboratory of Network Based Intelligent ComputingUniversity of JinanJinanChina

Personalised recommendations