Data Preprocessing Techniques for Research Performance Analysis

  • Fatin Shahirah ZulkepliEmail author
  • Roliana Ibrahim
  • Faisal Saeed
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 555)


Business intelligence (BI) system mixes operational data with the analytical tools to represent descriptive and complicated data to groups of decision makers. BI aims to enhance the features and accuracy of data warehouse to the decision-making process and widely applied in industry. In order to achieve that, BI pulls and gathers information from multiple sources of information systems. Data from multiple sources tend to have flaws such as missing values, inconsistency data, and redundant data. Hence, this paper aims to show data preprocessing techniques used to produce clean and quality data for Universiti Teknologi Malaysia (UTM) research performance analysis. For this research study, required data were provided by UTM management level. In future, this study is expected to compare different data preprocessing techniques and recommend the best one for research performance analysis.


Business intelligence Data preprocessing Research performance 



This work is supported by the Malaysia Ministry of Higher Education (MOHE) and the Research Management Centre of Universiti Teknologi Malaysia under the Fundamental Research Grant Scheme (Vote No. R.J130000.7828.4F741).


  1. 1.
    Han, J. & Kamber, M., (2006). Data Mining: Concepts and Techniques Second., San Francisco, CA: Elsevier Inc.Google Scholar
  2. 2.
    Dhillon, S.K. Ibrahim, R. & Selamat, A., (2013). Strategy Identification For Sustainable Key Performance Indicators Delivery Process For Scholarly Publication and Citation. International Journal of Information Technology & Management. 3(3), pp. 103–113.Google Scholar
  3. 3.
    Negash, Solomon. “Business Intelligence.” The communications of the Association for Information Systems 13.1 (2004): 54.Google Scholar
  4. 4.
    Agrawal, Akshat, and Sushil Kumar. “Analysis of Multidimensional Modeling Related To Conceptual Level.” Analysis (2015): 119–123.Google Scholar
  5. 5.
    Baina, K., Tata, S., and Benali, K. A Model for Process Service Interaction. In Proceedings 1st Conference on Business Process Management (EindHoven, The Netherlands, 2003).Google Scholar
  6. 6.
    Horkoff, Barone, et al. “Strategic business modeling: representation and reasoning.” Software & Systems Modeling 13.3 (2014): 1015–1041.Google Scholar
  7. 7.
    Chou, J.-S. et al., (2014). Machine learning in concrete strength simulations: Multi-nation data analytics. Construction and Building Materials. 73, pp. 771–780.Google Scholar
  8. 8.
    Namdev, N. Agrawal, S. & Silkari, S., (2015). Recent Advancement in Machine Learning Based Internet Traffic Classification. Procedia Computer Science. 60, pp. 784–791.Google Scholar
  9. 9.
    Jared, D., (2014). Big Data, Data Mining, and Machine Learning: Value Creation for Business Leaders and Practitioners, Hoboken, New Jersey: John Wiley & Son, Inc.Google Scholar
  10. 10.
    Liu, B. (University of I., (2011). Data-Centric Systems and Applications Second. S. Ceri & M. J. Carey, eds., Heidelberg: Springer.Google Scholar
  11. 11.
    Therese D. Pigott. A Review of Methods for Missing Data (2001). Educational Research and Evaluation. Vol. 7, No. 4, pp. 353–383.Google Scholar
  12. 12.
    Chong, M., (2005). Traffic accident analysis using machine learning paradigms. Informatica. 29, pp. 89–98.Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2017

Authors and Affiliations

  • Fatin Shahirah Zulkepli
    • 1
    Email author
  • Roliana Ibrahim
    • 1
  • Faisal Saeed
    • 1
  1. 1.Information System Department, Faculty of ComputingUniversiti Teknologi MalaysiaJohor BahruMalaysia

Personalised recommendations