Skip to main content

Data Preprocessing Techniques for Research Performance Analysis

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 555))

Abstract

Business intelligence (BI) system mixes operational data with the analytical tools to represent descriptive and complicated data to groups of decision makers. BI aims to enhance the features and accuracy of data warehouse to the decision-making process and widely applied in industry. In order to achieve that, BI pulls and gathers information from multiple sources of information systems. Data from multiple sources tend to have flaws such as missing values, inconsistency data, and redundant data. Hence, this paper aims to show data preprocessing techniques used to produce clean and quality data for Universiti Teknologi Malaysia (UTM) research performance analysis. For this research study, required data were provided by UTM management level. In future, this study is expected to compare different data preprocessing techniques and recommend the best one for research performance analysis.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Han, J. & Kamber, M., (2006). Data Mining: Concepts and Techniques Second., San Francisco, CA: Elsevier Inc.

    Google Scholar 

  2. Dhillon, S.K. Ibrahim, R. & Selamat, A., (2013). Strategy Identification For Sustainable Key Performance Indicators Delivery Process For Scholarly Publication and Citation. International Journal of Information Technology & Management. 3(3), pp. 103–113.

    Google Scholar 

  3. Negash, Solomon. “Business Intelligence.” The communications of the Association for Information Systems 13.1 (2004): 54.

    Google Scholar 

  4. Agrawal, Akshat, and Sushil Kumar. “Analysis of Multidimensional Modeling Related To Conceptual Level.” Analysis (2015): 119–123.

    Google Scholar 

  5. Baina, K., Tata, S., and Benali, K. A Model for Process Service Interaction. In Proceedings 1st Conference on Business Process Management (EindHoven, The Netherlands, 2003).

    Google Scholar 

  6. Horkoff, Barone, et al. “Strategic business modeling: representation and reasoning.” Software & Systems Modeling 13.3 (2014): 1015–1041.

    Google Scholar 

  7. Chou, J.-S. et al., (2014). Machine learning in concrete strength simulations: Multi-nation data analytics. Construction and Building Materials. 73, pp. 771–780.

    Google Scholar 

  8. Namdev, N. Agrawal, S. & Silkari, S., (2015). Recent Advancement in Machine Learning Based Internet Traffic Classification. Procedia Computer Science. 60, pp. 784–791.

    Google Scholar 

  9. Jared, D., (2014). Big Data, Data Mining, and Machine Learning: Value Creation for Business Leaders and Practitioners, Hoboken, New Jersey: John Wiley & Son, Inc.

    Google Scholar 

  10. Liu, B. (University of I., (2011). Data-Centric Systems and Applications Second. S. Ceri & M. J. Carey, eds., Heidelberg: Springer.

    Google Scholar 

  11. Therese D. Pigott. A Review of Methods for Missing Data (2001). Educational Research and Evaluation. Vol. 7, No. 4, pp. 353–383.

    Google Scholar 

  12. Chong, M., (2005). Traffic accident analysis using machine learning paradigms. Informatica. 29, pp. 89–98.

    Google Scholar 

Download references

Acknowledgements

This work is supported by the Malaysia Ministry of Higher Education (MOHE) and the Research Management Centre of Universiti Teknologi Malaysia under the Fundamental Research Grant Scheme (Vote No. R.J130000.7828.4F741).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fatin Shahirah Zulkepli .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Zulkepli, F.S., Ibrahim, R., Saeed, F. (2017). Data Preprocessing Techniques for Research Performance Analysis. In: Patnaik, S., Popentiu-Vladicescu, F. (eds) Recent Developments in Intelligent Computing, Communication and Devices. Advances in Intelligent Systems and Computing, vol 555. Springer, Singapore. https://doi.org/10.1007/978-981-10-3779-5_20

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3779-5_20

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3778-8

  • Online ISBN: 978-981-10-3779-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics