Abstract
This paper focuses on the use of knowledge management techniques to help organisations tap into the power of statistical learning when conducting analytics. Its main contribution is in the use of an ontology development process to derive the essential concepts required for an ontology to represent variables of interest and their interrelationships with each other and with statistical datasets. This ontology is developed with the help of two case studies in the area of digital marketing and commodity pricing. A number of competency questions have been designed to map to user requirements in both case studies. A prototype system has been developed using a semantic modelling tool and a semantic data repository to demonstrate that the proposed ontology can support the competency questions via semantic queries.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
The Computational Social Science Society of the Americas. https://computationalsocialscience.org
Gilbert, N. (ed.): Computational Social Science, vol. 21. Sage, Thousand Oaks (2010)
Dhar, V.: Data science and prediction. Commun. ACM 56(12), 64–73 (2013)
Schlegal, K., Linden, A.: Predicts 2017: Analytics Strategy and Technology. Gartner, Stamford (2016)
Nural, M.V., Cotterell, M.E., Miller, J.A.: Using semantics in predictive big data analytics. In: 2015 IEEE International Congress on Big Data (BigData Congress), pp. 254–261. IEEE, June 2015
Labrinidis, A., Jagadish, H.V.: Challenges and opportunities with big data. Proc. VLDB Endow. 5(12), 2032–2033 (2012)
Vapnik, V.N.: An overview of statistical learning theory. IEEE Trans. Neural Networks 10(5), 988–999 (1999)
Migon, H.S., Gamerman, D., Louzada, F.: Statistical Inference: An Integrated Approach. CRC Press, Boca Raton (2014)
James, G., Witten, D., Hastie, T., Tibshirani, R.: An Introduction to Statistical Learning, vol. 112. Springer, New York (2013)
Harper, K.E., Dagnino, A.: Agile software architecture in advanced data analytics. In: 2014 IEEE/IFIP Conference on Software Architecture (WICSA), pp. 243–246. IEEE, April 2014
Yao, L., Rabhi, F.A.: Building architectures for data-intensive science using the ADAGE framework. Concurr. Comput. Pract. Exp. 27(5), 1188–1206 (2015)
Behnaz, A., Rabhi, F., Peat, M.: A software architecture for enabling statistical learning on big data. In: Rojas, I., Pomares, H., Valenzuela, O. (eds.) ITISE 2016. CS, pp. 343–357. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-55789-2_24
Withers, D., Kawas, E., McCarthy, L., Vandervalk, B., Wilkinson, M.: Semantically-guided workflow construction in Taverna: the SADI and BioMoby plug-ins. In: Margaria, T., Steffen, B. (eds.) ISoLA 2010. LNCS, vol. 6415, pp. 301–312. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-16558-0_26
Miller, J.A., Han, J., Hybinette, M.: Using domain specific language for modeling and simulation: scalation as a case study. In: Proceedings of the Winter Simulation Conference, pp. 741–752, December 2010
Panov, P., Džeroski, S., Soldatova, L.: OntoDM: an ontology of data mining. In: IEEE International Conference on Data Mining Workshops, ICDMW 2008, pp. 752–760. IEEE, December 2008
Lin, M.S., Zhang, H., Yu, Z.G.: An ontology for supporting data mining process. In: IMACS Multiconference on Computational Engineering in Systems Applications, vol. 2, pp. 2074–2077. IEEE, October 2006
Espinosa, R., GarcÃa-Saiz, D., Zorrilla, M.E., Zubcoff, J.J., Mazón, J.N.: Development of a knowledge base for enabling non-expert users to apply data mining algorithms. In: SIMPDA, pp. 46–61, August 2013
Behnaz, A., Natarajan, A., Rabhi, Fethi A., Peat, M.: A semantic-based analytics architecture and its application to commodity pricing. In: Feuerriegel, S., Neumann, D. (eds.) FinanceCom 2016. LNBIP, vol. 276, pp. 17–31. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-52764-2_2
Shah, T.M.: Designing and conceptualising ontology patterns for modelling cross-domain health information. Ph.D. thesis, University of New South Wales (2016)
Suárez-Figueroa, M.C.: NeOn methodology for building ontology networks: specification, scheduling and reuse. Doctoral thesis, Artificial Intelligence, Universidad Politécnica De Madrid (2010)
Yang, S., Lin, S., Carlson, J.R., Ross Jr., W.T.: Brand engagement on social media: will firms’ social media efforts influence search engine advertising effectiveness? J. Mark. Manage. 32(5–6), 526–557 (2016)
Info Package for UNSW Data Science Hackathon. http://www.cse.unsw.edu.au/~fethir/HackathonInfo/HackathonStudentPack_v7.pdf
Acknowledgements
We are grateful to Capsifi and Ignition Wealth, especially Terry Roach, Mark Fordree and Mike Giles for sponsoring the research which led to this paper. We are also grateful to Adnene Guabtni and Chedia Dhaoui for helping with digital marketing case study. We thank Gino Conte on the visualization development of the prototype application.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Behnaz, A., Bandara, M., Rabhi, F.A., Peat, M. (2019). A Statistical Learning Ontology for Managing Analytics Knowledge. In: Mehandjiev, N., Saadouni, B. (eds) Enterprise Applications, Markets and Services in the Finance Industry. FinanceCom 2018. Lecture Notes in Business Information Processing, vol 345. Springer, Cham. https://doi.org/10.1007/978-3-030-19037-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-19037-8_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19036-1
Online ISBN: 978-3-030-19037-8
eBook Packages: Computer ScienceComputer Science (R0)