Abstract
With this paper we aim to contribute to the proper handling of veracity, which is generally recognized as one of the main problems related to ‘Big’ data. Veracity refers to the extent to which the used data adequately reflect real world information and hence can be trusted. More specifically we describe a novel computational intelligence technique for handling veracity aspects of nominal data, which are often encountered when users have to select one or more items from a list. First, we discuss the use of fuzzy sets for modelling nominal data and specifying search criteria on nominal data. Second, we introduce the novel concept of a multipolar satisfaction degree as a tool to handle criteria evaluation. Third, we discuss aggregation of multipolar satisfaction degrees. Finally, we demonstrate the proposed technique and discuss its benefits using a film genre example.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bronselaer, A., De Mol, R., De Tré, G.: A measure-theoretic foundation for data quality. IEEE Trans. Fuzzy Syst. 26(2), 627–639 (2018)
De Mauro, A., Greco, M., Grimaldi, M.: A formal definition of big data based on its essential features. Library Rev. 65, 122–135 (2016)
De Tré, G., Zadrożny, S., Matthé, T., Kacprzyk, J., Bronselaer, A.: Dealing with positive and negative query criteria in fuzzy database querying. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds.) FQAS 2009. LNCS (LNAI), vol. 5822, pp. 593–604. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04957-6_51
De Tré, G., Kacprzyk, J., Pasi, G., Zadrożny, S., Bronselaer, A. (eds.): International Journal of Intelligent Systems, Special Issue on Human Centric Data Management, vol. 33, no. 10 (2018)
Dubois, D., Prade, H.: Handling bipolar queries in Fuzzy Information Processing. In: Galindo, J. (ed.) Handbook of Research on Fuzzy Information Processing in Databases, pp. 97–114. IGI Global, New York (2008)
Dubois, D., Prade, H.: An introduction to bipolar representations of information and preference. Int. J. Intell. Syst. 23, 866–877 (2008)
Dujmović, J.J.: Soft Computing Evaluation Logic: The LSP Decision Method and Its Applications. Wiley-Blackwell, Hoboken (2018)
Hayashi, C.: What is data science? Fundamental concepts and a heuristic example. In: Hayashi, C., Yajima, K., Bock, H.H., Ohsumi, N., Ohsumi, Y., Baba, Y. (eds.) Data Science, Classification, and Related Methods. STUDIES CLASS, pp. 40–51. Springer, Tokyo (1998). https://doi.org/10.1007/978-4-431-65950-1_3
Lacroix, M., Lavency, P.: Preferences: putting more knowledge into queries. In: Proceedings of the 13 International Conference on Very Large Databases, Brighton, UK, pp. 217–225 (1987)
Liétard, L., Hadjali, A., Rocacher, D.: Towards a gradual QCL model for database querying. In: Laurent, A., Strauss, O., Bouchon-Meunier, B., Yager, R.R. (eds.) IPMU 2014. CCIS, vol. 444, pp. 130–139. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-08852-5_14
Lukoianova, T., Rubin, V.L.: Veracity roadmap: is big data objective, truthful and credible? Adv. Classif. Res. Online 24(1), 4–15 (2014)
Matthé, T., De Tré, G., Zadrożny, S., Kacprzyk, J., Bronselaer, A.: Bipolar database querying using bipolar satisfaction degrees. Int. J. Intell. Syst. 26(10), 890–910 (2011)
Redman, T.: Data Quality for the Information Age. Artech-House, Norwood (1996)
Saha, B., Srivastava, D.: Data quality: the other face of big data. In: Proceedings of the 2014 IEEE 30th International Conference on Data Engineering, Chicago, USA, pp. 1294–1297 (2014)
Stevens, S.S.: On the theory of scales of measurement. Science 103(2684), 677–680 (1946)
Wang, R., Strong, D.: Beyond accuracy: what data quality means to data consumers. J. Manag. Inf. Syst. 12(4), 5–34 (1996)
Yager, R.R.: On ordered weighted averaging aggregation operators in multi-criteria decision making. IEEE Trans. Syst. Man Cybern. 18, 183–190 (1988)
Zadeh, L.A.: Fuzzy sets. Inf. Control 8(3), 338–353 (1965)
Zadeh, L.A.: Soft computing and fuzzy logic. In: Advances in Fuzzy Systems - Applications and Theory, vol. 6, pp. 796–804 (1996)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
De Tré, G., Boeckling, T., Timmerman, Y., Zadrożny, S. (2019). Handling Veracity of Nominal Data in Big Data: A Multipolar Approach. In: Cuzzocrea, A., Greco, S., Larsen, H., Saccà, D., Andreasen, T., Christiansen, H. (eds) Flexible Query Answering Systems. FQAS 2019. Lecture Notes in Computer Science(), vol 11529. Springer, Cham. https://doi.org/10.1007/978-3-030-27629-4_29
Download citation
DOI: https://doi.org/10.1007/978-3-030-27629-4_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-27628-7
Online ISBN: 978-3-030-27629-4
eBook Packages: Computer ScienceComputer Science (R0)