Strategies and Challenges in Big Data: A Short Review

Santhosh Kumar, D. K.; D‘Mello, Demian Antony

doi:10.1007/978-3-030-16660-1_4

D. K. Santhosh Kumar &
Demian Antony D‘Mello

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 941)

Included in the following conference series:

International Conference on Intelligent Systems Design and Applications

Abstract

Big Data has become a prominent research topic in recent years. It is not only large in size but is also generated at a brisk rate and in great variety, which has driven an upsurge of research in multidisciplinary fields such as government, healthcare, and business performance applications. Because of its key features (Volume, Velocity, and Variety), Big Data is difficult to store and analyse with conventional tools and techniques. It introduces unique challenges in scalability, storage, computational complexity, analytics, statistical correlation, and security. We therefore describe the salient features of Big Data and how these affect storage technologies and analytical techniques. We then present a taxonomy of Big Data sub-domains and discuss the different datasets based on data characteristics, privacy concerns, and domain and application knowledge. Finally, we explore research issues and challenges in Big Data storage technologies, data privacy, and data analytics.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Canara Engineering College Mangalore, VTU, Belagavi, India
D. K. Santhosh Kumar & Demian Antony D‘Mello

Corresponding author

Correspondence to D. K. Santhosh Kumar.

Editor information

Editors and Affiliations

Machine Intelligence Research Labs, Auburn, WA, USA
Ajith Abraham
School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
Aswani Kumar Cherukuri
Tijuana Institute of Technology, Tijuana, Mexico
Patricia Melin
Machine Intelligence Research Labs, Auburn, WA, USA
Niketa Gandhi

Cite this paper

Santhosh Kumar, D.K., D‘Mello, D.A. (2020). Strategies and Challenges in Big Data: A Short Review. In: Abraham, A., Cherukuri, A., Melin, P., Gandhi, N. (eds) Intelligent Systems Design and Applications. ISDA 2018. Advances in Intelligent Systems and Computing, vol 941. Springer, Cham. https://doi.org/10.1007/978-3-030-16660-1_4

DOI: https://doi.org/10.1007/978-3-030-16660-1_4
Published: 14 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16659-5
Online ISBN: 978-3-030-16660-1
eBook Packages: Intelligent Technologies and Robotics (R0)
