  • Book
  • © 2016

Big Data 2.0 Processing Systems

A Survey

Authors: Sherif Sakr

  • Provides readers the “big picture” and a comprehensive survey of the domain of big data processing systems and discusses various aspects of research and development
  • Describes an entire range of engines that transcend the Hadoop framework and are dedicated to specific verticals (e.g. structured data, graph data, streaming data)
  • A valuable reference guide for students, researchers and professionals in the domain of big data processing systems
  • Includes supplementary material: sn.pub/extras

Part of the book series: SpringerBriefs in Computer Science (BRIEFSCOMPUTER)

Buying options

eBook USD 44.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Table of contents (6 chapters)

  1. Front Matter

    Pages i-xv
  2. Introduction

    • Sherif Sakr
    Pages 1-13
  3. Large-Scale Graph Processing Systems

    • Sherif Sakr
    Pages 53-73
  4. Large-Scale Stream Processing Systems

    • Sherif Sakr
    Pages 75-89
  5. Conclusions and Outlook

    • Sherif Sakr
    Pages 91-95
  6. Back Matter

    Pages 97-102

About this book

This book provides readers the “big picture” and a comprehensive survey of the domain of big data processing systems. For the past decade, the Hadoop framework has dominated the world of big data processing, yet recently academia and industry have started to recognize its limitations in several application domains and big data processing scenarios such as the large-scale processing of structured data, graph data and streaming data. Thus, it is now gradually being replaced by a collection of engines that are dedicated to specific verticals (e.g. structured data, graph data, and streaming data). The book explores this new wave of systems, which it refers to as Big Data 2.0 processing systems.

After Chapter 1 presents the general background of the big data phenomenon, Chapter 2 provides an overview of general-purpose big data processing systems that let users develop big data processing jobs for different application domains. Chapter 3 then examines systems that support SQL on top of the Hadoop infrastructure and provide competitive, scalable performance for processing large-scale structured data. Chapter 4 discusses systems designed to tackle large-scale graph processing. Chapter 5 focuses on systems that provide scalable solutions for processing big data streams, as well as systems that support the development of data pipelines between different types of big data processing jobs and systems. Lastly, Chapter 6 shares conclusions and an outlook on future research challenges.

Overall, the book offers a valuable reference guide for students, researchers and professionals in the domain of big data processing systems. Further, its comprehensive content will hopefully encourage readers to pursue further research on the subject.

Reviews

“This book surveys the most popular platforms for processing big data, and informs the reader about which platforms will be good with what kind of data and problems. … It is a good resource for students, academics, researchers, and professionals, as it can help them to gain a good understanding of the characteristics of big data, as well as an overview of the existing big data processing systems.” (Gulustan Dogan, Computing Reviews, January, 2017)

“The book "Big Data 2.0 Processing Systems" is a valuable and up-to-date guide through this field and provides the reader with a comprehensible and concise overview of the main developments beyond the initial Map Reduce-focused version of Hadoop.” (Prof. Dr. Erhard Rahm, Universität Leipzig, Germany)

Authors and Affiliations

  • The University of New South Wales, Sydney, Australia

    Sherif Sakr

About the author

Sherif Sakr is an academic, professional specialist and consultant on Big Data systems. He received his PhD degree in Computer and Information Science from Konstanz University, Germany, in 2007. He received his BSc and MSc degrees in Computer Science from the Information Systems department at the Faculty of Computers and Information at Cairo University, Egypt, in 2000 and 2003 respectively. In 2013, Sherif was awarded the Stanford Innovation and Entrepreneurship Certificate.
He is currently a professor of computer and information science in the Health Informatics department at King Saud bin Abdulaziz University for Health Sciences. He is also affiliated with the University of New South Wales and DATA61/CSIRO (formerly NICTA). In 2008 and 2009, Sherif held an Adjunct Lecturer position at the Department of Computing of Macquarie University. In 2011, he held a Visiting Researcher position at the eXtreme Computing Group, Microsoft Research Laboratories, Redmond, WA, USA. In 2012, he held a Research MTS position at Alcatel-Lucent Bell Labs.
Sherif has published more than 90 refereed research publications in international journals and conferences. So far, he has (co-)authored three books and co-edited three others.
