Cassandra—A Distributed Database System: An Overview

  • Abdul Wahid
  • Kanupriya Kashyap
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 755)

Abstract

In big data environments, Apache Cassandra is a distributed database that offers very high availability. It is an open source database system designed to manage large volumes of transactional data across many servers worldwide. Its main features are high availability and very high fault tolerance in a decentralized database system with zero downtime. Traditional relational database management systems (RDBMSs) have been used to store application data for many years, but changes are required because applications must now scale to levels that were previously unimaginable. Scaling is not the only concern: companies also require applications that are always available and run fast, which is where RDBMSs fail. Apache Cassandra is a fully distributed database whose architecture handles extreme data velocity with high availability and scalability, and recovers from faults easily. In the Cassandra architecture, there is no master node that controls the other nodes in the ring or network. Data is distributed among the nodes in equal proportion. Cassandra creates an environment in which an entire datacenter can be lost while the system still performs as if nothing had happened. This paper provides a brief overview of Cassandra.
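
The masterless ring and the roughly equal data distribution described above can be illustrated with a minimal sketch. It is not Cassandra's implementation: the `Ring` class, the `token` helper, the `vnodes` parameter, and the node names are hypothetical, and MD5 is only a stand-in for a real partitioner hash. The sketch assumes each node is placed on the ring at several hashed positions and that a key is stored on the first distinct nodes found clockwise from the key's own position.

```python
import bisect
import hashlib
from collections import Counter


def token(value: str) -> int:
    """Map a string to a ring position (MD5 is a stand-in hash, an assumption)."""
    return int.from_bytes(hashlib.md5(value.encode("utf-8")).digest()[:8], "big")


class Ring:
    """Toy masterless ring: every node owns several positions, none is special."""

    def __init__(self, nodes, vnodes=64):
        # Place each node on the ring at `vnodes` hashed positions.
        self._entries = sorted(
            (token(f"{node}#{i}"), node) for node in nodes for i in range(vnodes)
        )
        self._tokens = [t for t, _ in self._entries]

    def replicas(self, key, rf=3):
        """Walk clockwise from the key's token and collect `rf` distinct nodes."""
        start = bisect.bisect_right(self._tokens, token(key))
        found = []
        for offset in range(len(self._entries)):
            node = self._entries[(start + offset) % len(self._entries)][1]
            if node not in found:
                found.append(node)
                if len(found) == rf:
                    break
        return found


def live_replicas(ring, key, down, rf=3):
    """Replicas of `key` that are still reachable when the nodes in `down` fail."""
    return [node for node in ring.replicas(key, rf) if node not in down]


if __name__ == "__main__":
    ring = Ring(["node-a", "node-b", "node-c", "node-d"])  # hypothetical names
    keys = [f"user:{i}" for i in range(10_000)]

    # Count how many keys each node owns as first replica.
    print(Counter(ring.replicas(k, rf=1)[0] for k in keys))

    # With three replicas per key, losing one node leaves every key readable.
    print(all(live_replicas(ring, k, down={"node-b"}) for k in keys))
```

Because any node can compute `replicas` for any key from the ring alone, no coordinator is needed to locate data, which is the property the abstract attributes to the absence of a master node. Surviving the loss of a whole datacenter would additionally require placing replicas in different datacenters, which this sketch does not model.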

Keywords

Distributed database, Cassandra, Decentralized, Fault tolerant

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. Department of Computer Science and Engineering, National Institute of Technology Patna, Patna, India
  2. Department of Computer Engineering, National Institute of Technology Kurukshetra, Kurukshetra, India