Skip to main content

Apache Kafka

Definitions

Apache Kafka (Apache Software Foundation 2017b; Kreps et al. 2011; Goodhope et al. 2012; Wang et al. 2015; Kleppmann and Kreps 2015) is a scalable, fault-tolerant, and highly available distributed streaming platform that can be used to store and process data streams.

Kafka consists of three main components:

  • the Kafka cluster,

  • the Connect framework (Connect API),

  • and the Streams programming library (Streams API).

The Kafka cluster stores data streams, which are sequences of messages/events continuously produced by applications and sequentially and incrementally consumed by other applications. The Connect API is used to ingest data into Kafka and export data streams to external systems like distributed file systems, databases, and others. For data stream processing, the Streams API allows developers to specify sophisticated stream processing pipelines that read input streams from the Kafka cluster and write results back to Kafka.

Kafka supports many different use cases...

This is a preview of subscription content, log in via an institution.

References

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Matthias J. Sax .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Sax, M.J. (2018). Apache Kafka. In: Sakr, S., Zomaya, A. (eds) Encyclopedia of Big Data Technologies. Springer, Cham. https://doi.org/10.1007/978-3-319-63962-8_196-1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63962-8_196-1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63962-8

  • Online ISBN: 978-3-319-63962-8

  • eBook Packages: Springer Reference MathematicsReference Module Computer Science and Engineering

Publish with us

Policies and ethics

Chapter history

  1. Latest

    Apache Kafka
    Published:
    10 March 2022

    DOI: https://doi.org/10.1007/978-3-319-63962-8_196-2

  2. Original

    Apache Kafka
    Published:
    10 February 2018

    DOI: https://doi.org/10.1007/978-3-319-63962-8_196-1