Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-50

kafka intra-cluster replication support

Agile BoardAttach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.8.0
    • None
    • None

    Description

      Currently, Kafka doesn't have replication. Each log segment is stored in a single broker. This limits both the availability and the durability of Kafka. If a broker goes down, all log segments stored on that broker become unavailable to consumers. If a broker dies permanently (e.g., disk failure), all unconsumed data on that node is lost forever. Our goal is to replicate every log segment to multiple broker nodes to improve both the availability and the durability.

      We'd like to support the following in Kafka replication:

      1. Configurable synchronous and asynchronous replication
      2. Small unavailable window (e.g., less than 5 seconds) during broker failures
      3. Auto recovery when a failed broker rejoins
      4. Balanced load when a broker fails (i.e., the load on the failed broker is evenly spread among multiple surviving brokers)

      Here is a complete design proposal for Kafka replication - https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Replication

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            junrao Jun Rao
            junrao Jun Rao
            Votes:
            3 Vote for this issue
            Watchers:
            23 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

              Estimated:
              Original Estimate - 1,764h
              1,764h
              Remaining:
              Remaining Estimate - 1,764h
              1,764h
              Logged:
              Time Spent - Not Specified
              Not Specified

              Slack

                Issue deployment