Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-4063

Flink runner supports cluster-wide artifact deployments through the Distributed Cache

Details

    • New Feature
    • Status: Open
    • P3
    • Resolution: Unresolved
    • None
    • None
    • runner-flink
    • None

    Description

      As of now, Flink effectively has a dependency on an external storage system for artifact management. This is because the Flink Distributed Cache does not actually distribute and cache blobs itself, but rather expects that each node in a running cluster has access to a well-known artifact resource.

      We should get this for free whenever https://github.com/apache/flink/pull/5580 is merged (likely in 1.5). For now, we will have to defer to external storage systems like GCS or HDFS.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              bsidhom Ben Sidhom
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: