Beam / BEAM-12739

Flink JobServer should bundle AWS IO libraries to support staging on S3

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: P2
    • Resolution: Fixed
    • Affects Version/s: 2.31.0
    • Fix Version/s: 2.33.0
    • Component/s: jobserver
    • Labels: None

    Description

      It doesn't look like the AWS Filesystem libraries are included in the Flink JobServer.

      There is a TODO in the code to update the dependencies to support the AWS filesystem.

      https://github.com/apache/beam/blob/master/runners/flink/job-server/flink_job_server.gradle#L89

      As reported in a mailing list thread, trying to use S3 to stage artifacts results in the following exception:

      Aug 06, 2021 1:52:51 AM org.apache.beam.runners.fnexecution.artifact.ArtifactStagingService$2 finishStaging
      SEVERE: Error staging artifacts
      java.util.concurrent.ExecutionException: java.lang.IllegalArgumentException: No filesystem found for scheme s3
          at java.util.concurrent.FutureTask.report(FutureTask.java:122)
          at java.util.concurrent.FutureTask.get(FutureTask.java:192)
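
The "No filesystem found for scheme s3" message is what one would expect from a registry keyed by URI scheme when no handler for `s3` is on the classpath. The sketch below is a simplified, hypothetical model of that lookup, not Beam's actual code: the class `SchemeRegistryDemo`, its methods, and the handler names are invented for illustration. It shows the lookup failing until a handler for the scheme is registered, which is roughly the effect that bundling the AWS IO libraries into the job server should have.

```java
import java.net.URI;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, simplified model of a scheme-to-filesystem registry.
// Names are invented for illustration; this is not Beam's implementation.
class SchemeRegistryDemo {
  // Maps a URI scheme (for example "file", "gs", "s3") to a handler name.
  private final Map<String, String> handlers = new TreeMap<>();

  // Registers a handler for a scheme, as a bundled library might do at startup.
  void register(String scheme, String handlerName) {
    handlers.put(scheme.toLowerCase(Locale.ROOT), handlerName);
  }

  // Resolves the handler for a path, failing if the scheme is unknown.
  String lookup(String path) {
    String scheme = URI.create(path).getScheme();
    if (scheme == null) {
      scheme = "file"; // assumption: paths without a scheme are treated as local
    }
    String handler = handlers.get(scheme.toLowerCase(Locale.ROOT));
    if (handler == null) {
      throw new IllegalArgumentException("No filesystem found for scheme " + scheme);
    }
    return handler;
  }

  public static void main(String[] args) {
    SchemeRegistryDemo registry = new SchemeRegistryDemo();
    registry.register("file", "LocalFileSystem");
    registry.register("gs", "GcsFileSystem");

    // Without an s3 handler, staging to S3 fails at lookup time.
    try {
      registry.lookup("s3://bucket/staging/artifact.jar");
    } catch (IllegalArgumentException e) {
      System.out.println(e.getMessage());
    }

    // Once a handler for the scheme is registered, the same lookup succeeds.
    registry.register("s3", "S3FileSystem");
    System.out.println(registry.lookup("s3://bucket/staging/artifact.jar"));
  }
}
```

In this model the fix is a registration step; in a real deployment the equivalent would be making the relevant filesystem library available on the job server's classpath.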
       

       

            People

              Assignee: Jeremy Lewi (jeremy@lewi.us)
              Reporter: Jeremy Lewi (jeremy@lewi.us)

                Time Tracking

                  Original Estimate: Not Specified
                  Remaining Estimate: 0h
                  Time Spent: 40m