Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-13760

Add randomness to default Dataflow job name in Python sdk

Details

    • Improvement
    • Status: Resolved
    • P2
    • Resolution: Fixed
    • None
    • 2.37.0
    • runner-dataflow
    • None

    Description

      Currently, when a Dataflow job is created with the default name in python, the name is a concatenation of the word "beamapp", the username, and the time in microseconds, as seen here.

      Therefore, when two jobs are created by the same user at the same time, the jobs names collide and the second job fails.

      However, the Java SDK has already solved this problem, by appending a random hex string to the job name, seen here.

      The objective of this issue is to align the python sdk with the java sdk, by appending a random string to the default job name.

      Attachments

        Issue Links

          Activity

            People

              willnicholson Will Nicholson
              willnicholson Will Nicholson
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - 48h
                  48h
                  Remaining:
                  Time Spent - 2h 10m Remaining Estimate - 45h 50m
                  45h 50m
                  Logged:
                  Time Spent - 2h 10m Remaining Estimate - 45h 50m
                  2h 10m