Details
Type: Task
Status: Open
Priority: Major
Resolution: Unresolved
3.0.0
None
None
Description
We have a "locality wait time" configuration (`spark.locality.wait`) that allows a task set to downgrade its locality requirement after a certain time has passed. However, the penalty for a locality miss differs from task to task. We propose a WILDCARD task location for tasks that do have a preferred location but would rather be assigned sooner than wait when that location is not available.
This requirement comes from LocalShuffledRowRDD. When the number of initial mappers of a LocalShuffledRowRDD is smaller than the number of worker nodes, serious regressions can occur if short-running tasks all wait for their preferred locations when they could have finished quickly on non-preferred locations.
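To make the proposal concrete, here is a minimal sketch of the scheduling decision it implies. It is a toy model, not Spark code: `Task`, the `wildcard` flag, and `can_launch` are hypothetical names invented for illustration, and the 3-second wait is only an example value. It models just the accept-or-wait decision for a single resource offer.

```python
from dataclasses import dataclass
from typing import Optional


# Hypothetical toy model; none of these names are Spark APIs.
@dataclass(frozen=True)
class Task:
    task_id: int
    preferred_host: Optional[str]  # None means no locality preference
    wildcard: bool = False         # assumed flag: prefer the host, but do not wait for it


def can_launch(task: Task, offered_host: str,
               waited_s: float, locality_wait_s: float) -> bool:
    """Decide whether `task` may start on `offered_host` right now."""
    if task.preferred_host is None or task.preferred_host == offered_host:
        return True  # no preference, or the preferred host is the one offered
    if task.wildcard:
        return True  # wildcard: accept a non-preferred host immediately
    # Strict preference: downgrade only once the locality wait has elapsed.
    return waited_s >= locality_wait_s


if __name__ == "__main__":
    strict = Task(1, "host-a")
    relaxed = Task(2, "host-a", wildcard=True)
    # Both tasks prefer host-a, but only host-b is offered after 0.5 s.
    print(can_launch(strict, "host-b", 0.5, 3.0))   # False
    print(can_launch(relaxed, "host-b", 0.5, 3.0))  # True
```

In this sketch a strict task leaves the offered worker idle until the wait expires, while a wildcard task starts at once. For short-running tasks spread over fewer preferred hosts than there are workers, that idle wait can exceed the task's own runtime, which is the regression described above.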