Details
- Bug
- Status: Resolved
- Major
- Resolution: Incomplete
- 2.4.0
- None
Description
As the fix for SPARK-19755, we removed the custom blacklisting mechanism in the Spark-Mesos integration, which hardcoded a maximum of 2 failures before a node was marked as blacklisted.
From now on, the usual blacklisting mechanism is used (when enabled). Its downside is that it does not count failures to launch Mesos tasks (Spark executors); only failures of Spark tasks are counted.
squito, felixcheung, susanxhuynh, skonto: please add details as you see it.
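To make the gap concrete, here is a minimal toy model of per-node failure counting. It is only a sketch and not Spark's actual code: the class `NodeFailureTracker`, its methods, and the `count_launch_failures` flag are all hypothetical names invented for illustration. The threshold of 2 mirrors the old hardcoded Mesos limit mentioned above.

```python
# Hypothetical toy model; none of these names exist in Spark.
MAX_FAILURES_PER_NODE = 2  # mirrors the old hardcoded Mesos limit


class NodeFailureTracker:
    """Counts failures per node and reports whether a node is blacklisted."""

    def __init__(self, max_failures=MAX_FAILURES_PER_NODE,
                 count_launch_failures=False):
        self.max_failures = max_failures
        # False models the generic mechanism described above, which
        # ignores failures to launch Mesos tasks (Spark executors).
        self.count_launch_failures = count_launch_failures
        self.failures = {}

    def record_task_failure(self, node):
        # Failures of Spark tasks are always counted.
        self.failures[node] = self.failures.get(node, 0) + 1

    def record_executor_launch_failure(self, node):
        # Executor launch failures are counted only if explicitly enabled.
        if self.count_launch_failures:
            self.failures[node] = self.failures.get(node, 0) + 1

    def is_blacklisted(self, node):
        return self.failures.get(node, 0) >= self.max_failures
```

In this model, a node on which executors repeatedly fail to launch never becomes blacklisted when `count_launch_failures` is `False`, because no Spark task ever runs there to register a failure. That is the behavior this issue describes.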
Issue Links
- is caused by
- SPARK-19755 Blacklist is always active for MesosCoarseGrainedSchedulerBackend. As result - scheduler cannot create an executor after some time.
- Resolved
- is related to
- SPARK-16630 Blacklist a node if executors won't launch on it.
- Resolved