Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
2.1.1
-
None
-
Patch, Important
Description
MulticlassClassificationEvaluator is restricted to the global metrics - f1, weightedPrecision, weightedRecall, accuracy
However, we have a requirement where we would want to optimize the learning on metric for a specific label - for instance, true positive rate (label 'B')
For example : Take a fraud detection use-case with labels 'good' and 'fraud' being passed to a manual verification team. We want to maximize the true-positive rate of ('fraud') label, so that whenever the model predicts a data point as 'good', it has a strong likelihood of it being 'good', and the manual team can ignore it.
While it's ok to predict some 'good' data points as 'fraud', as it will be taken care by the manual verification team.
Attachments
Issue Links
- duplicates
-
SPARK-28044 MulticlassClassificationEvaluator support more metrics
- Resolved
- links to