[SPARK-21178] Add support for label specific metrics in MulticlassClassificationEvaluator - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Duplicate
Affects Version/s: 2.1.1
Fix Version/s: None
Component/s: ML
Labels:
- bulk-closed

Flags:

Patch, Important

Description

MulticlassClassificationEvaluator is restricted to the global metrics - f1, weightedPrecision, weightedRecall, accuracy

However, we have a requirement where we would want to optimize the learning on metric for a specific label - for instance, true positive rate (label 'B')

For example : Take a fraud detection use-case with labels 'good' and 'fraud' being passed to a manual verification team. We want to maximize the true-positive rate of ('fraud') label, so that whenever the model predicts a data point as 'good', it has a strong likelihood of it being 'good', and the manual team can ignore it.
While it's ok to predict some 'good' data points as 'fraud', as it will be taken care by the manual verification team.

Attachments

Issue Links

duplicates

SPARK-28044 MulticlassClassificationEvaluator support more metrics

Resolved

links to

[Github] Pull Request #18390 (rawataaryan9)

GitHub Pull Request #18390

Activity

People

Assignee:: Unassigned

Reporter:: Aman Rawat

Votes:: 2 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 22/Jun/17 11:55

Updated:: 20/Aug/19 11:43

Resolved:: 20/Aug/19 11:43