[SPARK-31339] Changed PipelineModel(...) to self.cls(...) in pyspark.ml.pipeline.PipelineModelReader.load() - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Not A Problem
Affects Version/s: 2.4.5
Fix Version/s: None
Component/s: ML, PySpark
Labels:
- pull-request-available

Flags:

Patch

Description

PR: https://github.com/apache/spark/pull/28110

What changes were proposed in this pull request?
pypsark.ml.pipeline.py line 245: Change PipelineModel(...) to self.cls(...)
Why are the changes needed?
This change fixes the loading of class (which inherits from PipelineModel class) from file.
E.g. Current issue:

CustomPipelineModel(PipelineModel):
    def _transform(self, df):
        ...
 CustomPipelineModel.save('path/to/file') # works
 CustomPipelineModel.load('path/to/file') # wrong: results in PipelineModel() instead of CustomPipelineModel()
 CustomPipelineModel.transform() # wrong: results in calling PipelineModel.transform() instead of CustomPipelineModel.transform()

Does this introduce any user-facing change?
No.

Attachments

Issue Links

links to

GitHub Pull Request #28110

Activity

People

Assignee:: Unassigned

Reporter:: Suraj

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 03/Apr/20 10:52

Updated:: 28/Apr/20 18:25

Resolved:: 28/Apr/20 18:25