BEAM-14368 (sub-task of BEAM-13970: RunInference V1)

Investigate loading state_dict vs loading the whole model

Description

Loading a PyTorch model as a whole has some issues with pickling. Investigate this by running some experiments. In addition, if the model is too large, the current implementation of RunInference for PyTorch would fail because of memory limits.
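For context, a minimal sketch of the two PyTorch serialization approaches, using a hypothetical `LinearModel` purely for illustration: saving the whole model pickles the model object (so loading requires the same class definition in the loading environment), while saving the `state_dict` only serializes the weight tensors.

```python
import torch
import torch.nn as nn


class LinearModel(nn.Module):
    """Hypothetical example model for illustration."""
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(10, 1)

    def forward(self, x):
        return self.linear(x)


model = LinearModel()

# Approach 1: save/load the whole model. torch.save pickles the model
# object, so loading requires the same class definition to be importable,
# and the full object graph goes through pickle.
torch.save(model, "model_whole.pt")
restored_whole = torch.load("model_whole.pt")

# Approach 2: save/load only the state_dict (a dict of tensors). Loading
# re-instantiates the class locally and copies in the weights, so the
# model object itself is never pickled.
torch.save(model.state_dict(), "model_state.pt")
restored = LinearModel()
restored.load_state_dict(torch.load("model_state.pt"))
restored.eval()
```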


1. We can pass the model class to the `load_model` method of PyTorchModelLoader and instantiate the model there. This avoids pickling the model object: only the class is pickled, and the model is instantiated on the workers (see the sketch below).
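A minimal sketch of what option 1 might look like. The class and method names mirror the loader described above, but the constructor signature (`model_class`, `model_params`, `state_dict_path`) is an assumption for illustration, not Beam's actual API:

```python
import torch


class PyTorchModelLoader:
    """Sketch only; the constructor signature is assumed, not Beam's
    actual API. The loader holds a reference to the model class and a
    path to the saved state_dict, so pickling the loader for shipment
    to workers pickles the class reference, never a model instance."""

    def __init__(self, model_class, model_params, state_dict_path):
        self._model_class = model_class
        self._model_params = model_params
        self._state_dict_path = state_dict_path

    def load_model(self):
        # Runs on each worker: build the model from its class, then
        # load the weights from the saved state_dict file.
        model = self._model_class(**self._model_params)
        model.load_state_dict(torch.load(self._state_dict_path))
        model.eval()
        return model


# Example: only LinearModel (the class) and the path are pickled when
# this loader is sent to workers.
# loader = PyTorchModelLoader(LinearModel, {}, "model_state.pt")
```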

People

Assignee: Anand Inguva
Reporter: Anand Inguva
Votes: 0
Watchers: 3


Time Tracking

Estimated: Not Specified
Remaining: 0h
Logged: 2h 20m