Details
-
Sub-task
-
Status: Closed
-
Blocker
-
Resolution: Fixed
-
None
Description
As part of prototyping, I have most of the core functionalities in
https://github.com/bvaradar/hudi/tree/vb_bootstrap
This includes:
- Timeline and FileSystem View changes
- New Bootstrap Client to perform Bootstrap
- DeltaStreamer Integration
- Hive Parquet Read Optimized reader integration
Needs to be done:
- Merge Handle changes to support upsert over bootstrap file slice (Read part similar to that of (4) functionally and write part same as that of current Hoodie MergeHandle.
- Unit Testing
- Code cleanup as the current implementation has duplicated code.
- Automated integration test
- Hoodie CLI and Spark DataSource Write integration
Attachments
Issue Links
- links to