Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-2125

Create a public dataset and guideline/playbook for use by public

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Usability
    • None

    Description

      Expose a public dataset w/ schema details and how to use them. 

       

      For eg:

      • We could have a parquet dump somewhere, where one could read from generate their own hudi tables. 
      • We could have playbook to create diff types of hudi tables(COW/MOR) by reading from this source. 
      • We could add a playbook to use deltastreamer to read from this source one file at a time and inject to hudi table. 

      Attachments

        Activity

          People

            Unassigned Unassigned
            shivnarayan sivabalan narayanan
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: