Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-30128

Promote remaining "hidden" PySpark DataFrameReader options to load APIs

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 2.4.4, 3.0.0
    • 3.0.0
    • PySpark, SQL
    • None

    Description

      Following on to SPARK-29903 and similar issues (linked), there are options available to the DataFrameReader for certain source formats, but which are not exposed properly in the relevant APIs.

      These options include `timeZone` and `pathGlobFilter`. Instead of being noted under the option() method, they should be implemented directly into load APIs that support them.

      Attachments

        Issue Links

          Activity

            People

              gurwls223 Hyukjin Kwon
              nchammas Nicholas Chammas
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: