Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-10989

OrcRowInputFormat uses two different file systems

    XMLWordPrintableJSON

Details

    Description

      The OrcRowInputFormat seems to use two different FileSystem. The Flink FileSystem for listing the files and generating the InputSplits and then Hadoop's FileSystem to actually read the input splits. This can be problematic if one only configures Flink's S3 FileSystem but does not provide a S3 implementation for Hadoop's FileSystem.

      I think this is not an intuitive behaviour and can lead to hard to debug problems for a user.

      Attachments

        Activity

          People

            Unassigned Unassigned
            trohrmann Till Rohrmann
            Votes:
            1 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: