Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-18855 VectorIO API tuning/stabilization
  3. HADOOP-18854

add options to disable range merging of vectored io

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.3.5, 3.3.6
    • None
    • fs, fs/s3
    • None

    Description

      I'm seeing test failures in my PARQUET-2171 pr because assertions about the #of bytes read isn't holding -small files are being read and the vector range merging is pulling in the whole file.

      ```
      [ERROR] TestInputOutputFormat.testReadWriteWithCounter:338 bytestotal != bytesread expected:<5510> but was:<11020>
      ```

      I think for parquet i will add an option to disable vector io, but really the filesystems which support it should allow for merging to be disabled

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              stevel@apache.org Steve Loughran
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: