Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-5535

Paging Problem with Querying Directories

    XMLWordPrintableJSON

Details

    • Patch

    Description

      Problem comes with the following Drill query:
      "SELECT * FROM <<mySource>>
      WHERE (dir0='Test1' AND dir1='TestDataSourceID1')
      OR (dir0='Test2' AND dir1='TestDataSourceID2')
      LIMIT 2 OFFSET 0"

      If this call gets run twice it is randomly set which file will be in the result. So if a query is created which should page my result I won't be able to tell which source was used for the result.
      Due two the fact that if file1 contains the columns a, b, c and column b, c, d I also will get a problem with the result as the first results will for example contain the columns a, b, c and the second half of the results will contain a, b, c, d with a filled with null.

      As in the example on your webpage (https://drill.apache.org/docs/querying-directories/) where you query specific columns and order the result without any paging I am wondering if this problem only occurs while using the star in the query.

      Attachments

        Activity

          People

            Unassigned Unassigned
            LukeP2090 Lucian Poth
            Votes:
            1 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: