  1. Apache Drill
  2. DRILL-8280

Cannot ANALYZE files containing non-ASCII column names


Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.20.2
    • Fix Version/s: 1.20.3
    • Component/s: Metadata
    • Labels: None

    Description

      The attached Parquet file contains a single column named "Käse". When the file is saved under /tmp/utf8_col and the Drill command

      analyze table dfs.tmp.utf8_col columns none refresh metadata;

      is run, the following error is raised during execution of the merge_schema function:

      com.fasterxml.jackson.databind.JsonMappingException: Unrecognized character escape 'x' (code 120)
       at [Source: (String)"{"type":"tuple_schema","columns":[{"name":"K\xC3\xA4se","type":"VARCHAR","mode":"REQUIRED"}]}";
       line: 1, column: 47]
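
      The schema string in the message carries the column name as K\xC3\xA4se, and \x is not an escape sequence that JSON defines, which matches the parser's complaint about the character 'x'. The sketch below illustrates this with Python's standard json module. It is an illustration only, not Drill's or Jackson's code: the helper name parses and the variable names are hypothetical, and the source does not state where inside Drill the escapes are produced.

      ```python
      import json

      # Schema string copied from the error message. The raw literal keeps the
      # backslash-x sequences as literal characters, exactly as the parser saw them.
      bad = r'{"type":"tuple_schema","columns":[{"name":"K\xC3\xA4se","type":"VARCHAR","mode":"REQUIRED"}]}'


      def parses(text):
          """Return True if text is valid JSON, False otherwise (hypothetical helper)."""
          try:
              json.loads(text)
              return True
          except json.JSONDecodeError:
              return False


      # 0xC3 0xA4 is the UTF-8 encoding of "ä", so the escaped bytes spell "Käse".
      column_name = b"K\xC3\xA4se".decode("utf-8")

      # Two valid JSON spellings of the same name: a literal non-ASCII character,
      # or a \uXXXX escape.
      good_literal = json.dumps({"name": column_name}, ensure_ascii=False)
      good_escaped = json.dumps({"name": column_name})

      print(parses(bad))
      print(good_literal)
      print(good_escaped)
      ```

      Both valid spellings decode back to the original column name, whereas the byte-escaped form is rejected before any schema handling can take place.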

      Attachments

        1. 0_0_0.parquet
          0.4 kB
          James Turton

          People

            Assignee: James Turton (dzamo)
            Reporter: James Turton (dzamo)