Hive / HIVE-25954

LOAD DATA INPATH can't load data from empty Sequence File (.seq)

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: API
    • Labels: None

    Description

      Whether I use pyhive or impala, I get the same error when loading data into an external table via:

      LOAD DATA INPATH 'path to empty .seq file' into table external_table_name partition (part='value')

      OperationalError: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask

      P.S. Loading a .seq file with size > 0M, or loading that same empty file via Spark, works without any problem.
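A possible client-side guard, as a minimal sketch rather than a fix for the underlying behaviour: every Hadoop SequenceFile starts with the three bytes `SEQ`, so a zero-byte file has no header at all. Assuming the "empty" file here really is zero bytes long, the caller could skip the LOAD for any file that lacks that header. The helper names below are hypothetical, and how the first bytes of the file are fetched from HDFS is left to the caller.

```python
SEQ_MAGIC = b"SEQ"  # first three bytes of every Hadoop SequenceFile header


def looks_like_sequence_file(head: bytes) -> bool:
    """Return True if `head` (the first bytes of a file) starts with the SequenceFile magic."""
    return head[:3] == SEQ_MAGIC


def load_if_sequence_file(cursor, path, head, table, partition_spec):
    """Issue LOAD DATA INPATH only when the file carries a SequenceFile header.

    cursor: any DB-API style cursor with execute(), for example one from pyhive.
    head: the first bytes of the file, fetched by the caller (hypothetical step).
    Returns True if the statement was issued, False if the file was skipped.
    """
    if not looks_like_sequence_file(head):
        return False  # zero-byte or non-SequenceFile input: nothing to load
    # path, table and partition_spec are assumed to be trusted values.
    cursor.execute(
        f"LOAD DATA INPATH '{path}' INTO TABLE {table} PARTITION ({partition_spec})"
    )
    return True
```

With this guard, an empty file simply produces no LOAD statement, which leaves the partition without new data instead of failing the whole job.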

      impala v0.17.0

      pyhive '0.6.4'

      Spark  2.3.0.2.6.5.0-292

       

      Is there perhaps a setting in pyhive that fixes this problem?

      More detailed response from beeline:

      ERROR : Failed with exception Wrong file format. Please check the file's format.
      org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please check the file's format.
              at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
              at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
              at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
              at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
              at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
              at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
              at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
              at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165)
              at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
              at org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
              at org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
              at java.security.AccessController.doPrivileged(Native Method)
              at javax.security.auth.Subject.doAs(Subject.java:422)
              at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
              at org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
              at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
              at java.util.concurrent.FutureTask.run(FutureTask.java:266)
              at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
              at java.util.concurrent.FutureTask.run(FutureTask.java:266)
              at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
              at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
              at java.lang.Thread.run(Thread.java:748)

      Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask (state=08S01,code=1)
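The "Wrong file format" message raised from MoveTask suggests that the failure comes from Hive's load-time file format check, which is controlled by the Hive property `hive.fileformat.check` (default `true`). This is a Hive session setting, not a pyhive option. The sketch below turns the check off for the session before running the LOAD. It is untested against this issue: whether the server allows clients to change the property, and whether skipping the check is enough for an empty file, are both assumptions. The helper name is hypothetical.

```python
def load_without_format_check(cursor, load_statement):
    """Disable Hive's load-time format check for this session, then run the LOAD.

    cursor: any DB-API style cursor with execute(), for example one from pyhive.
    load_statement: the full LOAD DATA INPATH statement to run.
    """
    # hive.fileformat.check is a documented Hive property (default true).
    # Assumption: the server permits changing it per session and doing so
    # avoids the "Wrong file format" error for an empty file.
    cursor.execute("SET hive.fileformat.check=false")
    cursor.execute(load_statement)
```

Skipping the check also removes the protection against loading files of the wrong format, so it is probably better limited to sessions that only handle known SequenceFile inputs.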

          People

            Assignee: Unassigned
            Reporter: Krivorotenko Vladislav
            Votes: 0
            Watchers: 1