Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
3.1.0, 3.1.2
-
None
-
None
Description
CTAS query failure at DDL task stage due to HMS connection issue leaves the output file in target directory. Since DDL task stage happens after Tez DAG completion and MOVE Task , output file getsĀ already moved to target directory and does not get cleaned up after the query failure.
Re-executing the same query causes a duplicate file under table location hence duplicate data.