Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
Impala 3.3.0
-
ghx-label-6
Description
Major compactions creates directories in base_writeid_visibilityTxnId, which expresses that it contains all deltas +bases <= writeId, and that the compaction's transaction is visibilityTxnId. visibilityTxnId is needed to check whether the compaction is open/aborted/committed, and base directories belonging to open/aborted compactions should be ignored.
Currently Impala only checks the writeId, so if there is an open/aborted compaction, it will be used as base, and base/delta directories with smaller writeIds will be ignored, leading to potential data loss.