Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.2.1
-
None
-
None
Description
Hive concatenation generates corrupted (and therefore unreadable) ORC if there are ORC files with different schema in the same table/partition.
This may happen, for instance, if you put some data in a table, then you add a new column and you put some other data. In this case, running the ORC concatenation leads to a corrupted ORC file.
I think that the right behavior would be not to concatenate ORC files with different schema.