Details
-
Improvement
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
3.2.0
-
None
-
None
Description
If I have a file in HDFS that contains 100 blocks, and I happen to lose the first block (for whatever obscure/unlikely/dumb reason), I can no longer access the 99% of the file that's still there and accessible. In the case of some data formats (e.g. text), the remaining data may still be useful. It would be nice to have a way to extract the remaining data without having to manually reassemble the file contents from the block files. Something like hdfs dfs -copyToLocal -ignoreCorrupt <file>. It could insert some marker to show where the missing blocks are.