Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
parquet::Comparator is currently used to calculate the min & max values of an array. This should be benchmarked against arrow::compute's MinMax kernel (once it supports all necessary data types). The latter should be more aggressive with SIMD resulting in better performance.
Even if there is no performance difference the MinMax kernel should be used when computing dictionary statistics as the current implementation requires making a copy of the dictionary values array (see ARROW-12513)