Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
The current API for row filtering has a couple of issues:
- The filter function is passed a VectorizedRowBatch instead of a MutableFilterContext.
- The filter needs to know the precise location for the fields it needs out of the schema.
I'd like to propose changing it from:
Consumer<VectorizedRowBatch>
to
BiFunction<ColumnVector[], MutableFilterContext, Boolean>
That has the advantage that the data that the function should read is explicitly passed to it and we remove the dependence on VectorizedRowBatch.
Attachments
Issue Links
- depends upon
-
ORC-577 Allow row-level filtering
- Closed