Details
-
Bug
-
Status: Resolved
-
P2
-
Resolution: Fixed
-
None
-
None
Description
Spark runner implementation of GABW includes a "built-in" groupByKey, but BOBK before it already groups, so in order to avoid an unnecessary shuffle we need to force a Partitioner on the RDDs involved.
Attachments
Issue Links
- links to