[SPARK-25640] Clarify/Improve EvalType for grouped aggregate and window aggregate - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: 2.4.0
Fix Version/s: None
Component/s: PySpark
Labels:
- bulk-closed

Target Version/s:

3.0.0

Description

Currently, grouped aggregate and window aggregate uses different EvalType, however, they map to the same user facing type PandasUDFType.GROUPED_MAP.

It makes sense to have one user facing type because it (PandasUDFType.GROUPED_MAP) can be used in both groupby and window operation.

However, the mismatching between PandasUDFType and EvalType can be confusing to developers. We should clarify and/or improve this.

See discussion at: https://github.com/apache/spark/pull/22620#discussion_r222452544

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Li Jin

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 04/Oct/18 14:41

Updated:: 25/May/21 01:51

Resolved:: 25/May/21 01:42