[SPARK-27636] Remove cached RDD blocks after PIC execution - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: 2.3.3, 2.4.2, 3.0.0
Fix Version/s: 3.0.0
Component/s: MLlib
Labels: None

Description

Test steps to reproduce:
1) bin/spark-shell
import org.apache.spark.ml.clustering.PowerIterationClustering
val dataset = spark.createDataFrame(Seq(
  (0L, 1L, 1.0),
  (1L, 2L, 1.0),
  (3L, 4L, 1.0),
  (4L, 0L, 0.1))).toDF("src", "dst", "weight")
val model = new PowerIterationClustering().
  setMaxIter(10).
  setInitMode("degree").
  setWeightCol("weight")
val prediction = model.assignClusters(dataset).select("id", "cluster")

2) Open the Storage tab of the Spark UI. Many RDD blocks are still cached, even after PIC has finished running.
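The same check can also be made from the shell instead of the UI. The following is a minimal sketch, assuming it is pasted into the same spark-shell session right after step 1 (so the predefined `spark` session is available). It only lists what is still marked as persistent; it is not the change made in the linked pull request.

```scala
// Run in the same spark-shell session, after the assignClusters call in step 1.
// getPersistentRDDs returns every RDD still marked as persistent in this SparkContext.
val cached = spark.sparkContext.getPersistentRDDs

// Print one line per RDD that is still registered as cached.
cached.foreach { case (id, rdd) =>
  println(s"RDD $id (${rdd.name}) storage level: ${rdd.getStorageLevel.description}")
}
println(s"Persistent RDDs remaining: ${cached.size}")

// Possible manual workaround on affected versions (an assumption, not the fix itself).
// Note that this also drops any RDDs the session cached on purpose:
// cached.values.foreach(_.unpersist(blocking = false))
```

On an affected version, the count printed after running PIC would be expected to be non-zero, matching what the Storage tab shows.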

Issue Links

links to

GitHub Pull Request #24531

People

Assignee: shahid

Reporter: shahid

Votes: 0

Watchers: 1

Dates

Created: 05/May/19 21:29

Updated: 09/May/19 15:23

Resolved: 09/May/19 14:28