[SPARK-29495] Add ability to estimate per - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Duplicate
Affects Version/s: 2.4.4
Fix Version/s: None
Component/s: ML
Labels:
None

Description

In gensim, [the LDA model|https://radimrehurek.com/gensim/models/ldamodel.html] has a parameter eval_every that allows a user to specify that the model should be evaluated every X iterations to determine its log perplexity. This helps to determine convergence of the model, and whether or not the proper number of iterations has been chosen. Spark has no similar functionality in its implementation of LDA. This should be added, as it appears the only way to achieve this functionality would be to train models of varying numbers of iterations and evaluate each's log perplexity.

Attachments

Issue Links

duplicates

SPARK-29496 Add ability to estimate perplexity every X iterations for LDA

Open

Activity

People

Assignee:: Unassigned

Reporter:: Chris Nardi

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 17/Oct/19 03:54

Updated:: 23/Oct/19 05:09

Resolved:: 17/Oct/19 03:59