Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
2.1.1
-
None
Description
SkipGram + Negative Sampling is shown to be comparative or out-performing the hierarchical softmax based approach currently implemented with Spark. Since word2vec is largely a pre-processing step, the performance often can depend on the application it is being used for, and the corpus it is estimated on. These implementation give users the choice of picking one that works best for their use-case.