Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-11215

Add multiple columns support to StringIndexer

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.4.0
    • 3.0.0
    • ML
    • None
    • Hide
      When specifying frequencyDesc or frequencyAsc as stringOrderType param in StringIndexer, in case of equal frequency, the order of strings was previously undefined. Since Spark 3.0, strings with equal frequency are further
      sorted lexicographically.
      Show
      When specifying frequencyDesc or frequencyAsc as stringOrderType param in StringIndexer, in case of equal frequency, the order of strings was previously undefined. Since Spark 3.0, strings with equal frequency are further sorted lexicographically.

    Description

      Add multiple columns support to StringIndexer, then users can transform multiple input columns to multiple output columns simultaneously. See discussion SPARK-8418.

      Attachments

        Issue Links

          Activity

            People

              viirya L. C. Hsieh
              yanboliang Yanbo Liang
              Votes:
              6 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: