Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-25135

insert datasource table may all null when select from view on parquet

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Blocker
    • Resolution: Duplicate
    • 2.3.0, 2.3.1
    • None
    • SQL

    Description

      This happens on parquet.

      How to reproduce in parquet.

      val path = "/tmp/spark/parquet"
      val cnt = 30
      spark.range(cnt).selectExpr("cast(id as bigint) as col1", "cast(id as bigint) as col2").write.mode("overwrite").parquet(path)
      spark.sql(s"CREATE TABLE table1(col1 bigint, col2 bigint) using parquet location '$path'")
      spark.sql("create view view1 as select col1, col2 from table1 where col1 > -20")
      spark.sql("create table table2 (COL1 BIGINT, COL2 BIGINT) using parquet")
      spark.sql("insert overwrite table table2 select COL1, COL2 from view1")
      spark.table("table2").show
      

      FYI, the following is orc.

      scala> val path = "/tmp/spark/orc"
      scala> val cnt = 30
      scala> spark.range(cnt).selectExpr("cast(id as bigint) as col1", "cast(id as bigint) as col2").write.mode("overwrite").orc(path)
      scala> spark.sql(s"CREATE TABLE table1(col1 bigint, col2 bigint) using orc location '$path'")
      scala> spark.sql("create view view1 as select col1, col2 from table1 where col1 > -20")
      scala> spark.sql("create table table2 (COL1 BIGINT, COL2 BIGINT) using orc")
      scala> spark.sql("insert overwrite table table2 select COL1, COL2 from view1")
      scala> spark.table("table2").show
      +----+----+
      |COL1|COL2|
      +----+----+
      |  15|  15|
      |  16|  16|
      |  17|  17|
      ...
      

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              yumwang Yuming Wang
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: