Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-7513

[SQL] Row Estimation for BigQueryTable

Details

    • New Feature
    • Status: Triage Needed
    • P2
    • Resolution: Implemented
    • None
    • 2.14.0
    • dsl-sql, io-java-gcp
    • None

    Description

      Calcite tables (org.apache.calcite.schema.Table) should implement the method org.apache.calcite.schema.Statistic getStatistic(). The Statistic instance returned by this method is used for the Volcano optimizer in Calcite. 

      Currently, org.apache.beam.sdk.extensions.sql.impl.BeamCalciteTable has not implemented getStatistic() which means it uses the implementation in org.apache.calcite.schema.impl.AbstractTable and that implementation just returns Statistics.UNKNOWN for all sources.

       

      Things needed to be implemented:

      1- Implementing getStatistic in BeamCalciteTable such that it calls a row count estimation method from BeamSqlTable and adding this method to BeamSqlTable.

      2- Implementing the row count estimation method for BigQueryTable. 

      Attachments

        Activity

          People

            riazela Alireza Samadianzakaria
            riazela Alireza Samadianzakaria
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 10h 10m
                10h 10m