Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-21059

Support external catalogs

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsAdd voteVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Hive has ability to query data from external sources such as other RDBMS, Kafka, Druid, Hbase.

      For example, to be able to query data from external sources such as a mysql table, an external table has to be explicitly created in Hive for every table in mysql that needs to be made accessible.
      Moreover, for creating such a table, the schema and login credentials have to be specified.

      By supporting "external catalogs" in Hive, we can have references to all tables in an entire mysql database by just creating one external catalog. The schema of the tables would also get automatically detected from the underlying source.

      Where possible, additional information such as statistics of the tables can also be imported from the underlying datasource, to enable Hive cost based optimizer to create optimized query plans.

      To be able to support the use of external catalog, some of the work tracked under HIVE-18685 for catalog support (including catalog in SQL syntax of Hive) is also needed.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned Assign to me
            thejas Thejas Nair

            Dates

              Created:
              Updated:

              Slack

                Issue deployment