Use the CData ODBC Driver for Hive in Microsoft Power Query

The CData ODBC Driver for Hive lets you link to Hive data from Microsoft Power Query, so the imported data reflects updates in Hive whenever the query is refreshed. This article shows how to use the ODBC driver to import Hive data into Microsoft Power Query.

Connect to Hive as an ODBC Data Source

If you have not already, first specify connection properties in an ODBC DSN (data source name). This is the last step of the driver installation. You can use the Microsoft ODBC Data Source Administrator to create and configure ODBC DSNs.

Set the Server, Port, TransportMode, and AuthScheme connection properties to connect to Hive.
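As a rough illustration of how these four properties fit together, the sketch below assembles them into a semicolon-delimited ODBC string. The helper name build_connection_string and every value shown (host, port, transport mode, authentication scheme) are placeholders chosen for this example, not values taken from the driver; check the driver documentation for the settings your Hive server accepts.

```python
def build_connection_string(properties):
    """Join connection properties into a 'Key=Value;Key=Value' ODBC string."""
    parts = []
    for key, value in properties.items():
        text = str(value)
        # ODBC convention: a value containing ';' is wrapped in braces
        # so the semicolon is not read as a property separator.
        if ";" in text:
            text = "{" + text + "}"
        parts.append(f"{key}={text}")
    return ";".join(parts)


# Placeholder values for illustration only; replace with your own settings.
hive_properties = {
    "Server": "127.0.0.1",      # assumed host
    "Port": 10000,              # assumed port (HiveServer2's usual default)
    "TransportMode": "BINARY",  # assumed value
    "AuthScheme": "Plain",      # assumed value
}

connection_string = build_connection_string(hive_properties)
print(connection_string)
```

In practice, the same key/value pairs are what you would enter in the DSN configuration dialog of the ODBC Data Source Administrator.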

Import Hive Data

Follow the steps below to import Hive data using standard SQL:

  1. From the ribbon in Excel, click Power Query -> From Other Data Sources -> From ODBC.

  2. Enter the ODBC connection string. The connection string below uses the default DSN created when you install the driver:

    Provider=MSDASQL.1;Persist Security Info=False;DSN=CData ApacheHive Source
  3. Enter the SELECT statement that retrieves the data to import. For example:

    SELECT City, CompanyName FROM Customers
  4. Enter credentials, if required, and click Connect. The results of the query are displayed in the Query Editor Preview. You can combine queries from other data sources or refine the data with Power Query formulas. To load the query to the worksheet, click the Close and Load button.
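Before building the query in Excel, it can help to confirm that the DSN and the SELECT statement work on their own. The sketch below is one possible way to do that from Python, assuming the third-party pyodbc package is installed and the default DSN from step 2 is configured. The function name fetch_rows and its connect parameter are hypothetical helpers for this example. It passes only the DSN name, since the Provider=MSDASQL.1 portion of the string in step 2 applies to the OLE DB bridge rather than to a direct ODBC connection.

```python
DSN_NAME = "CData ApacheHive Source"               # default DSN named in step 2
QUERY = "SELECT City, CompanyName FROM Customers"  # example query from step 3


def fetch_rows(query, connect=None):
    """Run the query against the DSN and return all rows as a list of tuples.

    `connect` is a hypothetical hook that lets a caller substitute another
    connection factory; by default pyodbc.connect is used.
    """
    if connect is None:
        import pyodbc  # third-party; imported lazily so the module loads without it
        connect = pyodbc.connect
    connection = connect(f"DSN={DSN_NAME}")
    try:
        cursor = connection.cursor()
        cursor.execute(query)
        return [tuple(row) for row in cursor.fetchall()]
    finally:
        # Release the ODBC connection even if the query fails.
        connection.close()


# Example usage (requires pyodbc and the configured DSN):
#     for city, company in fetch_rows(QUERY):
#         print(city, company)
```

If the DSN does not store credentials, they would need to be added to the connection string; the exact property names depend on the authentication scheme configured for the driver.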