Connect to HDFS in Power Automate Desktop via ODBC

Ready to get started?

Download for a free trial:

Download Now

Learn more:

HDFS ODBC Driver

The HDFS ODBC Driver is a powerful tool that allows you to connect with live data from HDFS, directly from any applications that support ODBC connectivity.

Access HDFS data like you would a database - read, write, and update HDFS HDFSData, etc. through a standard ODBC Driver interface.



The CData ODBC Driver for HDFS enables you to integrate HDFS data into workflows built using Microsoft Power Automate Desktop.

The CData ODBC Driver for HDFS enables you to access live HDFS data in workflow automation tools like Power Automate. This article shows how to integrate HDFS data into a simple workflow, moving HDFS data into a CSV file.

Through optimized data processing, CData ODBC Drivers offer unmatched performance for interacting with live HDFS data in Microsoft Power Automate. When you issue complex SQL queries from Power Automate to HDFS, the driver pushes supported SQL operations, like filters and aggregations, directly to HDFS and utilizes the embedded SQL engine to process unsupported operations client-side (e.g. SQL functions and JOIN operations).

Connect to HDFS as an ODBC Data Source

If you have not already, first specify connection properties in an ODBC DSN (data source name). This is the last step of the driver installation. You can use the Microsoft ODBC Data Source Administrator to create and configure ODBC DSNs.

In order to authenticate, set the following connection properties:

  • Host: Set this value to the host of your HDFS installation.
  • Port: Set this value to the port of your HDFS installation. Default port: 50070

When you configure the DSN, you may also want to set the Max Rows connection property. This will limit the number of rows returned, which is especially helpful for improving performance when designing workflows.

Integrate HDFS Data into Power Automate Workflows

After configuring the DSN for HDFS, you are ready to integrate HDFS data into your Power Automate workflows. Open Microsoft Power Automate, add a new flow, and name the flow.

In the flow editor, you can add the actions to connect to HDFS, query HDFS using SQL, and write the query results to a CSV document.

Add an Open SQL Connection Action

Add an "Open SQL connection" action (Actions -> Database) and configure the properties.

  • Connection string: DSN=CData HDFS Source

After configuring the action, click Save.

Add an Execute SQL Statement Action

Add an "Execute SQL statement" action (Actions -> Database) and configure the properties.

  • Get connection by: SQL connection variable
  • SQL connection: %SQLConnection% (the variable from the "Open SQL connection" action above)
  • SQL statement: SELECT * FROM Files

After configuring the action, click Save.

Add a Write to CSV File Action

Add a "Write to CSV file" action (Actions -> File) and configure the properties.

  • Variable to write to: %QueryResult% (the variable from the "Execute SQL statement" action above)
  • File path: set to a file on disk
  • Configure Advanced settings as needed.

After configuring the action, click Save.

Add a Close SQL Connection Action

Add a "Close SQL connection" action (Actions -> Database) and configure the properties.

  • SQL Connection: %SQLConnection% (the variable from the "Open SQL connection" action above)

After configuring the action, click Save.

Save & Run the Flow

Once you have configured all the actions for the flow, click the disk icon to save the flow. Click the play icon to run the flow.

Now you have a workflow to move HDFS data into a CSV file.

With the CData ODBC Driver for HDFS, you get live connectivity to HDFS data within your Microsoft Power Automate workflows.

Related Power Automate Articles

This article walks through using the CData ODBC Driver for HDFS with Power Automate Desktop. Check out our other articles for more ways to work with Power Automate (Desktop & Online):