Create Datasets from Azure Data Lake Storage in Domo Workbench and Build Visualizations of Azure Data Lake Storage Data in Domo

Ready to get started?

Download for a free trial:

Download Now

Learn more:

Azure Data Lake Storage ODBC Driver

The Azure Data Lake Storage ODBC Driver is a powerful tool that allows you to connect with live data from Azure Data Lake Storage, directly from any applications that support ODBC connectivity.

Access Azure Data Lake Storage data like you would a database - read, write, and update Azure Data Lake Storage ADLSData, etc. through a standard ODBC Driver interface.



Use the CData ODBC Driver for Azure Data Lake Storage to create datasets from Azure Data Lake Storage data in Domo Workbench and then build visualizations in the Domo service.

Domo helps you manage, analyze, and share data across your entire organization, enabling decision makers to identify and act on strategic opportunities. Domo Workbench provides a secure, client-side solution for uploading your on-premise data to Domo. The CData ODBC Driver for Azure Data Lake Storage links Domo Workbench to operational Azure Data Lake Storage data. You can build datasets from Azure Data Lake Storage data using standard SQL queries in Workbench and then create real-time visualizations of Azure Data Lake Storage data in the Domo service.

The CData ODBC Drivers offer unmatched performance for interacting with live Azure Data Lake Storage data in Domo due to optimized data processing built into the driver. When you issue complex SQL queries from Domo to Azure Data Lake Storage, the driver pushes supported SQL operations, like filters and aggregations, directly to Azure Data Lake Storage and utilizes the embedded SQL Engine to process unsupported operations (often SQL functions and JOIN operations) client-side. With built-in dynamic metadata querying, you can visualize and analyze Azure Data Lake Storage data using native Domo data types.

Connect to Azure Data Lake Storage as an ODBC Data Source

If you have not already, first specify connection properties in an ODBC DSN (data source name). This is the last step of the driver installation. You can use the Microsoft ODBC Data Source Administrator to create and configure ODBC DSNs.

Authenticating to a Gen 1 DataLakeStore Account

Gen 1 uses OAuth 2.0 in Azure AD for authentication.

For this, an Active Directory web application is required. You can create one as follows:

  1. Sign in to your Azure Account through the .
  2. Select "Azure Active Directory".
  3. Select "App registrations".
  4. Select "New application registration".
  5. Provide a name and URL for the application. Select Web app for the type of application you want to create.
  6. Select "Required permissions" and change the required permissions for this app. At a minimum, "Azure Data Lake" and "Windows Azure Service Management API" are required.
  7. Select "Key" and generate a new key. Add a description, a duration, and take note of the generated key. You won't be able to see it again.

To authenticate against a Gen 1 DataLakeStore account, the following properties are required:

  • Schema: Set this to ADLSGen1.
  • Account: Set this to the name of the account.
  • OAuthClientId: Set this to the application Id of the app you created.
  • OAuthClientSecret: Set this to the key generated for the app you created.
  • TenantId: Set this to the tenant Id. See the property for more information on how to acquire this.
  • Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.

Authenticating to a Gen 2 DataLakeStore Account

To authenticate against a Gen 2 DataLakeStore account, the following properties are required:

  • Schema: Set this to ADLSGen2.
  • Account: Set this to the name of the account.
  • FileSystem: Set this to the file system which will be used for this account.
  • AccessKey: Set this to the access key which will be used to authenticate the calls to the API. See the property for more information on how to acquire this.
  • Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.

When you configure the DSN, you may also want to set the Max Rows connection property. This will limit the number of rows returned, which is especially helpful for improving performance when designing reports and visualizations.

After creating a DSN, you will need to create a dataset for Azure Data Lake Storage in Domo Workbench using the Azure Data Lake Storage DSN and build a visualization in the Domo service based on the dataset.

Build a Dataset for Azure Data Lake Storage Data

You can follow the steps below to build a dataset based on a table in Azure Data Lake Storage in Domo Workbench using the CData ODBC Driver for Azure Data Lake Storage.

  1. Open Domo Workbench and, if you have not already, add your Domo service server to Workbench. In the Accounts submenu, click Add New, type in the server address (i.e., domain.domo.com) and click through the wizard to authenticate.
  2. In the DataSet Jobs submenu, click Add New.
  3. Name the dataset job (i.e., ODBC Azure Data Lake Storage Resources), select ODBC Connection Provider as the transport method, and click through the wizard.
  4. In the newly created DataSet Job, navigate to Source and click to configure the settings.
  5. Select System DSN for the Connection Type.
  6. Select the previously configured DSN (CData ADLS Sys) for the System DSN.
  7. Click to validate the configuration.
  8. Below the settings, set the Query to a SQL query: SELECT * FROM Resources NOTE: By connecting to Azure Data Lake Storage data using an ODBC driver, you simply need to know SQL in order to get your data, circumventing the need to know Azure Data Lake Storage-specific APIs or protocols.
  9. Click preview.
  10. Check over the generated schema, add any transformations, then save and run the dataset job.

With the dataset job run, the dataset will be accessible from the Domo service, allowing you to build visualizations, reports, and more based on Azure Data Lake Storage data.

Create Data Visualizations

With the DataSet Job saved and run in Domo Workbench, we are ready to build visualizations of the Azure Data Lake Storage data in the Domo service.

  1. Navigate to the Data Center.
  2. In the data warehouse, select the ODBC data source and drill down to our new dataset.
  3. With the dataset selected, choose to create a visualization.
  4. In the new card:
    • Drag a Dimension to the X Value.
    • Drag a Measure to the Y Value.
    • Choose a Visualization.

With the CData ODBC Driver for Azure Data Lake Storage, you can build custom datasets based on Azure Data Lake Storage data using only SQL in Domo Workbench and then build and share visualizations and reports through the Domo service.