Build Databricks-Connected Dashboards in Redash

Ready to get started?

Learn more or sign up for a free trial:

CData Connect



Use CData Connect Cloud to create a virtual MySQL Database for Databricks data and build visualizations and dashbaords from Databricks data in Redash.

Redash lets you connect and query your data sources, build dashboards to visualize data and share them with your company. When paired with CData Connect Cloud, you get instant, cloud-to-cloud access to Databricks data for visualizations, dashboards, and more. This article shows how to create a virtual database for Databricks and build visualizations from Databricks data in Redash.

CData Connect Cloud provides a pure MySQL, cloud-to-cloud interface for Databricks, allowing you to easily build visualizations from Databricks data in Redash. As you build visualizations, Redash generates SQL queries to gather data. Using optimized data processing out of the box, CData Connect Cloud pushes all supported SQL operations (filters, JOINs, etc) directly to Databricks, leveraging server-side processing to quickly return Databricks data.

Create a Virtual MySQL Database for Databricks Data

CData Connect Cloud uses a straightforward, point-and-click interface to connect to data sources and generate APIs.

  1. Log into Connect Cloud and click Databases.
  2. Select "Databricks" from Available Data Sources.
  3. Enter the necessary authentication properties to connect to Databricks.

    To connect to a Databricks cluster, set the properties as described below.

    Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.

    • Server: Set to the Server Hostname of your Databricks cluster.
    • HTTPPath: Set to the HTTP Path of your Databricks cluster.
    • Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
  4. Click Test Database.
  5. Click Privileges -> Add and add the new user (or an existing user) with the appropriate permissions.

With the virtual database created, you are ready to build visualizations in Tableau Online.

Visualize Databricks Data in Tableau Online

The steps below outline creating a new data source in Redash based on the virtual Databricks database in Connect Cloud and building a simple visualization from the data.

Create a New Data Source

  1. Log into Redash, click on your profile and click "Data Sources"
  2. Click the " New Data Source" button
  3. Select "MySQL (Amazon RDS)" as the Data Source Type (CData Connect uses SSL, which the standard MySQL connection in Redash does not support)
  4. On the configuration tab, set the following properties:
    • Name: Name the data source (e.g. Databricks (CData Connect))
    • Host: The full URL to your CData Connect instance (e.g. https://myinstance.cdatacloud.net)
    • Port: The port of the CData Connect MySQL endpoint (e.g. 3306)
    • User: A CData Connect user
    • Password: The password for the above user
    • Database name: The name of the virtual database for Databricks (e.g. Databricks1)
    • Click the checkbox to Use SSQL
  5. Click Create
  6. Click the "Test Connection" button to ensure you have configured the connection properly

With the new Data Source created, we are ready to visualize our Databricks data.

Create a Databricks Data Visualization

  1. Click Create -> New Query
  2. Select the newly created Data Source (you can explore the data structure in the New Query wizard)
  3. Write a SQL statement to retrieve the data, for example:
    SELECT City, CompanyName FROM Customers WHERE Country = 'US'
  4. Click the "Execute" button to load Databricks data into Redash via CData Connect
  5. Use the Visualization Editor to create and analyze graphs from Databricks data
  6. You can schedule the query to refresh and update the visualization periodically

SQL Access to Databricks Data from Cloud Applications

At this point, you have a direct, cloud-to-cloud connection to Databricks data from Redash. You can create new visualizations, build dashboards, and more. For more information on gaining SQL access to data from more than 100 SaaS, Big Data, and NoSQL sources from cloud applications like Redash, refer to our Connect Cloud page.