We are proud to share our inclusion in the 2024 Gartner Magic Quadrant for Data Integration Tools. We believe this recognition reflects the differentiated business outcomes CData delivers to our customers.
Get the Report →Visualize Databricks Data in TIBCO Spotfire through OData
Integrate Databricks data into dashboards in TIBCO Spotfire.
OData is a major protocol enabling real-time communication among cloud-based, mobile, and other online applications. The CData Connect Server provides Databricks data to OData consumers like TIBCO Spotfire. This article shows how to use the Connect Server and Spotfire's built-in support for OData to access Databricks data in real time.
About Databricks Data Integration
Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:
- Access all versions of Databricks from Runtime Versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
- Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
- Secure authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
- Upload data to Databricks using Databricks File System, Azure Blog Storage, and AWS S3 Storage.
While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.
Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.
Getting Started
Configuring Connect Server
To work with live Databricks data in TIBCO Spotfire, we need to connect to Databricks from Connect Server, provide user access to the new virtual database, and create OData endpoints for the Databricks data.
Add a Connect Server User
Create a User to connect to Databricks from TIBCO Spotfire through Connect Server.
- Click Users -> Add
- Configure a User
- Click Save Changes and make note of the Authtoken for the new user
Connect to Databricks from Connect Server
CData Connect Server uses a straightforward, point-and-click interface to connect to data sources and generate APIs.
- Open Connect Server and click Connections
- Select "Databricks" from Available Data Sources
- Enter the necessary authentication properties to connect to Databricks.
To connect to a Databricks cluster, set the properties as described below.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
- Server: Set to the Server Hostname of your Databricks cluster.
- HTTPPath: Set to the HTTP Path of your Databricks cluster.
- Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
- Click Save Changes
- Click Privileges -> Add and add the new user (or an existing user) with the appropriate permissions (SELECT is all that is required for Reveal).
Add Databricks OData Endpoints in Connect Server
After connecting to Databricks, create OData Endpoints for the desired table(s).
- Click OData -> Tables -> Add Tables
- Select the Databricks database
- Select the table(s) you wish to work with and click Next
- (Optional) Edit the resource to select specific fields and more
- Save the settings
(Optional) Configure Cross-Origin Resource Sharing (CORS)
When accessing and connecting to multiple domains from an application such as Ajax, there is a possibility of violating the limitations of cross-site scripting. In that case, configure the CORS settings in OData -> Settings.
- Enable cross-origin resource sharing (CORS): ON
- Allow all domains without '*': ON
- Access-Control-Allow-Methods: GET, PUT, POST, OPTIONS
- Access-Control-Allow-Headers: Authorization
Save the changes to the settings.
Create Data Visualizations on External Databricks Data
- Open Spotfire and click Data -> Add data...
- Then, click "Connect to" -> OData -> New Connection. In the OData Connection dialog, enter the following information:
- Service URL: Enter the Connect Server's OData endpoint. For example:
http://localhost:8080/odata.rsc
- Authentication Method: Select Username and Password.
- Username: Enter the username of a Connect Server user. You can create API users on the Security tab of the administration console.
- Password: Enter the authtoken of a Connect Server user.
- Service URL: Enter the Connect Server's OData endpoint. For example:
- Select the tables and columns you want to add to the dashboard. This example uses Customers.
-
If you want to work with the live data, click the Keep Data Table External option. This option enables your dashboards to reflect changes to the data in real time.
If you want to load the data into memory and process the data locally, click the Import Data Table option. This option is better for offline use or if a slow network connection is making your dashboard less interactive.
- After you have selected a table, Spotfire uses the data to detect number, time, and other categories. You are now on your way to creating new visualizations for analytics, reporting, and more.
Free Trial & More Information
If you are interested in connecting to your Databricks data (or data from any of our other supported data sources) from TIBCO Spotfire, sign up for a free trial of CData Connect Server today! For more information on Connect Server and to see what other data sources we support, refer to our CData Connect page.