How to Easily Query Databricks Data in OpenOffice Base



Connect to Databricks data in OpenOffice Base using the CData ODBC Driver for Databricks and generate reports with live data insights.

Apache OpenOffice Base is a versatile, free database tool that makes connecting and managing data easy. With CData ODBC Driver for Databricks, you can seamlessly integrate and query live Databricks data, bringing real-time insights directly into OpenOffice.

This guide simplifies the setup, showing you how to connect Databricks to OpenOffice Base so you can easily generate reports, analyze trends, and make informed decisions right within your familiar workspace.

About Databricks Data Integration

Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:

  • Access all versions of Databricks from Runtime Versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
  • Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
  • Secure authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
  • Upload data to Databricks using Databricks File System, Azure Blog Storage, and AWS S3 Storage.

While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.

Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.


Getting Started


Overview

Here is an overview of the steps:

  1. CONFIGURE: Configure the DSN for Databricks data in the CData ODBC Driver for Databricks, using the required connection properties.
  2. CONNECT: Set up the ODBC connection in OpenOffice Base using the configured DSN.
  3. IMPORT: Once connected, review the available metadata and tables.
  4. CREATE: Create a report based on the queried Databricks data data.

Configure the Databricks DSN Using the CData ODBC Driver for Databricks

To start, configure the DSN (Data Source Name) for Databricks data in your system using the CData ODBC Driver for Databricks. Download and install a 30-day free trial with all the features from here.

Once installed, launch the ODBC Data Source Administrator:

  • On Windows: Search for ODBC Data Source Administrator in the Start menu and open the application.
  • On Mac: Open Applications, go to Utilities, and select ODBC Manager.
  • On Linux: Use the command line to launch ODBC Data Source Administrator or use unixODBC if installed.

Once launched, double-click on the CData Databricks data Source and enter the required values to establish a connection:

To connect to a Databricks cluster, set the properties as described below.

Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.

  • Server: Set to the Server Hostname of your Databricks cluster.
  • HTTPPath: Set to the HTTP Path of your Databricks cluster.
  • Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).

Setup an ODBC Connection in OpenOffice Base

Once you've set up the DSN, it's time to connect to it in OpenOffice Base and start querying data:

  • Launch OpenOffice Base and select Database from the home screen.
  • In the Database Wizard, choose ODBC from the 'Connect to an existing database' option and click Next.
  • Click Browse to locate and select the DSN you created, then click OK.
  • Enter the username associated with the DSN, click Test Connection to verify, and then click Next.
  • Finally, click Finish to save the new database file to your desired directory. This will connect OpenOffice Base to your live Databricks data data, ready for querying and analysis.

Your connection is now established — ready to query and analyze data seamlessly within OpenOffice Base.


Review the Metadata and Tables

After the database file is created and the connection is established, the table list will automatically display all available Databricks data objects.

  • Review the Databricks data Object List: In the left pane, click on Tables to view the Databricks data objects now available within OpenOffice Base.
  • View Object Data: Click on any object to view its contents. The Databricks data data will display directly within OpenOffice Base, allowing you to review records and fields with ease.

That's it! You have now successfully established a connection to Databricks data in OpenOffice Base, with data readily available for analysis and reporting.


Create a Report

Now, let's generate a report based on your Databricks data data.

  • Open your database and go to the Reports tab, then select Create Report Using Wizard.
  • In the Report Wizard, choose the Databricks data table from the Table or Query section. Select the columns from Available Fields that you want to include in your report and move them to the Fields in Report section using the arrows.
  • After configuring the fields, click Finish to generate your report.
  • Analyze, adjust, save, and publish the report as needed.

Your report is now created, drawing live data directly from Databricks data, and ready for analysis.


Try CData Free for 30 Days

Experience the power of live Databricks data data at your fingertips and seamlessly integrate it into OpenOffice Base. Start your free trial today!

Ready to get started?

Download a free trial of the Databricks ODBC Driver to get started:

 Download Now

Learn more:

Databricks Icon Databricks ODBC Driver

The Databricks ODBC Driver is a powerful tool that allows you to connect with live data from Databricks, directly from any applications that support ODBC connectivity.

Access Databricks data like you would a database - read, write, and update through a standard ODBC Driver interface.