We are proud to share our inclusion in the 2024 Gartner Magic Quadrant for Data Integration Tools. We believe this recognition reflects the differentiated business outcomes CData delivers to our customers.
Get the Report →Access Databricks Data from MySQL in PHP
Connect to Databricks through the standard MySQL libraries in PHP.
You can use the CData SQL Gateway and ODBC Driver for Databricks to access Databricks data from MySQL clients, without needing to perform an ETL or cache data. Follow the steps below to connect to Databricks data in real time through PHP's standard MySQL interfaces, mysqli and PDO_MySQL.
About Databricks Data Integration
Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:
- Access all versions of Databricks from Runtime Versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
- Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
- Secure authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
- Upload data to Databricks using Databricks File System, Azure Blog Storage, and AWS S3 Storage.
While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.
Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.
Getting Started
Connect to Databricks Data
If you have not already done so, provide values for the required connection properties in the data source name (DSN). You can use the built-in Microsoft ODBC Data Source Administrator to configure the DSN. This is also the last step of the driver installation. See the "Getting Started" chapter in the help documentation for a guide to using the Microsoft ODBC Data Source Administrator to create and configure a DSN.
To connect to a Databricks cluster, set the properties as described below.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
- Server: Set to the Server Hostname of your Databricks cluster.
- HTTPPath: Set to the HTTP Path of your Databricks cluster.
- Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
Configure the SQL Gateway
See the SQL Gateway Overview to set up connectivity to Databricks data as a virtual MySQL database. You will configure a MySQL remoting service that listens for MySQL requests from clients. The service can be configured in the SQL Gateway UI.
Connect in PHP
The following examples show how to use object-oriented interfaces to connect and execute queries. Initialize the connection object with the following parameters to connect to the virtual MySQL database:
- Host: Specify the remote host location where the service is running. In this case "localhost" is used for the remote host setting since the service is running on the local machine.
- Username: Specify the username for a user you authorized on the SQL Gateway's Users tab.
- Password: Specify the password for the authorized user account.
- Database Name: Specify the system DSN as the database name.
- Port: Specify the port the service is running on; port 3306 in this example.
mysqli
<?php $mysqli = new mysqli("localhost", "user", "password", "CData Databricks Sys","3306"); ?>
PDO
<?php $pdo = new PDO('mysql:host=localhost;dbname=CData Databricks Sys;port=3306', 'user', 'password'); ?>
Query in PHP
With the connection established, you can then access tables. The following steps walk through the example:
- Query the table; for example, Customers. The results will be stored as an associative array in the $result object.
- Iterate over each row and column, printing the values to display in the PHP page.
- Close the connection.
mysqli
$result = $mysqli->query("SELECT City, CompanyName FROM Customers WHERE Country = 'US'"); while($row = $result->fetch_assoc()) { foreach ($row as $k=>$v) { echo "$k : $v"; echo "<br>"; } } $mysqli->close();
PDO
$result = $pdo->query("SELECT City, CompanyName FROM Customers WHERE Country = 'US'"); while($row = $result->fetch(PDO::FETCH_ASSOC)) { foreach ($row as $k=>$v) { echo "$k : $v"; echo "<br>"; } } $result = null; $pdo = null;