How to Connect Databricks Data to Gemini Enterprise via CData Connect AI

Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Connect CData Connect AI Remote MCP to Gemini Enterprise to securely read and take actions on live Databricks data in real time using natural language.

Gemini Enterprise is Google's enterprise AI assistant, available as part of Google Workspace. With native support for Custom MCP Server data stores, Gemini Enterprise can be extended to query and act on live enterprise data via the Model Context Protocol (MCP). When combined with CData Connect AI Remote MCP, Gemini Enterprise can interact with Databricks data in real time using natural language — without data replication or custom integration logic.

CData Connect AI offers a dedicated cloud-to-cloud interface for connecting to Databricks data via a single managed MCP endpoint. The CData Connect AI Remote MCP Server enables secure communication between Gemini Enterprise and Databricks, allowing users to ask questions and take actions on live Databricks data through natural language prompts.

This article explains how to connect Gemini Enterprise to live Databricks data through CData Connect AI by creating a Custom MCP Server data store — giving users access to Databricks data directly from the Gemini Enterprise chat interface.

About Databricks Data Integration

Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:

  • Access all versions of Databricks from Runtime Versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
  • Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
  • Secure authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
  • Upload data to Databricks using Databricks File System, Azure Blog Storage, and AWS S3 Storage.

While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.

Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.


Getting Started


Prerequisites

  1. A CData Connect AI account with at least one active connection (e.g., Databricks)
  2. A Gemini Enterprise account (trial available)
  3. A Google Cloud project with billing enabled
  4. The Google Cloud CLI installed and configured
  5. In your Google Cloud account:
    • Override the organization policy for Custom MCP data stores (learn more).
    • Grant the Discovery Engine Editor role to the administrator (learn more).

Step 1: Configure Databricks connectivity for Gemini Enterprise

Connectivity to Databricks from Gemini Enterprise is made possible through CData Connect AI Remote MCP. To interact with Databricks data from Gemini Enterprise, start by creating and configuring a Databricks connection in CData Connect AI.

  1. Log into Connect AI, click Sources, and then click Add Connection
  2. Select "Databricks" from the Add Connection panel
  3. Enter the necessary authentication properties to connect to Databricks.

    To connect to a Databricks cluster, set the properties as described below.

    Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.

    • Server: Set to the Server Hostname of your Databricks cluster.
    • HTTPPath: Set to the HTTP Path of your Databricks cluster.
    • Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
  4. Click Save & Test
  5. Navigate to the Permissions tab in the Add Databricks Connection page and update the User-based permissions.

Create an OAuth App in CData Connect AI

Gemini Enterprise uses OAuth 2.0 Authorization Code with PKCE to authenticate users against the CData Connect AI MCP Server. This requires creating a user-based OAuth App in your CData Connect AI account.

  1. Click the Gear icon () in the top-right corner of Connect AI to open Settings.
  2. Navigate to OAuth Apps and click + Create App. The Create OAuth App dialog appears.
  3. Enter the following settings:
    • Name — Enter a descriptive name (e.g., GeminiEnterpriseOAuth).
    • Authentication Flow — Select User-based (Authorization Code).
    • Callback URL — Enter https://vertexaisearch.cloud.google.com/oauth-redirect.
  4. Click Confirm. CData Connect AI creates the OAuth App and generates a Client ID and Client Secret.
  5. Copy both the Client ID and Client Secret values. You will need them in Step 5.

With the connection configured and an OAuth App created, we are ready to create the custom MCP server data store in Gemini Enterprise.

Step 2: Create the custom MCP server data store

  1. Open Gemini Enterprise and navigate to the Data stores screen.
  2. Click Create data store.
  3. On the Select a data source page, enter Custom MCP Server in the Search sources field. The Custom MCP Server card displays.
  4. Click Add MCP server. The MCP Server Configuration page displays.
  5. In the Authentication settings section, enter values in the following required fields:
    • MCP Server URL: https://mcp.cloud.cdata.com/mcp
    • Authorization URL: https://cloud-login.cdata.com/authorize
    • Token URL: https://cloud-login.cdata.com/oauth/token
    • Client ID and Client Secret: From the OAuth App created in Step 1
  6. Click Login, and complete the sign-in.
  7. Click Continue, and the Advanced options section opens.
  8. In the MCP Server Description field, enter a description that helps Gemini Enterprise understand what the server does and when to use it. For more information, see Write effective MCP server descriptions and instructions.

  9. Click Continue.

  10. In the Configure your data connector section, select the Location of your data connector from the Multi-region field list.

  11. In Your data connector name, enter a name for your data store.

  12. Click Create. Gemini Enterprise creates your data store and displays your data stores on the Data Stores page.

    Note: By default, no tools or actions from your custom MCP servers are enabled. You must enable the tools or actions.

Step 3: Enable actions

After creating the custom MCP server data store, you must enable at least one tool or action before it can be used in Gemini Enterprise.

  1. Go to your custom MCP server data store.
  2. Open the Actions tab and select Reload custom actions to reauthenticate.

    Note: This action performs a tools/list call on the MCP server to retrieve available tools, which are then displayed on the screen.

  3. Select the actions to enable.
  4. Click Enable actions.

Step 4: Connect the MCP server data store to a Gemini Enterprise app

After creating the custom MCP server data store and enabling actions, you must connect the data store to a Gemini Enterprise app before it can be used.

  1. In the Google Cloud console, go to the Gemini Enterprise page.
  2. From the navigation menu, click Apps.
  3. Select the Gemini Enterprise app where you want to connect your data store.
  4. From the navigation menu of the app, click Connected data sources.
  5. Click Add existing data stores and select your data store.
  6. Click Connect.

Step 5: Query live Databricks data with natural language

With the data store connected, Gemini Enterprise users can interact with live Databricks data using natural language from the Gemini Enterprise web application. Each user authenticates with their own Connect AI credentials via the OAuth flow on first use.

  1. Open Gemini Enterprise, click Connections and authorize CData Connect AI.
  2. Ask natural language questions about your Databricks data:
    • "Show me all Databricks data from the last 30 days"
    • "What are the top records in Databricks data by revenue?"
    • "List all active Databricks data and their current status"
    • "Summarize Databricks data activity for this quarter"
  3. The agent automatically discovers available connections in Connect AI, identifies the most relevant Databricks connection, generates SQL, and returns results — all without requiring the user to write queries or understand the underlying data structure.

Get CData Connect AI

To get live data access to 350+ SaaS, Big Data, and NoSQL sources directly from Gemini Enterprise and other AI platforms, try CData Connect AI today!

Ready to get started?

Learn more about CData Connect AI or sign up for free trial access:

Free Trial