Integrate Theia IDE with Live Lakebase Data via CData Connect AI
Theia IDE is an open-source, cloud and desktop IDE platform that provides a flexible, extensible development environment with built-in AI capabilities. Its AI features support multiple LLM providers and MCP (Model Context Protocol) tool integrations, allowing developers to interact with live external data sources directly from chat-based agents inside the IDE.
By integrating Theia IDE with CData Connect AI through the built-in MCP server, Theia's AI agents gain governed, real-time access to live Lakebase data. This enables developers to list catalogs, explore schemas, and query records from Lakebase data without leaving the editor or writing custom integration code.
This article explains how to configure Lakebase connectivity in Connect AI, generate the required personal access token, register the CData Connect AI MCP Server in Theia IDE, enable AI features with an LLM provider, and verify the integration by querying live Lakebase data from the Theia AI Chat.
Step 1: Configure Lakebase connectivity for Theia IDE
Connectivity to Lakebase from Theia IDE is made possible through Connect AI's Remote MCP Server. To interact with Lakebase data from Theia IDE, start by creating and configuring a Lakebase connection in Connect AI.
- Log into Connect AI, click Sources, and then click Add Connection
- Select Lakebase from the Add Connection panel
- Enter the necessary authentication properties to connect to Lakebase.
To connect to Databricks Lakebase, start by setting the following properties:
- DatabricksInstance: The Databricks instance or server hostname, provided in the format instance-abcdef12-3456-7890-abcd-abcdef123456.database.cloud.databricks.com.
- Server: The host name or IP address of the server hosting the Lakebase database.
- Port (optional): The port of the server hosting the Lakebase database, set to 5432 by default.
- Database (optional): The database to connect to after authenticating to the Lakebase Server, set to the authenticating user's default database by default.
OAuth Client Authentication
To authenticate using OAuth client credentials, you need to configure an OAuth client for your service principal. In short, you need to do the following:
- Create and configure a new service principal
- Assign permissions to the service principal
- Create an OAuth secret for the service principal
For more information, refer to the Setting Up OAuthClient Authentication section in the Help documentation.
OAuth PKCE Authentication
To authenticate using the OAuth code type with PKCE (Proof Key for Code Exchange), set the following properties:
- AuthScheme: OAuthPKCE.
- User: The authenticating user's user ID.
For more information, refer to the Help documentation.
- Click Save & Test
- Navigate to the Permissions tab and update user-based permissions
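The connection properties described in this step can be summarized as a simple mapping. The following is a minimal sketch, not part of Connect AI: the function name lakebase_properties and its parameters are hypothetical and only mirror the property names and defaults listed above (Port defaults to 5432, Database is optional, and the OAuth PKCE scheme adds AuthScheme and User).

```python
from typing import Any, Dict, Optional


def lakebase_properties(
    databricks_instance: str,
    server: str,
    port: int = 5432,                 # default port stated in the article
    database: Optional[str] = None,   # omitted -> the user's default database
    pkce_user: Optional[str] = None,  # set only when using OAuth PKCE
) -> Dict[str, Any]:
    """Collect the Lakebase connection properties into one dictionary.

    Hypothetical helper for illustration; the values are entered in the
    Connect AI UI, not passed to any API by this function.
    """
    props: Dict[str, Any] = {
        "DatabricksInstance": databricks_instance,
        "Server": server,
        "Port": port,
    }
    if database is not None:
        props["Database"] = database
    if pkce_user is not None:
        props["AuthScheme"] = "OAuthPKCE"
        props["User"] = pkce_user
    return props
```

Writing the values down this way before opening the Add Connection panel makes it easier to see which properties are required and which fall back to defaults.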
Add a Personal Access Token
A Personal Access Token (PAT) is used to authenticate the connection to Connect AI from Theia IDE. It is best practice to create a separate PAT for each integration to maintain granular access control.
- Click the gear icon at the top right of the Connect AI app to open Settings
- On the Settings page, go to the Access Tokens section and click Create PAT
- Give the PAT a descriptive name and click Create
- Copy the token when displayed and store it securely. It will not be shown again
With the Lakebase connection configured and a PAT generated, Theia IDE can now connect to Lakebase data through Connect AI.
Step 2: Configure Connect AI MCP in Theia IDE
Next, register the CData Connect AI Remote MCP Server in Theia IDE so that the built-in AI agents can discover and call live data tools through Connect AI.
- Download and install the Theia IDE
- Open Theia IDE and navigate to Settings (or press Ctrl + ,) to open the Settings view
- In the Settings panel, expand AI Features and select MCP
- Click Edit in settings.json to open the configuration file and paste the following JSON:
  {
    "ai-features.mcp.mcpServers": {
      "cdata": {
        "serverUrl": "https://mcp.cloud.cdata.com/mcp",
        "serverAuthToken": "Basic your_base64_encoded_email_PAT",
        "serverAuthTokenHeader": "Authorization"
      }
    }
  }
  Note: Theia IDE will use Basic authentication with Connect AI. Combine your Connect AI user email and the PAT you created earlier in the format email:PAT, base64 encode the combined string, and prefix it with Basic. For example, given user@domain.com:ABC123...XYZ789, the serverAuthToken value becomes something like: Basic dXNlckBkb21haW4uY29tOkFCQzEyMy4uLlhZWjc4OQ==
- Save the settings.json file
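The serverAuthToken value can be generated with a few lines of Python rather than by hand. The sketch below assumes only the email:PAT format and the settings keys shown in this step. The function names build_basic_token and build_mcp_settings are illustrative, and the credentials are the article's placeholders, not real values.

```python
import base64
import json


def build_basic_token(email: str, pat: str) -> str:
    """Return the Basic auth value: 'Basic ' + base64('email:PAT')."""
    raw = f"{email}:{pat}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


def build_mcp_settings(email: str, pat: str, server_name: str = "cdata") -> dict:
    """Build the settings.json fragment for the Connect AI MCP server."""
    return {
        "ai-features.mcp.mcpServers": {
            server_name: {
                "serverUrl": "https://mcp.cloud.cdata.com/mcp",
                "serverAuthToken": build_basic_token(email, pat),
                "serverAuthTokenHeader": "Authorization",
            }
        }
    }


if __name__ == "__main__":
    # Placeholder credentials; substitute your own email and PAT.
    settings = build_mcp_settings("user@domain.com", "ABC123...XYZ789")
    print(json.dumps(settings, indent=2))
```

The printed JSON can be merged into settings.json. Because the token is only base64 encoded, not encrypted, treat the resulting file as a secret and avoid committing it to version control.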
Enable AI and configure an LLM provider
Theia IDE requires AI features to be enabled and at least one LLM provider configured to power the agent's reasoning.
- Return to Settings and under AI Features, select AI Enablement
- Check the Enable AI box to activate Theia's AI capabilities
- Under AI Features, choose your preferred LLM provider (e.g., Anthropic, OpenAI, Google, Hugging Face) and enter your API key
With the MCP server registered and an LLM provider configured, Theia's AI agents are ready to query live Lakebase data through Connect AI.
Step 3: Query live Lakebase data from the Theia AI Chat
With the integration complete, use the Theia AI Chat panel to interact with live Lakebase data.
- Open the AI Chat panel from the right sidebar of the Theia IDE
- At the bottom of the chat, click the Toggle Capabilities Configuration icon (or press Ctrl + Shift + .) to open the capabilities panel
- Under Generic Capabilities, expand MCP and check the cdata server (and any specific tools you want to expose) to make the Connect AI tools available to the agent
- Type @AppTester in the chat input followed by your prompt, for example:
  - List all catalogs in my cdata mcp
  - Show the available schemas and tables for Lakebase
  - Query the top 5 records from a table in Lakebase data
- The agent calls the Connect AI MCP Server and returns live results from Lakebase data
At this point, your Theia IDE communicates with the Connect AI MCP Server and retrieves live Lakebase data through remote MCP directly from the editor.
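For readers curious about what the agent sends when it calls the MCP server, the following sketch builds the kind of JSON-RPC 2.0 messages that the Model Context Protocol defines for discovering and invoking tools (tools/list and tools/call). It only constructs the payloads and does not contact Connect AI. The tool name example_query_tool and its arguments are hypothetical placeholders, not the actual Connect AI tool names, and a real client such as Theia also performs an initialization handshake before sending these requests.

```python
import json
from itertools import count
from typing import Any, Dict, Optional

_request_ids = count(1)  # JSON-RPC requests carry a unique id


def jsonrpc_request(method: str, params: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
    """Build a JSON-RPC 2.0 request object for an MCP method."""
    message: Dict[str, Any] = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": method,
    }
    if params is not None:
        message["params"] = params
    return message


# Ask the server which tools it exposes.
list_tools = jsonrpc_request("tools/list")

# Invoke one tool by name. The name and arguments here are hypothetical.
call_tool = jsonrpc_request(
    "tools/call",
    {"name": "example_query_tool", "arguments": {"query": "SELECT 1"}},
)

if __name__ == "__main__":
    print(json.dumps(list_tools, indent=2))
    print(json.dumps(call_tool, indent=2))
```

In practice the available tool names come back in the tools/list response, which is what the capabilities panel in Theia displays under the cdata server.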
Get CData Connect AI
To access hundreds of SaaS, Big Data, and NoSQL sources directly from your cloud applications, start a free 14-day trial of CData Connect AI. As always, our world-class Support Team is available to assist you with any questions you may have.