How to Connect to Live Google Cloud Storage Data from Sourcegraph Amp (via CData Connect AI)

Somya Sharma
Somya Sharma
Technical Marketing Engineer
Integrate Sourcegraph Amp with CData Connect AI to query and manage live Google Cloud Storage data securely in real time.

Sourcegraph Amp is a modern AI agent environment designed for building intelligent, production-ready assistants capable of stateful reasoning, automatic context management, and native MCP (Model Context Protocol) integration. When combined with CData Connect AI, you can leverage Amp to create agents that interact with your Google Cloud Storage data in real time using natural language or SQL-based queries.

CData Connect AI provides a secure, cloud-to-cloud interface for accessing Google Cloud Storage data. Through the Connect AI Remote MCP Server, Amp connects directly to Google Cloud Storage, enabling live data queries and operations without replication. With optimized pushdown capabilities, CData Connect AI executes SQL operations including filters, aggregations, and joins directly in Google Cloud Storage for fast, real-time performance.

In this article, we demonstrate how to configure the Amp agent to conversationally explore your Google Cloud Storage data using natural language or SQL. With Connect AI, you can easily build agents that have secure, live access to Google Cloud Storage along with hundreds of other enterprise data sources.

Prerequisites

  1. An active CData Connect AI
  2. The Sourcegraph Amp VS Code extension or Amp CLI installed
  3. Node.js v20 or higher installed
  4. Access to Google Cloud Storage

Step 1: Configure Google Cloud Storage Connectivity for Sourcegraph Amp

Connectivity to Google Cloud Storage from Amp is made possible through CData Connect AI Remote MCP. To interact with Google Cloud Storage data from Amp, we start by creating and configuring a Google Cloud Storage connection in CData Connect AI.

  1. Log into Connect AI, click Sources, and then click Add Connection
  2. Adding a Connection
  3. Select "Google Cloud Storage" from the Add Connection panel
  4. Selecting a data source
  5. Enter the necessary authentication properties to connect to Google Cloud Storage.

    Authenticate with a User Account

    You can connect without setting any connection properties for your user credentials. After setting InitiateOAuth to GETANDREFRESH, you are ready to connect.

    When you connect, the Google Cloud Storage OAuth endpoint opens in your default browser. Log in and grant permissions, then the OAuth process completes

    Authenticate with a Service Account

    Service accounts have silent authentication, without user authentication in the browser. You can also use a service account to delegate enterprise-wide access scopes.

    You need to create an OAuth application in this flow. See the Help documentation for more information. After setting the following connection properties, you are ready to connect:

    • InitiateOAuth: Set this to GETANDREFRESH.
    • OAuthJWTCertType: Set this to "PFXFILE".
    • OAuthJWTCert: Set this to the path to the .p12 file you generated.
    • OAuthJWTCertPassword: Set this to the password of the .p12 file.
    • OAuthJWTCertSubject: Set this to "*" to pick the first certificate in the certificate store.
    • OAuthJWTIssuer: In the service accounts section, click Manage Service Accounts and set this field to the email address displayed in the service account Id field.
    • OAuthJWTSubject: Set this to your enterprise Id if your subject type is set to "enterprise" or your app user Id if your subject type is set to "user".
    • ProjectId: Set this to the Id of the project you want to connect to.

    The OAuth flow for a service account then completes.

    Configuring a connection (Salesforce is shown)
  6. Click Save & Test

Step 2: Set Up Amp for CData Connect AI

Copy the MCP Endpoint

Amp communicates with Connect AI through the hosted MCP endpoint:

https://mcp.cloud.cdata.com/mcp

This endpoint provides secure, cloud-to-cloud communication between Amp and your Connect AI workspace.

Generate Base64 Credentials

To authenticate Amp with Connect AI, generate your Base64-encoded credentials. For example, in PowerShell:

{Convert}::ToBase64String{(Text.Encoding)}::ASCII.GetBytes("[email protected]:yourPAT")

Replace [email protected] with your Connect AI email and yourPAT with your Personal Access Token.

Register the MCP Server in Amp

Once you have your Base64 string, register the CData Connect AI MCP server with Amp using the following command:

amp mcp add cdata-connect-ai -- npx -y mcp-remote@latest https://mcp.cloud.cdata.com/mcp --header "Authorization: Basic "

This adds your Connect AI configuration to Amp's settings file, enabling communication with CData Connect AI.

Verify Your Connection and Explore Data

  1. Create a New Thread
  2. Start a new Amp session to begin interacting with your data:

    amp thread new
    Starting a new Amp thread

  3. Enter the Interactive Chat
  4. Connect to the new thread using:

    amp
    .

    Opening interactive chat in Amp

  5. Verify MCP Servers
  6. Inside the Amp shell, check your registered MCP servers:

    list mcp
    .

    Listing MCP servers

  7. Confirm Your Data Source
  8. Confirm that your connected Google Cloud Storage data appears as a catalog by running

    getCatalogs
    .

    Verifying connected data sources

Step 3: Build Intelligent Agents with Live Google Cloud Storage Data Access

With your Amp application configured and connected to CData Connect AI, you can now build sophisticated agents that interact with your Google Cloud Storage data using natural language. The MCP integration provides your agents with powerful data access capabilities.

Available MCP Tools for your Agent

Your Amp application has access to the following CData Connect AI MCP tools:

  • getCatalogs: Lists all data source catalogs (e.g., GoogleCloudStorage1)
  • getSchemas: Returns database schemas within the connected catalog
  • getTables: Lists all tables and views available under a given schema
  • getColumns: Returns column definitions for a specific table or view
  • queryData: Executes SQL queries (SELECT, INSERT, UPDATE, DELETE)
  • getProcedures: Lists stored procedures or API endpoints
  • getProcedureParameters: Returns metadata for stored procedure parameters
  • executeProcedure: Invokes stored procedures (e.g., Google Cloud Storage actions)

Key Features of Amp

Amp provides several production-ready capabilities that make it ideal for building intelligent, data-aware AI agents:

  • Automatic Context Management: Amp maintains and recalls conversational context automatically, enabling seamless multi-turn interactions without manual state tracking.
  • Stateful Conversations: Preserve context and memory across multiple queries to create natural, human-like conversations.
  • Native MCP Integration: Amp natively supports the Model Context Protocol (MCP), allowing secure, real-time access to live data from CData Connect AI and other MCP-compatible servers.
  • Tool-Oriented Architecture: Tools are treated as first-class components with managed invocation, input validation, and error handling.
  • Efficient Context Handling: Amp optimizes prompts dynamically, ensuring relevant information is preserved even when approaching model token limits.
  • Cross-Source Querying: Combine and query multiple connected data sources within a single conversational workflow.
  • Fine-Grained Permission Controls: Define and enforce tool access levels to maintain data governance and secure integrations.
  • Developer-Friendly CLI and SDK: Manage MCP connections, configure agents, and test workflows easily from the Amp CLI or VS Code extension.

Example Use Cases

Here are some examples of what your Amp agents can do with live data access through CData Connect AI:

  • Data Analysis Agent: Identify trends and anomalies in Google Cloud Storage data.
  • Report Generation Agent: Generate reports from natural language prompts.
  • Interactive Chatbot: Explain insights conversationally using live data.
  • Data Quality Agent: Monitor and flag real-time data inconsistencies.
  • Automated Workflow Agent: Trigger alerts based on defined data conditions.

Testing Your Agent

Once your agent is running, you can interact with it through natural language queries. For example:

  • "Show me all new leads from the past 30 days."
  • "What are the top-performing campaigns this quarter?"
  • "Analyze revenue growth and highlight anomalies."
  • "Generate a summary report of current opportunities."
  • "Find all records where status is pending approval."

Get CData Connect AI

To get live data access to hundreds of SaaS, Big Data, and NoSQL sources directly from your Amp agent environment, try CData Connect AI today!

Ready to get started?

Learn more about CData Connect AI or sign up for free trial access:

Free Trial