Build AI Agents with Amazon Athena Data in IBM watsonx Orchestrate using CData Connect AI

Mohsin Turki
Mohsin Turki
Technical Marketing Engineer
Learn how to build intelligent agents in IBM watsonx Orchestrate that can securely access and act on live Amazon Athena data through CData Connect AI.

IBM watsonx Orchestrate is a no-code and pro-code AI agent platform that lets you rapidly build and deploy intelligent automation workflows. You can design collaborative agents capable of executing real business tasks that connect seamlessly to large language models (LLMs) and enterprise systems.

Agents in watsonx Orchestrate can reason, plan, and integrate with APIs and tools to deliver automated workflows that augment human teams.

CData Connect AI integrates IBM watsonx Orchestrate with over 350+ enterprise data sources and cloud services. With Connect AI, agents can query and act on live Amazon Athena data in real time — without replication or data movement. Connect AI provides governed, secure access using the Model Context Protocol (MCP) while keeping your data within its original system.

Together, IBM watsonx Orchestrate and CData Connect AI enable you to:

  • Build AI agents that interact directly with governed enterprise data through CData's MCP server.
  • Manage authentication securely — credentials and tokens remain encrypted and private.
  • Query live Amazon Athena data without copying or replicating it.
  • Maintain governance and compliance using CData's unified connectivity platform.

In this article, you will learn how to connect Amazon Athena to CData Connect AI, import Remote MCP tools into IBM watsonx Orchestrate, and build secure, real-time agent workflows using live Amazon Athena data.

Prerequisites

  1. A CData Connect AI account – Sign up for a free trial or log in to your existing account.
  2. An IBM watsonx Orchestrate account – Start a 30-day free trial.
  3. A Amazon Athena account with valid credentials.
  4. Python 3.x and pip installed for using the IBM watsonx Orchestrate Agent Development Kit (ADK) to manage or import MCP toolkits.

Overview

Here is a quick overview of the steps covered in this article:

  1. Connect: Add a connection to your Amazon Athena in CData Connect AI and verify that the connection is successful.
  2. Build: Create an AI agent in IBM watsonx Orchestrate by authenticating the connection to Amazon Athena from Connect AI and importing the MCP tools using the ADK.
  3. Deploy: Deploy and test your AI agent in watsonx Orchestrate to confirm real-time interaction with live Amazon Athena data through CData Connect AI.

About Amazon Athena Data Integration

CData provides the easiest way to access and integrate live data from Amazon Athena. Customers use CData connectivity to:

  • Authenticate securely using a variety of methods, including IAM credentials, access keys, and Instance Profiles, catering to diverse security needs and simplifying the authentication process.
  • Streamline their setup and quickly resolve issue with detailed error messaging.
  • Enhance performance and minimize strain on client resources with server-side query execution.

Users frequently integrate Athena with analytics tools like Tableau, Power BI, and Excel for in-depth analytics from their preferred tools.

To learn more about unique Amazon Athena use cases with CData, check out our blog post: https://www.cdata.com/blog/amazon-athena-use-cases.


Getting Started


Step 1: Configure Amazon Athena Connectivity for IBM watsonx

Before connecting IBM watsonx Orchestrate to Amazon Athena, you must first create a connection to Amazon Athena in CData Connect AI. This connection enables the Remote MCP server to access live Amazon Athena data through a secure, governed endpoint.

Note: If you have already added a Amazon Athena connection in CData Connect AI, you can skip this step and continue to the next section.

1.1 Add a Connection in CData Connect AI

  1. Log in to CData Connect AI, click Sources in the left panel, and then click Add Connection on top-right.
  2. Search and select Amazon Athena from the Add Connection panel.
  3. Enter the required authentication properties to connect to Amazon Athena.

    Authenticating to Amazon Athena

    To authorize Amazon Athena requests, provide the credentials for an administrator account or for an IAM user with custom permissions: Set AccessKey to the access key Id. Set SecretKey to the secret access key.

    Note: Though you can connect as the AWS account administrator, it is recommended to use IAM user credentials to access AWS services.

    Obtaining the Access Key

    To obtain the credentials for an IAM user, follow the steps below:

    1. Sign into the IAM console.
    2. In the navigation pane, select Users.
    3. To create or manage the access keys for a user, select the user and then select the Security Credentials tab.

    To obtain the credentials for your AWS root account, follow the steps below:

    1. Sign into the AWS Management console with the credentials for your root account.
    2. Select your account name or number and select My Security Credentials in the menu that is displayed.
    3. Click Continue to Security Credentials and expand the Access Keys section to manage or create root account access keys.

    Authenticating from an EC2 Instance

    If you are using the CData Data Provider for Amazon Athena 2018 from an EC2 Instance and have an IAM Role assigned to the instance, you can use the IAM Role to authenticate. To do so, set UseEC2Roles to true and leave AccessKey and SecretKey empty. The CData Data Provider for Amazon Athena 2018 will automatically obtain your IAM Role credentials and authenticate with them.

    Authenticating as an AWS Role

    In many situations it may be preferable to use an IAM role for authentication instead of the direct security credentials of an AWS root user. An AWS role may be used instead by specifying the RoleARN. This will cause the CData Data Provider for Amazon Athena 2018 to attempt to retrieve credentials for the specified role. If you are connecting to AWS (instead of already being connected such as on an EC2 instance), you must additionally specify the AccessKey and SecretKey of an IAM user to assume the role for. Roles may not be used when specifying the AccessKey and SecretKey of an AWS root user.

    Authenticating with MFA

    For users and roles that require Multi-factor Authentication, specify the MFASerialNumber and MFAToken connection properties. This will cause the CData Data Provider for Amazon Athena 2018 to submit the MFA credentials in a request to retrieve temporary authentication credentials. Note that the duration of the temporary credentials may be controlled via the TemporaryTokenDuration (default 3600 seconds).

    Connecting to Amazon Athena

    In addition to the AccessKey and SecretKey properties, specify Database, S3StagingDirectory and Region. Set Region to the region where your Amazon Athena data is hosted. Set S3StagingDirectory to a folder in S3 where you would like to store the results of queries.

    If Database is not set in the connection, the data provider connects to the default database set in Amazon Athena.

  4. Click Save & Test to validate the connection.
  5. Navigate to the Permissions tab on the Add Amazon Athena Connection page and update user-based permissions as needed.

1.2 Create a Personal Access Token (PAT)

A Personal Access Token (PAT) is used to authenticate IBM watsonx Orchestrate when connecting to CData Connect AI. It is recommended to create a separate PAT for each integration to maintain fine-grained access control.

  1. Click the gear icon () in the upper-right corner of the Connect AI interface to open Settings.
  2. On the Settings page, open the Access Tokens section and click Create PAT.
  3. Provide a descriptive name and click Create.
  4. Note: The PAT is displayed only once at creation. Copy and store it securely for later use when configuring IBM watsonx Orchestrate.

With the connection configured and a PAT generated, you are now ready to connect to live Amazon Athena data from IBM watsonx Orchestrate and begin importing MCP tools through the Agent Development Kit (ADK).


Step 2: Build an AI Agent with Your Amazon Athena data

Once the connection to Amazon Athena is established in CData Connect AI, you can begin building an AI agent in IBM watsonx Orchestrate. This section walks you through adding the connection inside watsonx Orchestrate and configuring authentication for secure access.

2.1 Add a Connection for Amazon Athena in IBM watsonx Orchestrate

This step links your CData Connect AI Remote MCP server to IBM watsonx Orchestrate to manage the connection to Amazon Athena through the Agent Development Kit (ADK). Follow the steps below to configure and authenticate it.

  1. Log in to your IBM watsonx Orchestrate account.
  2. On the home screen, click the three-line Menu icon on the left panel, expand the Manage section, and select Connections.
  3. Click Add new connection.
  4. Under Define connection details, enter a unique Connection ID (for example, mcp-cdata) to identify it later.
  5. Enter the Display name as CData Connect AI MCP, then click Save and continue and confirm the dialog.
  6. Under Configure draft connection, select Authentication type as Key-Value Pair from the drop-down.
  7. Scroll down to Credential type and select the Team credentials radio button.
  8. Enter the following details:
    1. Key: Enter Authorization.
    2. Value: Enter your CData Connect AI registered email and Personal Access Token (PAT) from Step 1, separated by a colon (:), and prefixed with the word Basic. For example: Basic [email protected]:DVolmeSplUCJUdXVFdj.
  9. Click Connect to establish the connection successfully, then click Next.
  10. Under Configure live connection, click Paste draft configuration to copy the settings.
  11. Repeat the same credential configuration as in the draft connection (Team credentials with the same Key-Value Pair).
  12. Click Connect to validate, then click Finish and confirm the dialog to close the connection setup.

2.2 Install the ADK and Requirements

After adding a connection to Amazon Athena in CData Connect AI, install the IBM watsonx Orchestrate Agent Development Kit (ADK) to manage and test the connection.

  1. Install Python 3.x and pip if they are not already installed.
  2. Verify the installation by running the following commands in your terminal:
    python --version and pip --version
  3. Create a project directory and set up a Python virtual environment for the ADK with the following command:
    python -m venv venv
  4. Activate the virtual environment:
    On Windows: venv\Scripts\activate
    On macOS or Linux: source venv/bin/activate
  5. With your environment activated, install the ADK:
    pip install ibm-watsonx-orchestrate
  6. After installation, confirm that the ADK is working by running:
    orchestrate --help
  7. If everything is set up correctly, you will see a list of available ADK CLI commands, similar to the screenshot below.

2.3 Connect the ADK to IBM watsonx Orchestrate

Now that the ADK is installed, connect it to your IBM watsonx Orchestrate SaaS instance so you can manage environments and import MCP tools from CData Connect AI.

You need your watsonx Orchestrate API Key and Service Instance URL. Follow these steps:

  1. Log in back to your IBM watsonx Orchestrate instance.
  2. Click the profile icon in the top-right corner, open the menu, and select Settings.
  3. On the Settings page, open the API details tab and click Generate API key. A pop-up will appear with your new API Key. Copy and save it securely. You will not be able to view it again later.
  4. Copy your Service Instance URL.
  5. Return to your terminal and run the following command to connect the ADK to your IBM watsonx Orchestrate environment:
    orchestrate env add -n <environment-name> -u <service-instance-url> --type mcsp --activate

    Parameter Description Example
    <environment-name> A name for your ADK environment. CData-Env
    <service-instance-url> Your IBM watsonx Orchestrate Service Instance URL copied from your instance settings. https://api.dl.watson-orchestrate.ibm.com/instances/20250605-1433-1621-306a-df42bcdd849c

    Example command:
        orchestrate env add -n wxO-AWS -u
        https://api.dl.watson-orchestrate.ibm.com/instances/20250605-1433-1621-306a-df42bcdd849c --type mcsp --activate
        
  6. When prompted, paste your API Key into the terminal and press Enter to activate the environment. Once completed, a confirmation message will appear indicating that your environment has been created and activated.

2.4 Import the Connect AI MCP Server Toolkits into IBM watsonx Orchestrate

Now that your environment is connected and authenticated, you can import the CData Connect AI Remote MCP toolkits into IBM watsonx Orchestrate. This step registers the live Amazon Athena tools so that your agents can use them directly through the Agent Development Kit (ADK).

  1. Return to your terminal or command prompt with the virtual environment activated.
  2. Run the following command to import the CData Connect AI MCP toolkit:

        orchestrate toolkits import --kind mcp --name cdata-mcp --description "CData Connect AI MCP Toolkit" --url "https://mcp.cloud.cdata.com/mcp/" --transport "streamable_http" --tools "getCatalogs,getSchemas,getTables,getColumns,queryData,getProcedures,getProcedureParameters,executeProcedure" --app-id "mcp-cdata"
        
  3. Replace the parameters as needed:
    Parameter Description Example
    --name Specifies a name for the imported toolkit. cdata-mcp
    --description A short description of the toolkit. "CData Connect AI MCP Toolkit"
    --url The URL of the CData Connect AI MCP server. https://mcp.cloud.cdata.com/mcp/
    --transport The communication protocol used for the MCP connection. streamable_http
    --tools A comma-separated list of tools to import from the CData Connect AI MCP server.
    The following Connect AI MCP tools provide metadata and query access for live Amazon Athena data:
    • getCatalogs
    • getSchemas
    • getTables
    • getColumns
    • queryData
    • getProcedures
    • getProcedureParameters
    • executeProcedure
    "getCatalogs,getSchemas,getTables,getColumns,queryData,getProcedures,getProcedureParameters,executeProcedure"
    --app-id The connection name created earlier in Step 2.1 (for example, mcp-cdata). mcp-cdata
  4. The ADK connects to the CData Connect AI MCP server, validates the available tools, and imports them into your IBM watsonx Orchestrate instance. The process may take up to 30 seconds while the tool schemas are verified.
  5. Once the import is complete, confirm that the toolkit is available by running:
    orchestrate toolkits list

You have now successfully imported the CData Connect AI MCP toolkits into IBM watsonx Orchestrate. Your agents can now call live Amazon Athena data through CData Connect AI using the imported MCP tools.

2.5 Create an Agent in IBM watsonx Orchestrate

You are now ready to create an AI agent in IBM watsonx Orchestrate that uses the imported CData Connect AI MCP tools. This agent allows you to query and interact with live Amazon Athena data directly through the Connect AI integration.

Follow these steps:

  1. On the IBM watsonx Orchestrate Dashboard, click the three-line Menu icon on the left panel, expand the Build section, and select Agent Builder.
  2. On the Build agents and tools page, click the Create agent button in the top-right corner.
  3. Select Create from scratch, enter a desired Name (for example, CData Connect AI), add a short Description, and click Create.
  4. On the agent's main page, click Toolset in the left panel, then click the Add tool button.
  5. Select Add from file or MCP server, then choose Import from MCP server.
  6. In the top-left Select MCP server drop-down, choose the MCP toolkit you imported in the previous step (for example, cdata-mcp).
  7. Turn on all imported CData Connect AI MCP tools as shown, then click Close to save and exit the dialog.
  8. This completes the connection between Amazon Athena data and IBM watsonx Orchestrate by enabling the imported CData Connect AI tools.

Your agent is now ready to use the CData Connect AI MCP tools to query and process live Amazon Athena data directly within IBM watsonx Orchestrate. You can now start building workflows, testing SQL-based queries, and automating actions using real-time data through Connect AI.



Step 3: Test and Deploy the Built Agent

Your agent setup is now complete, and you can begin testing it directly in the IBM watsonx Orchestrate preview interface. This step ensures that your CData Connect AI MCP integration is working correctly and that your agent can query live Amazon Athena data in real time.

  1. In the agent's Preview chat interface, type List catalogs to view available data catalogs retrieved through the connected CData MCP server.
  2. Next, run a sample query by including your Amazon Athena catalog name for better context. For example: Briefly summarize the most recent sprint's performance in the Amazon Athena catalog.

You can further fine-tune your agent before deployment to improve accuracy, usability, and responsiveness. Here are a few ways to enhance your agent:

  • Add natural-language variations — test multiple phrasings of the same request to ensure your agent understands similar intents consistently.
  • Refine the agent's description and behavior — update the Description and Behavior fields in Agent Builder for clearer task context and responses.
  • Combine more MCP tools or connections — import other CData MCP toolkits or data sources to enable multi-source workflows.
  • Test filtering, summaries, and drill-downs — run varied queries in Preview to validate accuracy and performance.

Once you're ready, deploy the agent by clicking the Deploy button at the top to share it with your team and enable access for all users.


Build AI Agents in Minutes with CData Connect AI

With CData Connect AI, you can securely connect and query over 350+ enterprise data sources directly within your AI environments — no data replication or complex setup required. CData's governed SQL-based connectivity makes it easy for tools like IBM watsonx Orchestrate to access, analyze, and automate workflows with real-time data.

Start building your next AI-powered integration today with a free trial of CData Connect AI.

Or better yet, experience it yourself with the live demo — try it now here.

Ready to get started?

Learn more about CData Connect AI or sign up for free trial access:

Free Trial