Create Reports from Amazon Athena Data in Google Data Studio

Ready to get started?

Learn more or sign up for a free trial:

CData Connect



Use CData Connect Cloud to create a virtual MySQL Database for Amazon Athena data and create custom reports in Google Data Studio.

Google Data Studio allows you to create branded reports with data visualizations to share with your clients. When paired with CData Connect Cloud, you get instant, cloud-to-cloud access to Amazon Athena data for visualizations, dashboards, and more. This article shows how to create a virtual database for Amazon Athena and build reports from Amazon Athena data in Google Data Studio.

CData Connect Cloud provides a pure cloud-to-cloud interface for Amazon Athena, allowing you to easily build reports from live Amazon Athena data in Google Data Studio — without replicating the data. As you build visualizations, Google Data Studio generates queries to gather data. Using optimized data processing out of the box, CData Connect Cloud pushes all supported query operations (filters, JOINs, etc) directly to Amazon Athena, leveraging server-side processing to quickly return Amazon Athena data.

This article requires a CData Connect Cloud instance and the CData Connect Cloud Connector for Google Data Studio. Get more information on the CData Connect Cloud and sign up for a free trial at https://www.cdata.com/connect.


Connect to Amazon Athena from Connect Cloud

CData Connect Cloud uses a straightforward, point-and-click interface to connect to data sources and generate APIs.

  1. Log into Connect Cloud and click Databases.
  2. Select "Amazon Athena" from Available Data Sources.
  3. Enter the necessary authentication properties to connect to Amazon Athena.

    Authenticating to Amazon Athena

    To authorize Amazon Athena requests, provide the credentials for an administrator account or for an IAM user with custom permissions: Set AccessKey to the access key Id. Set SecretKey to the secret access key.

    Note: Though you can connect as the AWS account administrator, it is recommended to use IAM user credentials to access AWS services.

    Obtaining the Access Key

    To obtain the credentials for an IAM user, follow the steps below:

    1. Sign into the IAM console.
    2. In the navigation pane, select Users.
    3. To create or manage the access keys for a user, select the user and then select the Security Credentials tab.

    To obtain the credentials for your AWS root account, follow the steps below:

    1. Sign into the AWS Management console with the credentials for your root account.
    2. Select your account name or number and select My Security Credentials in the menu that is displayed.
    3. Click Continue to Security Credentials and expand the Access Keys section to manage or create root account access keys.

    Authenticating from an EC2 Instance

    If you are using the CData Data Provider for Amazon Athena 2018 from an EC2 Instance and have an IAM Role assigned to the instance, you can use the IAM Role to authenticate. To do so, set UseEC2Roles to true and leave AccessKey and SecretKey empty. The CData Data Provider for Amazon Athena 2018 will automatically obtain your IAM Role credentials and authenticate with them.

    Authenticating as an AWS Role

    In many situations it may be preferable to use an IAM role for authentication instead of the direct security credentials of an AWS root user. An AWS role may be used instead by specifying the RoleARN. This will cause the CData Data Provider for Amazon Athena 2018 to attempt to retrieve credentials for the specified role. If you are connecting to AWS (instead of already being connected such as on an EC2 instance), you must additionally specify the AccessKey and SecretKey of an IAM user to assume the role for. Roles may not be used when specifying the AccessKey and SecretKey of an AWS root user.

    Authenticating with MFA

    For users and roles that require Multi-factor Authentication, specify the MFASerialNumber and MFAToken connection properties. This will cause the CData Data Provider for Amazon Athena 2018 to submit the MFA credentials in a request to retrieve temporary authentication credentials. Note that the duration of the temporary credentials may be controlled via the TemporaryTokenDuration (default 3600 seconds).

    Connecting to Amazon Athena

    In addition to the AccessKey and SecretKey properties, specify Database, S3StagingDirectory and Region. Set Region to the region where your Amazon Athena data is hosted. Set S3StagingDirectory to a folder in S3 where you would like to store the results of queries.

    If Database is not set in the connection, the data provider connects to the default database set in Amazon Athena.

  4. Click Test Database.
  5. Click Privileges -> Add and add the new user (or an existing user) with the appropriate permissions.

With the virtual database created, you are ready to connect to Amazon Athena data from Google Data Studio.

Visualize Live Amazon Athena Data in Google Data Studio

The steps below outline connecting to CData Connect Cloud from Google Data Studio to create a new Amazon Athena data source and build a simple visualization from the data.

  1. Log into Google Data Studio, click data sources, create a new data source, and choose CData Connect Cloud Connector.
  2. Authorize the Connector to connect to an external service (your Connect Cloud instance).
  3. Use your instance name (myinstance in myinstance.cdatacloud.net), username, and password to connect to your Connect Cloud instance.
    • Username: myinstance/username
    • Password: your Connect Cloud password
  4. Select a Database (e.g. AmazonAthena1) and click Next.
  5. Select a Table (e.g. Customers) and click Next.
  6. Click Connect.
  7. If needed, modify columns, click Create Report, and add the data source to the report.
  8. Select a visualization style and add it to the report.
  9. Select Dimensions and Measures to customize your visualization.

Optional: Connect with the MySQL Connector

If you need to work with data from a custom SQL query, you can use the MySQL Connector. Connect using the server information for your Connect Cloud instance (server address, port, username, and password).

SQL Access to Amazon Athena Data from Cloud Applications

Now you have a direct, cloud-to-cloud connection to live Amazon Athena data from your Google Data Studio workbook. You can create more data sources and new visualizations, build reports, and more — all without replicating Amazon Athena data.

Try CData Connect Cloud and get SQL data access to 200+ SaaS, Big Data, and NoSQL sources directly from your cloud applications.