Create Reports from Amazon S3 Data in Google Data Studio

Use the CData Cloud Hub to create a virtual MySQL Database for Amazon S3 data and create custom reports in Google Data Studio.

Google Data Studio allows you to create branded reports with data visualizations to share with your clients. When paired with the CData Cloud Hub, you get instant, cloud-to-cloud access to Amazon S3 data for visualizations, dashboards, and more. This article shows how to create a virtual database for Amazon S3 and build reports from Amazon S3 data in Google Data Studio.

The CData Cloud Hub provides a pure MySQL, cloud-to-cloud interface for Amazon S3, allowing you to build reports from live Amazon S3 data in Google Data Studio without replicating the data to a natively supported database. As you build visualizations, Google Data Studio generates SQL queries to gather data. Using optimized data processing out of the box, the CData Cloud Hub pushes all supported SQL operations (filters, JOINs, etc.) directly to Amazon S3, leveraging server-side processing to quickly return Amazon S3 data.

Create a Virtual MySQL Database for Amazon S3 Data

CData Cloud Hub uses a straightforward, point-and-click interface to connect to data sources and generate APIs.

  1. Log in to Cloud Hub and click Databases.
  2. Select "Amazon S3" from Available Data Sources.
  3. Enter the necessary authentication properties to connect to Amazon S3.

    To authorize Amazon S3 requests, provide the credentials for an administrator account or for an IAM user with custom permissions. Set AccessKey to the access key ID. Set SecretKey to the secret access key.

    Note: You can connect as the AWS account administrator, but it is recommended to use IAM user credentials to access AWS services.

    For information on obtaining the credentials and other authentication methods, refer to the Getting Started section of the Help documentation.

  4. Click Test Database.
  5. Click Privileges -> Add and add the new user (or an existing user) with the appropriate permissions.

With the virtual database created, you are ready to connect to Amazon S3 data from any MySQL client.
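As a quick check that the virtual database is reachable from a MySQL client, the connection can be sketched in Python. This is a minimal sketch, not a documented Cloud Hub procedure: it assumes the third-party PyMySQL package, a database named amazons3db, the default MySQL port 3306, and that the endpoint answers SHOW TABLES like an ordinary MySQL server. The function names, host, and certificate path are hypothetical placeholders.

```python
from typing import Any, Dict, List, Optional


def build_connection_settings(
    host: str,
    user: str,
    password: str,
    database: str = "amazons3db",   # assumed virtual database name
    port: int = 3306,               # assumed MySQL port
    ca_file: Optional[str] = None,  # optional path to the server certificate
) -> Dict[str, Any]:
    """Collect keyword arguments in the shape pymysql.connect() accepts."""
    settings: Dict[str, Any] = {
        "host": host,
        "port": port,
        "user": user,
        "password": password,
        "database": database,
    }
    if ca_file is not None:
        # PyMySQL takes TLS options as a dict; "ca" names the trusted certificate file.
        settings["ssl"] = {"ca": ca_file}
    return settings


def list_tables(settings: Dict[str, Any]) -> List[str]:
    """Connect and return the table names the virtual database exposes."""
    import pymysql  # third-party dependency: pip install pymysql

    connection = pymysql.connect(**settings)
    try:
        with connection.cursor() as cursor:
            cursor.execute("SHOW TABLES")
            return [row[0] for row in cursor.fetchall()]
    finally:
        connection.close()
```

Calling list_tables(build_connection_settings("myinstance.cdatacloud.net", "your_user", "your_password", ca_file="cert.pem")) would then list whatever tables the Amazon S3 connection exposes. Keeping the settings in a separate function makes it easy to confirm the values before entering the same ones in another client.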

Visualize Live Amazon S3 Data in Google Data Studio

The steps below outline connecting to the CData Cloud Hub from Google Data Studio to create a new Amazon S3 data source and build a simple visualization from the data.

  1. Log in to Google Data Studio, click Data Sources, create a new data source, and choose MySQL.
  2. Choose the basic configuration and set the connection properties:
    • Host name or IP: myinstance.cdatacloud.net
    • Port: 3306
    • Database: amazons3db
    • Username: your Cloud Hub username
    • Password: your Cloud Hub password
    • Click Enable SSL, then upload the certificates

      Server Certificate

      Google requires users to upload a server certificate for Server-side SSL. There are two ways to get the server certificate for your Cloud Hub instance:
      1. Web Browser: Log in to your Cloud Hub instance from a web browser and follow your browser's procedure for exporting the server SSL certificate. Save the certificate to a file, such as cert.pem.
      2. MySQL Client: Log in to your instance from a MySQL client and submit the following query:

                SELECT @@cloudhub_ssl_cert;
                
        Copy the results and save them to a file, such as cert.pem.

      Client Certificates

      Create a private/public key pair to be used for client authentication. For more information, refer to our Knowledge Base article: https://www.cdata.com/kb/entries/ssl_certificate_generator.rst.

  3. Click Authenticate.
  4. Select the table to visualize or enter a custom query and click Connect.
    Note: JOINs are not supported in the user interface, but they are supported in custom queries.
  5. If needed, modify columns, click Create Report, and add the data source to the report.
  6. Select a visualization style and add it to the report.
  7. Select Dimensions and Measures to customize your visualization.
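For the MySQL client route to the server certificate in step 2, the text returned by SELECT @@cloudhub_ssl_cert; can be checked and written to cert.pem with a few lines of Python. This is a minimal sketch that assumes the query returns a single PEM-encoded certificate as text; save_server_certificate is a hypothetical helper name, and only the Python standard library is used.

```python
import ssl
from pathlib import Path


def save_server_certificate(pem_text: str, path: str = "cert.pem") -> Path:
    """Check that the text looks like one PEM certificate, then write it to a file."""
    cleaned = pem_text.strip()
    # Raises ValueError when the BEGIN/END CERTIFICATE markers are missing.
    # This checks the PEM framing only; it does not verify the certificate itself.
    ssl.PEM_cert_to_DER_cert(cleaned)
    target = Path(path)
    target.write_text(cleaned + "\n", encoding="ascii")
    return target
```

Passing the copied query result to save_server_certificate produces a file that can be uploaded as the server certificate. The check catches the common copy-and-paste problem of a truncated header or footer before the upload step.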

SQL Access to Amazon S3 Data from Cloud Applications

Now you have a direct, cloud-to-cloud connection to live Amazon S3 data from your Google Data Studio report. You can create more data sources and new visualizations, build reports, and more, all without replicating Amazon S3 data.

To get SQL data access to 100+ SaaS, Big Data, and NoSQL sources directly from your cloud applications, see the CData Cloud Hub.