Build Parquet-Connected Dashboards in Redash

Ready to get started?

Learn more or sign up for a free trial:

CData Connect



Use CData Connect Cloud to create a virtual MySQL Database for Parquet data and build visualizations and dashbaords from Parquet data in Redash.

Redash lets you connect and query your data sources, build dashboards to visualize data and share them with your company. When paired with CData Connect Cloud, you get instant, cloud-to-cloud access to Parquet data for visualizations, dashboards, and more. This article shows how to create a virtual database for Parquet and build visualizations from Parquet data in Redash.

CData Connect Cloud provides a pure MySQL, cloud-to-cloud interface for Parquet, allowing you to easily build visualizations from Parquet data in Redash. As you build visualizations, Redash generates SQL queries to gather data. Using optimized data processing out of the box, CData Connect Cloud pushes all supported SQL operations (filters, JOINs, etc) directly to Parquet, leveraging server-side processing to quickly return Parquet data.

Create a Virtual MySQL Database for Parquet Data

CData Connect Cloud uses a straightforward, point-and-click interface to connect to data sources and generate APIs.

  1. Log into Connect Cloud and click Databases.
  2. Select "Parquet" from Available Data Sources.
  3. Enter the necessary authentication properties to connect to Parquet.

    Connect to your local Parquet file(s) by setting the URI connection property to the location of the Parquet file.

  4. Click Test Database.
  5. Click Privileges -> Add and add the new user (or an existing user) with the appropriate permissions.

With the virtual database created, you are ready to build visualizations in Tableau Online.

Visualize Parquet Data in Tableau Online

The steps below outline creating a new data source in Redash based on the virtual Parquet database in Connect Cloud and building a simple visualization from the data.

Create a New Data Source

  1. Log into Redash, click on your profile and click "Data Sources"
  2. Click the " New Data Source" button
  3. Select "MySQL (Amazon RDS)" as the Data Source Type (CData Connect uses SSL, which the standard MySQL connection in Redash does not support)
  4. On the configuration tab, set the following properties:
    • Name: Name the data source (e.g. Parquet (CData Connect))
    • Host: The full URL to your CData Connect instance (e.g. https://myinstance.cdatacloud.net)
    • Port: The port of the CData Connect MySQL endpoint (e.g. 3306)
    • User: A CData Connect user
    • Password: The password for the above user
    • Database name: The name of the virtual database for Parquet (e.g. Parquet1)
    • Click the checkbox to Use SSQL
  5. Click Create
  6. Click the "Test Connection" button to ensure you have configured the connection properly

With the new Data Source created, we are ready to visualize our Parquet data.

Create a Parquet Data Visualization

  1. Click Create -> New Query
  2. Select the newly created Data Source (you can explore the data structure in the New Query wizard)
  3. Write a SQL statement to retrieve the data, for example:
    SELECT Id, Column1 FROM SampleTable_1 WHERE Column2 = 'SAMPLE_VALUE'
  4. Click the "Execute" button to load Parquet data into Redash via CData Connect
  5. Use the Visualization Editor to create and analyze graphs from Parquet data
  6. You can schedule the query to refresh and update the visualization periodically

SQL Access to Parquet Data from Cloud Applications

At this point, you have a direct, cloud-to-cloud connection to Parquet data from Redash. You can create new visualizations, build dashboards, and more. For more information on gaining SQL access to data from more than 100 SaaS, Big Data, and NoSQL sources from cloud applications like Redash, refer to our Connect Cloud page.