Query HDFS Data in MySQL Workbench

Ready to get started?

Learn more or sign up for a free trial:

CData Connect

Create a virtual MySQL database for HDFS data in CData Connect (or Connect Cloud) and work with live HDFS data in MySQL Workbench.

MySQL Workbench allows users to administer MySQL environments and gain better visibility into databases. When paired with CData Connect (on-premise or Connect Cloud), you get live access to HDFS data as if it were a MySQL database. This article shows how to create a virtual database for HDFS in Connect and work with live HDFS data in MySQL Workbench.

Create a Virtual MySQL Database for HDFS Data

CData Connect uses a straightforward, point-and-click interface to connect to data sources and generate APIs.

  1. Login to Connect and click Databases.
  2. Select "HDFS" from Available Data Sources.
  3. Enter the necessary authentication properties to connect to HDFS.

    In order to authenticate, set the following connection properties:

    • Host: Set this value to the host of your HDFS installation.
    • Port: Set this value to the port of your HDFS installation. Default port: 50070
  4. Click Test Database.
  5. Click Privileges -> Add and add the new user (or an existing user) with the appropriate permissions.

With the virtual database created, you are ready to connect to HDFS from MySQL Workbench.

Query HDFS from MySQL Workbench

The steps below outline connecting to the virtual HDFS database in Connect from MySQL Workbench and issuing basic queries to work with live HDFS data.

Connect to HDFS through Connect

  1. In MySQL Workbench, click to add a new MySQL connection.
  2. Name the connection (CData Connect).
  3. Set the Hostname, Port, and Username parameters to connect to the SQL Gateway.
  4. Click Store in Vault to set and store the password.
  5. Click Test Connection to ensure the connection is configured properly and click OK.

Query HDFS Data

  1. Open the connection you just created (CData Connect).
  2. Click File -> New Query Tab.
  3. Write a SQL query to retrieve HDFS data, like SELECT * FROM hdfsdb.Files;

With access to live HDFS data from MySQL Workbench, you can easily query and update HDFS, just like you would a MySQL database. Request a demo of the CData Connect and start working with HDFS as a cloud-based MySQL database today.