Ready to get started?

Learn more about the CData JDBC Driver for Presto or download a free trial:

Download Now

Connect to Presto Data in RapidMiner

Integrate Presto data with standard components and data source configuration wizards in RapidMiner Studio.

This article shows how you can easily integrate the CData JDBC driver for Presto into your processes in RapidMiner. This article uses the CData JDBC Driver for Presto to transfer Presto data to a process in RapidMiner.

Connect to Presto in RapidMiner as a JDBC Data Source

You can follow the procedure below to establish a JDBC connection to Presto:

  1. Add a new database driver for Presto: Click Connections -> Manage Database Drivers.
  2. In the resulting wizard, click the Add button and enter a name for the connection.
  3. Enter the prefix for the JDBC URL: jdbc:presto:
  4. Enter the path to the cdata.jdbc.presto.jar file, located in the lib subfolder of the installation directory.
  5. Enter the driver class: cdata.jdbc.presto.PrestoDriver
  6. Create a new Presto connection: Click Connections -> Manage Database Connections.
  7. Enter a name for your connection.
  8. For Database System, select the Presto driver you configured previously.
  9. Enter your connection string in the Host box.

    Set the Server and Port connection properties to connect, in addition to any authentication properties that may be required.

    To enable TLS/SSL, set UseSSL to true.

    Authenticating with LDAP

    In order to authenticate with LDAP, set the following connection properties:

    • AuthScheme: Set this to LDAP.
    • User: The username being authenticated with in LDAP.
    • Password: The password associated with the User you are authenticating against LDAP with.

    Authenticating with Kerberos

    In order to authenticate with KERBEROS, set the following connection properties:

    • AuthScheme: Set this to KERBEROS.
    • KerberosKDC: The Kerberos Key Distribution Center (KDC) service used to authenticate the user.
    • KerberosRealm: The Kerberos Realm used to authenticate the user with.
    • KerberosSPN: The Service Principal Name for the Kerberos Domain Controller.
    • KerberosKeytabFile: The Keytab file containing your pairs of Kerberos principals and encrypted keys.
    • User: The user who is authenticating to Kerberos.
    • Password: The password used to authenticate to Kerberos.

    Built-in Connection String Designer

    For assistance in constructing the JDBC URL, use the connection string designer built into the Presto JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.

    java -jar cdata.jdbc.presto.jar

    Fill in the connection properties and copy the connection string to the clipboard.

    A typical connection string is below:

    Server=127.0.0.1;Port=8080;
  10. Enter your username and password if necessary.

You can now use your Presto connection with the various RapidMiner operators in your process. To retrieve Presto data, drag the Retrieve operator from the Operators view. With the Retrieve operator selected, you can then define which table to retrieve in the Parameters view by clicking the folder icon next to the "repository entry." In the resulting Repository Browser, you can expand your connection node to select the desired example set.

Finally, wire the output to the Retrieve process to a result, and run the process to see the Presto data.