We are proud to share our inclusion in the 2024 Gartner Magic Quadrant for Data Integration Tools. We believe this recognition reflects the differentiated business outcomes CData delivers to our customers.
Get the Report →How to Seamlessly Import Presto Data into IBM SPSS Modeler
Integrate Presto data into IBM SPSS Modeler using the CData ODBC Driver for real-time insights and advanced data analysis.
IBM SPSS Modeler is a powerful data mining and predictive analytics platform that enables organizations to extract valuable insights from their data. By connecting Presto data data to SPSS Modeler via the CData ODBC Driver for Presto, you can leverage real-time access for advanced data mining, predictive modeling, and statistical analysis.
This guide takes you through the steps of connecting IBM SPSS Modeler to Presto data, enabling seamless data import, preparation, and analysis. With the CData ODBC Driver for Presto, you can unlock the full potential of your Presto data data within IBM SPSS Modeler for actionable insights.
About Presto Data Integration
Accessing and integrating live data from Trino and Presto SQL engines has never been easier with CData. Customers rely on CData connectivity to:
- Access data from Trino v345 and above (formerly PrestoSQL) and Presto v0.242 and above (formerly PrestoDB)
- Read and write access all of the data underlying your Trino or Presto instances
- Optimized query generation for maximum throughput.
Presto and Trino allow users to access a variety of underlying data sources through a single endpoint. When paired with CData connectivity, users get pure, SQL-92 access to their instances, allowing them to integrate business data with a data warehouse or easily access live data directly from their preferred tools, like Power BI and Tableau.
In many cases, CData's live connectivity surpasses the native import functionality available in tools. One customer was unable to effectively use Power BI due to the size of the datasets needed for reporting. When the company implemented the CData Power BI Connector for Presto they were able to generate reports in real-time using the DirectQuery connection mode.
Getting Started
Overview
Here is an overview of the steps:
- CONFIGURE THE ODBC DRIVER: Set up a connection to Presto data in the CData ODBC Driver for Presto by entering the required connection properties.
- SET UP ODBC CONNECTION IN SPSS MODELER: Establish the ODBC connection within IBM SPSS Modeler by selecting the configured DSN.
- IMPORT AND PROCESS DATA: Import the Presto data data into SPSS Modeler, then review, filter, transform, and prepare the data for predictive analytics and statistical modeling.
Configure the Presto DSN Using the CData ODBC Driver
To start, configure the DSN (Data Source Name) for Presto data in your system using the CData ODBC Driver. Download and install a 30-day free trial with all the features from here.
Once installed, launch the ODBC Data Source Administrator:
- On Windows: Search for ODBC Data Source Administrator in the Start menu and open the application.
- On Mac: Open Applications, go to Utilities, and select ODBC Manager.
- On Linux: Use the command line to launch ODBC Data Source Administrator or use unixODBC if installed.
Once launched, double-click on the CData Presto data Source and enter the required values to establish a connection:
Set the Server and Port connection properties to connect, in addition to any authentication properties that may be required.
To enable TLS/SSL, set UseSSL to true.
Authenticating with LDAP
In order to authenticate with LDAP, set the following connection properties:
- AuthScheme: Set this to LDAP.
- User: The username being authenticated with in LDAP.
- Password: The password associated with the User you are authenticating against LDAP with.
Authenticating with Kerberos
In order to authenticate with KERBEROS, set the following connection properties:
- AuthScheme: Set this to KERBEROS.
- KerberosKDC: The Kerberos Key Distribution Center (KDC) service used to authenticate the user.
- KerberosRealm: The Kerberos Realm used to authenticate the user with.
- KerberosSPN: The Service Principal Name for the Kerberos Domain Controller.
- KerberosKeytabFile: The Keytab file containing your pairs of Kerberos principals and encrypted keys.
- User: The user who is authenticating to Kerberos.
- Password: The password used to authenticate to Kerberos.
Setup an ODBC Connection in IBM SPSS Modeler
After configuring the DSN, it's time to connect to it in IBM SPSS Modeler:
- Launch IBM SPSS Modeler, log in, and create a new stream.
- From the Sources palette, locate the Database node, and drag it onto the canvas.
- Double-click the Database node to open the configuration dialog. Select
, browse to select the configured DSN, then click OK. - In the Database dialog, browse to select the table(s) you’d like to import, preview the data, and click OK to finalize.
You are now ready to process and analyze the Presto data data in IBM SPSS Modeler.
Process Data: Filter, Categories, and Model
Once the tables are imported, you can refine, filter, categorize, and model your Presto data data in SPSS Modeler:
- Filtering: Double-click your Database connection, go to the Filter section, and select/deselect fields to focus on relevant data. This improves processing speed and model accuracy.
- Set Data Types and Roles: Categorize your fields and assign roles to each data type by navigating to the Types section.
- Perform a Basic Analysis: Drag and drop the Analysis node next to your Database node, connect them, and click the Play button to run the stream and analyze the data.
You have now performed a simple analysis, enabling SPSS Modeler to process and display insights from your database.
Unlock the Potential of Your Presto Data with CData
With the CData ODBC Driver for Presto, connecting Presto data data to IBM SPSS Modeler is seamless. Start your free trial today and unlock the full potential of your real-time data for advanced analytics and decision-making.