Connect to Webflow Data in RapidMiner
This article shows how you can easily integrate the CData JDBC driver for Webflow into your processes in RapidMiner. This article uses the CData JDBC Driver for Webflow to transfer Webflow data to a process in RapidMiner.
Connect to Webflow in RapidMiner as a JDBC Data Source
You can follow the procedure below to establish a JDBC connection to Webflow:
- Add a new database driver for Webflow: Click Connections -> Manage Database Drivers.
- In the resulting wizard, click the Add button and enter a name for the connection.
- Enter the prefix for the JDBC URL:
jdbc:api:
- Enter the path to the cdata.jdbc.api.jar file, located in the lib subfolder of the installation directory.
- Enter the driver class:
cdata.jdbc.api.APIDriver
- Create a new Webflow connection: Click Connections -> Manage Database Connections.
- Enter a name for your connection.
- For Database System, select the Webflow driver you configured previously.
- Enter your connection string in the Host box.
Authentication
Webflow uses OAuth 2.0 authentication to ensure secure access to sites, CMS collections, e-commerce data, and other resources. This authentication method allows you to securely connect to your Webflow workspace and manage resources with proper authorization.
OAuth 2.0 Setup and Configuration
Step 1: Create a Webflow OAuth Application
To set up OAuth authentication:
- Visit the Webflow Developer Portal
- Navigate to "Apps & Integrations" in your Webflow account
- Click "Register an App" to create a new OAuth application
- Configure the application name, description, and redirect URI (CallbackURL)
- Copy the Client ID and Client Secret for use in your connection
Required Connection Properties
- AuthScheme: Set this to OAuth (required)
- OAuthClientId: Client ID from your Webflow OAuth application (required)
- OAuthClientSecret: Client secret from your Webflow OAuth application (required)
- CallbackURL: Redirect URI specified in your OAuth application (required)
- InitiateOAuth: Set to GETANDREFRESH for automatic token management (recommended)
Required OAuth Scopes
The Webflow API Profile requires the following OAuth scopes:
- sites:read - Read access to site information and configuration
- pages:read - Read access to site pages
- cms:read - Read access to CMS collections and items
- forms:read - Read access to forms and form submissions
- assets:read - Read access to media assets and folders
- ecommerce:read - Read access to products, orders, and inventory
- authorized_user:read - Read access to the authorized user
Built-in Connection String Designer
For assistance in constructing the JDBC URL, use the connection string designer built into the Webflow JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.api.jar
Fill in the connection properties and copy the connection string to the clipboard.
A typical connection string is below:
Profile=C:\profiles\Webflow.apip;AuthScheme=OAuth;InitiateOAuth=GETANDREFRESH;OAuthClientId=your_client_id;OAuthClientSecret=your_client_secret;CallbackUrl=your_callback_url;
- Enter your username and password if necessary.
You can now use your Webflow connection with the various RapidMiner operators in your process. To retrieve Webflow data, drag the Retrieve operator from the Operators view.
With the Retrieve operator selected, you can then define which table to retrieve in the Parameters view by clicking the folder icon next to the "repository entry." In the resulting Repository Browser, you can expand your connection node to select the desired example set.
Finally, wire the output to the Retrieve process to a result, and run the process to see the Webflow data.