How to work with Oracle Eloqua Reporting Data in Apache Spark using SQL
Apache Spark is a fast and general engine for large-scale data processing. When paired with the CData JDBC Driver for Oracle Eloqua Reporting, Spark can work with live Oracle Eloqua Reporting data. This article describes how to connect to and query Oracle Eloqua Reporting data from a Spark shell.
The CData JDBC Driver offers unmatched performance for interacting with live Oracle Eloqua Reporting data due to optimized data processing built into the driver. When you issue complex SQL queries to Oracle Eloqua Reporting, the driver pushes supported SQL operations, like filters and aggregations, directly to Oracle Eloqua Reporting and utilizes the embedded SQL engine to process unsupported operations (often SQL functions and JOIN operations) client-side. With built-in dynamic metadata querying, you can work with and analyze Oracle Eloqua Reporting data using native data types.
Install the CData JDBC Driver for Oracle Eloqua Reporting
Download the CData JDBC Driver for Oracle Eloqua Reporting installer, unzip the package, and run the JAR file to install the driver.
Start a Spark Shell and Connect to Oracle Eloqua Reporting Data
- Open a terminal and start the Spark shell with the CData JDBC Driver for Oracle Eloqua Reporting JAR file as the jars parameter:
$ spark-shell --jars /CData/CData JDBC Driver for Oracle Eloqua Reporting/lib/cdata.jdbc.oracleeloquareporting.jar
- With the shell running, you can connect to Oracle Eloqua Reporting with a JDBC URL and use the SQL Context load() function to read a table.
Oracle Eloqua Reporting supports the following authentication methods:
- Basic authentication (User and Password)
- OAuth 2.0 code grant flow
- OAuth 2.0 password grant flow
Basic Authentication (User and Password)
To perform authentication with a user and password, specify these properties:
- AuthScheme: Basic.
- Company: The company name associated with your Oracle Eloqua Reporting account.
- User: Your login account name.
- Password: Your login password.
OAuth Authentication (Code Grant Flow)
To authenticate with the OAuth code grant flow, you must set AuthScheme to OAuth and create a custom OAuth application. For information about how to create a custom OAuth application, see the Help documentation.
Then set the following properties:
- InitiateOAuth: GETANDREFRESH. Used to automatically get and refresh the OAuthAccessToken.
- OAuthClientId: The client Id assigned when you registered your application.
- OAuthClientSecret: The client secret that was assigned when you registered your application.
- CallbackURL: The redirect URI that was defined when you registered your application.
When you connect, the driver opens Oracle Eloqua Reporting's OAuth endpoint in your default browser. Log in and grant permissions to the application. When the access token expires, the driver refreshes it automatically.
OAuth Authentication (Password Grant Flow)
With the OAuth password grant flow, you can use your OAuth application's credentials alongside your user credentials to authenticate without the need to grant permission manually via a browser prompt. You must create an OAuth app (see the Help documentation) to use this authentication method.
Set the following properties:
- AuthScheme: OAuthPassword
- Company: The company's unique identifier.
- User: Your login account name.
- Password: Your login password.
- OAuthClientId: The client Id assigned when you registered your custom OAuth application.
- OAuthClientSecret: The client secret assigned when you registered your custom OAuth application.
Built-in Connection String Designer
For assistance in constructing the JDBC URL, use the connection string designer built into the Oracle Eloqua Reporting JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.oracleeloquareporting.jar
Fill in the connection properties and copy the connection string to the clipboard.
Configure the connection to Oracle Eloqua Reporting, using the connection string generated above.
scala> val oracleeloquareporting_df = spark.sqlContext.read.format("jdbc").option("url", "jdbc:oracleeloquareporting:AuthScheme=Basic;User=user;Password=password;Company=MyCompany;").option("dbtable","").option("driver","cdata.jdbc.oracleeloquareporting.OracleEloquaReportingDriver").load() - Once you connect and the data is loaded you will see the table schema displayed.
Register the Oracle Eloqua Reporting data as a temporary table:
scala> oracleeloquareporting_df.registerTable("")-
Perform custom SQL queries against the Data using commands like the one below:
scala> oracleeloquareporting_df.sqlContext.sql("SELECT , FROM WHERE = ").collect.foreach(println)You will see the results displayed in the console, similar to the following:
Using the CData JDBC Driver for Oracle Eloqua Reporting in Apache Spark, you are able to perform fast and complex analytics on Oracle Eloqua Reporting data, combining the power and utility of Spark with your data. Download a free, 30 day trial of any of the 200+ CData JDBC Drivers and get started today.