Discover how a bimodal integration strategy can address the major data management challenges facing your organization today.
Get the Report →Analyze SAP Data in R
Use standard R functions and the development environment of your choice to analyze SAP data with the CData JDBC Driver for SAP.
Access SAP data with pure R script and standard SQL on any machine where R and Java can be installed. You can use the CData JDBC Driver for SAP and the RJDBC package to work with remote SAP data in R. By using the CData Driver, you are leveraging a driver written for industry-proven standards to access your data in the popular, open-source R language. This article shows how to use the driver to execute SQL queries to SAP and visualize SAP data by calling standard R functions.
About SAP Data Integration
CData provides the easiest way to access and integrate live data from SAP. Customers use CData connectivity to:
- Access every edition of SAP, including SAP R/3, SAP NetWeaver, SAP ERP / ECC 6.0, and SAP S/4 HANA on premises data that is exposed by the RFC.
- Perform actions like sending IDoc or IDoc XML files to the server and creating schemas for functions or queries through SQL stored procedures.
-
Connect optimally depending on where a customer's SAP instance is hosted.
- Customers using SAP S/4HANA cloud public edition will use SAP NetWeaver Gateway connectivity
- Customers using SAP S/4HANA private edition will use either SAP ERP or SAP NetWeaver Gateway connectivity.
While most users leverage our tools to replicate SAP data to databases or data warehouses, many also integrate live SAP data with analytics tools such as Tableau, Power BI, and Excel.
Getting Started
Install R
You can match the driver's performance gains from multi-threading and managed code by running the multithreaded Microsoft R Open or by running open R linked with the BLAS/LAPACK libraries. This article uses Microsoft R Open 3.2.3, which is preconfigured to install packages from the Jan. 1, 2016 snapshot of the CRAN repository. This snapshot ensures reproducibility.
Load the RJDBC Package
To use the driver, download the RJDBC package. After installing the RJDBC package, the following line loads the package:
library(RJDBC)
Connect to SAP as a JDBC Data Source
You will need the following information to connect to SAP as a JDBC data source:
- Driver Class: Set this to cdata.jdbc.saperp.SAPERPDriver
- Classpath: Set this to the location of the driver JAR. By default this is the lib subfolder of the installation folder.
The DBI functions, such as dbConnect and dbSendQuery, provide a unified interface for writing data access code in R. Use the following line to initialize a DBI driver that can make JDBC requests to the CData JDBC Driver for SAP:
driver <- JDBC(driverClass = "cdata.jdbc.saperp.SAPERPDriver", classPath = "MyInstallationDir\lib\cdata.jdbc.saperp.jar", identifier.quote = "'")
You can now use DBI functions to connect to SAP and execute SQL queries. Initialize the JDBC connection with the dbConnect function.
The driver supports connecting to an SAP system using the SAP Java Connector (SAP JCo). Install the files (sapjco3.jar and sapjco3.dll) to the appropriate directory for the hosting application or platform. See the "Getting Started" chapter in the help documentation for information on using the SAP JCo files.
In addition, you can connect to an SAP system using Web services (SOAP). To use Web services, you must enable SOAP access to your SAP system and set the Client, RFCUrl, User, and Password properties, under the Authentication section.
For more information, see this guide on obtaining the connection properties needed to connect to any SAP system.
Built-in Connection String Designer
For assistance in constructing the JDBC URL, use the connection string designer built into the SAP JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.saperp.jar
Fill in the connection properties and copy the connection string to the clipboard.
Below is a sample dbConnect call, including a typical JDBC connection string:
conn <- dbConnect(driver,"jdbc:saperp:Host=sap.mydomain.com;User=EXT90033;Password=xxx;Client=800;System Number=09;ConnectionType=Classic;Location=C:/mysapschemafolder;")
Schema Discovery
The driver models SAP APIs as relational tables, views, and stored procedures. Use the following line to retrieve the list of tables:
dbListTables(conn)
Execute SQL Queries
You can use the dbGetQuery function to execute any SQL query supported by the SAP API:
mara <- dbGetQuery(conn,"SELECT MANDT, MBRSH FROM MARA")
You can view the results in a data viewer window with the following command:
View(mara)
Plot SAP Data
You can now analyze SAP data with any of the data visualization packages available in the CRAN repository. You can create simple bar plots with the built-in bar plot function:
par(las=2,ps=10,mar=c(5,15,4,2))
barplot(mara$MBRSH, main="SAP MARA", names.arg = mara$MANDT, horiz=TRUE)