We are proud to share our inclusion in the 2024 Gartner Magic Quadrant for Data Integration Tools. We believe this recognition reflects the differentiated business outcomes CData delivers to our customers.
Get the Report →How to Import Spark Data into SQL Server using SSIS
Easily back up Spark data to SQL Server using the SSIS components for Spark.
Using SQL Server as a backup for critical business data provides an essential safety net against loss. Backing up data to SQL Server enables business users to more easily connect that data with features like reporting, analytics, and more.
This example demonstrates how to use the CData SSIS Tasks for Spark inside of a SQL Server SSIS workflow to transfer Spark data into a Microsoft SQL Server database.
Add the Components
To get started, add a new Spark source and SQL Server ADO.NET destination to a new data flow task.
Create a New Connection Manager
Follow the steps below to save Spark connection properties in a connection manager.
- In the Connection Manager window, right-click and then click New Connection. The Add SSIS Connection Manager dialog is displayed.
- In the Connection Manager type menu, select SparkSQL. The CData Spark Connection Manager is displayed.
- Configure connection properties.
Set the Server, Database, User, and Password connection properties to connect to SparkSQL.
Configure the Spark Source
Follow the steps below to specify the query to be used to extract Spark data.
- Double-click the Spark source to open the source component editor.
- In the Connection Manager menu, select the connection manager previously created.
- Specify the query to use for the data extraction. For example:
SELECT City, Balance FROM Customers
- Close the Spark Source control and connect it to the ADO.NET Destination.
Configure the SQL Server Destination
Follow the steps below to specify the SQL server table to load the Spark data into.
- Open the ADO.NET Destination and add a New Connection. Enter your server and database information here.
- In the Data access mode menu, select "table or view".
- In the Table Or View menu, select the table or view to populate.
- Configure any properties you wish to on the Mappings screen.
Run the Project
You can now run the project. After the SSIS Task has finished executing, your database will be populated with Spark data.