How to Import Parquet Data into SQL Server using SSIS



Easily back up Parquet data to SQL Server using the SSIS components for Parquet.

Using SQL Server as a backup for critical business data provides a safety net against data loss. Once the data is in SQL Server, business users can also work with it more easily in reporting and analytics tools.

This example demonstrates how to use the CData SSIS Tasks for Parquet inside a SQL Server SSIS workflow to transfer Parquet data into a Microsoft SQL Server database.

Add the Components

To get started, add a new Parquet source and a new SQL Server ADO.NET destination to a new data flow task.

Create a New Connection Manager

Follow the steps below to save Parquet connection properties in a connection manager.

  1. In the Connection Manager window, right-click and then click New Connection. The Add SSIS Connection Manager dialog is displayed.
  2. In the Connection Manager type menu, select Parquet. The CData Parquet Connection Manager is displayed.
  3. Configure connection properties.

    Connect to your local Parquet file(s) by setting the URI connection property to the location of the Parquet file.
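As a rough sketch of what the connection manager holds, the snippet below assembles a semicolon-delimited connection string from a dictionary of properties. Only the URI property comes from the step above. The helper name build_connection_string and the sample file path are hypothetical, and the name=value; layout is an assumption about how the properties are serialized, not a documented format.

```python
def build_connection_string(properties):
    """Join name=value pairs with semicolons (assumed layout, for illustration)."""
    parts = []
    for name, value in properties.items():
        # A literal ';' inside a value would need quoting, which this sketch skips.
        if ";" in str(value):
            raise ValueError(f"{name} contains ';' and would need quoting")
        parts.append(f"{name}={value}")
    return ";".join(parts) + ";"


# Hypothetical local path; replace it with the location of your Parquet file.
conn_str = build_connection_string({"URI": r"C:\data\sample.parquet"})
print(conn_str)
```

Keeping the properties in a dictionary makes it easy to add further settings later without hand-editing the string.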

Configure the Parquet Source

Follow the steps below to specify the query to be used to extract Parquet data.

  1. Double-click the Parquet source to open the source component editor.
  2. In the Connection Manager menu, select the connection manager previously created.
  3. Specify the query to use for the data extraction. For example: SELECT Id, Column1 FROM SampleTable_1 WHERE Column2 = 'SAMPLE_VALUE'
  4. Close the Parquet Source control and connect it to the ADO.NET Destination.
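To see what the example query in step 3 selects, the sketch below runs it against an in-memory SQLite table that stands in for the Parquet data. The rows are made up for illustration, and SQLite is used only because it ships with Python; the Parquet source itself is not involved here.

```python
import sqlite3

# In-memory stand-in for the Parquet data; the rows are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE SampleTable_1 (Id INTEGER, Column1 TEXT, Column2 TEXT)")
conn.executemany(
    "INSERT INTO SampleTable_1 VALUES (?, ?, ?)",
    [
        (1, "alpha", "SAMPLE_VALUE"),
        (2, "beta", "OTHER_VALUE"),
        (3, "gamma", "SAMPLE_VALUE"),
    ],
)

# The query from step 3: two columns are projected and rows are filtered on Column2.
query = "SELECT Id, Column1 FROM SampleTable_1 WHERE Column2 = 'SAMPLE_VALUE'"

# ORDER BY is appended here only to make the printed order deterministic.
rows = conn.execute(query + " ORDER BY Id").fetchall()
conn.close()
print(rows)
```

Only the rows whose Column2 equals 'SAMPLE_VALUE' are returned, and only the Id and Column1 columns flow on to the destination, so the destination table needs matching columns for those two fields.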

Configure the SQL Server Destination

Follow the steps below to specify the SQL Server table to load the Parquet data into.

  1. Open the ADO.NET Destination and add a New Connection. Enter your server and database information here.
  2. In the Data access mode menu, select "table or view".
  3. In the Table Or View menu, select the table or view to populate.
  4. On the Mappings screen, configure any properties as needed.
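Step 3 selects an existing table, so the destination table has to be in place before the data flow runs. The sketch below generates a CREATE TABLE statement for the two columns used in the earlier example query. The type mapping, the table name [dbo].[SampleTable_1], and the helper create_table_sql are all assumptions for illustration; check the column types the source component actually reports before creating a real table.

```python
# Assumed mapping from a few Parquet types to SQL Server types (illustrative only).
PARQUET_TO_SQLSERVER = {
    "INT32": "INT",
    "INT64": "BIGINT",
    "BOOLEAN": "BIT",
    "FLOAT": "REAL",
    "DOUBLE": "FLOAT",
    "STRING": "NVARCHAR(MAX)",
}


def create_table_sql(table, columns):
    """Build a T-SQL CREATE TABLE statement from (name, parquet_type) pairs."""
    column_defs = ",\n".join(
        f"    [{name}] {PARQUET_TO_SQLSERVER[ptype]} NULL" for name, ptype in columns
    )
    return f"CREATE TABLE {table} (\n{column_defs}\n);"


# Hypothetical table name and column types matching the example query's columns.
ddl = create_table_sql("[dbo].[SampleTable_1]", [("Id", "INT64"), ("Column1", "STRING")])
print(ddl)
```

The generated statement can be run in SQL Server Management Studio or any other SQL client before opening the ADO.NET Destination editor.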

Run the Project

You can now run the project. After the SSIS Task has finished executing, your database will be populated with Parquet data.
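Outside the designer, a package file can also be run with the dtexec utility that ships with SQL Server Integration Services. The sketch below builds a dtexec command line and runs it only when dtexec is found on the PATH. The package path and the helper name dtexec_command are hypothetical.

```python
import shutil
import subprocess


def dtexec_command(package_path):
    """Return the argument list for running a package file with dtexec."""
    return ["dtexec", "/F", package_path]


# Hypothetical package path; use the .dtsx file from your own project.
command = dtexec_command(r"C:\ssis\ParquetToSqlServer.dtsx")
print(command)

# Only attempt the run when dtexec is actually available on this machine.
if shutil.which("dtexec"):
    completed = subprocess.run(command)
    print("dtexec exit code:", completed.returncode)
```

Passing the arguments as a list rather than a single string avoids quoting problems when the package path contains spaces.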

Ready to get started?

Download a free trial of the Parquet SSIS Component to get started.

Learn more:

Parquet SSIS Components

Powerful SSIS Source & Destination Components that allow you to easily connect SQL Server with Parquet through SSIS workflows.

Use the Parquet Data Flow Components to synchronize with Parquet data. Perfect for data synchronization, local backups, workflow automation, and more!