Build Data Flows from Scrapfly to SQL Server using SSIS

Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Easily back up Scrapfly data to SQL Server using the SSIS components for Scrapfly.

Using SQL Server as a backup for critical business data provides an essential safety net against loss. Backing up data to SQL Server enables business users to more easily connect that data with features like reporting, analytics, and more.

This example demonstrates how to use the CData SSIS Tasks for Scrapfly inside of a SQL Server SSIS workflow to transfer Scrapfly data into a Microsoft SQL Server database.

Add the Components

To get started, add a new Scrapfly source and SQL Server ADO.NET destination to a new data flow task.

Create a New Connection Manager

Follow the steps below to save Scrapfly connection properties in a connection manager.

  1. In the Connection Manager window, right-click and then click New Connection. The Add SSIS Connection Manager dialog is displayed.
  2. In the Connection Manager type menu, select API. The CData Scrapfly Connection Manager is displayed.
  3. Configure connection properties.

    The Scrapfly API uses API Key authentication. The API key is passed as the key query parameter on every request.

    Using API Key Authentication

    Your Scrapfly API key is required to create a connection. To obtain your API key:

    1. Log into your Scrapfly account at scrapfly.io.
    2. Navigate to Dashboard and select API Keys.
    3. Copy your API key (begins with scp-live- for production or scp-test- for the test environment).

    After obtaining your API key, set the following connection properties:

    • AuthScheme: Set this to APIKey.
    • APIKey: Set this to your Scrapfly API key.

    Example connection string:

    Profile=C:\profiles\Scrapfly.apip;AuthScheme=APIKey;ProfileSettings='APIKey=your_api_key';
    

Configure the Scrapfly Source

Follow the steps below to specify the query to be used to extract Scrapfly data.

  1. Double-click the Scrapfly source to open the source component editor.
  2. In the Connection Manager menu, select the connection manager previously created.
  3. Specify the query to use for the data extraction. For example:
    SELECT ,  FROM Account WHERE  = ''
    
  4. Close the Scrapfly Source control and connect it to the ADO.NET Destination.

Configure the SQL Server Destination

Follow the steps below to specify the SQL server table to load the Scrapfly data into.

  1. Open the ADO.NET Destination and add a New Connection. Enter your server and database information here.
  2. In the Data access mode menu, select "table or view".
  3. In the Table Or View menu, select the table or view to populate.
  4. Configure any properties you wish to on the Mappings screen.

Run the Project

You can now run the project. After the SSIS Task has finished executing, your database will be populated with Scrapfly data.

Ready to get started?

Connect to live data from Scrapfly with the API Driver

Connect to Scrapfly