Import Google Data Catalog Data into SQL Server using SSIS

Easily back up Google Data Catalog data to SQL Server using the SSIS components for Google Data Catalog.

Backing up critical business data to SQL Server provides a safety net against data loss. It also makes that data easier for business users to reach from reporting and analytics tools.

This example demonstrates how to use the CData SSIS Tasks for Google Data Catalog inside of a SQL Server SSIS workflow to transfer Google Data Catalog data into a Microsoft SQL Server database.

Add the Components

To get started, add a new Google Data Catalog source and SQL Server ADO.NET destination to a new data flow task.

Create a New Connection Manager

Follow the steps below to save Google Data Catalog connection properties in a connection manager.

  1. In the Connection Manager window, right-click and then click New Connection. The Add SSIS Connection Manager dialog is displayed.
  2. In the Connection Manager type menu, select GoogleDataCatalog. The CData Google Data Catalog Connection Manager is displayed.
  3. Configure connection properties.

    Google Data Catalog uses the OAuth authentication standard. Authorize access to Google APIs on behalf of individual users or on behalf of users in a domain.

    Before connecting, specify the following to identify the organization and project you would like to connect to:

    • OrganizationId: The ID associated with the Google Cloud Platform organization resource you would like to connect to. Find this by navigating to the cloud console.

      Click the project selection drop-down, and select your organization from the list. Then, click More -> Settings. The organization ID is displayed on this page.

    • ProjectId: The ID associated with the Google Cloud Platform project resource you would like to connect to.

      Find this by navigating to the cloud console dashboard and selecting your project from the Select from drop-down. The project ID will be present in the Project info card.

    When you connect, the OAuth endpoint opens in your default browser. Log in and grant permissions to the application to complete the OAuth process. For more information, refer to the OAuth section in the Help documentation.
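
As a rough illustration of the two properties described above, the following Python sketch checks that OrganizationId and ProjectId are both set and joins them into a single string. The semicolon-delimited "Name=value;" layout, the helper name build_connection_string, and the placeholder IDs are assumptions made for this example; the connection manager dialog itself collects these values through its property grid.

```python
def build_connection_string(properties: dict[str, str]) -> str:
    """Join connection properties into an assumed 'Name=value;' string."""
    # The article names these two properties as the ones to set before connecting.
    required = ("OrganizationId", "ProjectId")
    missing = [name for name in required if not properties.get(name)]
    if missing:
        raise ValueError(f"Missing required properties: {', '.join(missing)}")
    return ";".join(f"{name}={value}" for name, value in properties.items()) + ";"


conn_str = build_connection_string({
    "OrganizationId": "123456789012",   # placeholder, not a real organization ID
    "ProjectId": "my-sample-project",   # placeholder, not a real project ID
})
print(conn_str)
```

Validating the properties up front surfaces a missing ID before the OAuth browser flow starts, which tends to be easier to diagnose than a failed connection.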

Configure the Google Data Catalog Source

Follow the steps below to specify the query to be used to extract Google Data Catalog data.

  1. Double-click the Google Data Catalog source to open the source component editor.
  2. In the Connection Manager menu, select the connection manager previously created.
  3. Specify the query to use for the data extraction. For example: SELECT Type, DatasetName FROM Schemas WHERE ProjectId = 'bigquery-public-data'
  4. Close the Google Data Catalog Source control and connect it to the ADO.NET Destination.

Configure the SQL Server Destination

Follow the steps below to specify the SQL Server table to load the Google Data Catalog data into.

  1. Open the ADO.NET Destination and add a New Connection. Enter your server and database information here.
  2. In the Data access mode menu, select "table or view".
  3. In the Table Or View menu, select the table or view to populate.
  4. On the Mappings screen, adjust the mappings between source and destination columns as needed.
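
If the destination table does not exist yet, it needs columns that line up with the ones the source query returns. The Python sketch below generates a CREATE TABLE statement for the two columns in the example query (Type and DatasetName). The table name dbo.GoogleDataCatalogSchemas, the NVARCHAR(255) column type, and the helper name create_table_sql are assumptions chosen for illustration; adjust them to match your own schema.

```python
def create_table_sql(table: str, columns: dict[str, str]) -> str:
    """Build a T-SQL CREATE TABLE statement from a column-name-to-type mapping."""
    cols = ",\n".join(
        f"    [{name}] {sql_type} NULL" for name, sql_type in columns.items()
    )
    return f"CREATE TABLE {table} (\n{cols}\n);"


ddl = create_table_sql(
    "[dbo].[GoogleDataCatalogSchemas]",   # hypothetical destination table name
    {
        "Type": "NVARCHAR(255)",          # assumed type; column name from the example query
        "DatasetName": "NVARCHAR(255)",   # assumed type; column name from the example query
    },
)
print(ddl)
```

Running the generated statement against the target database before opening the ADO.NET Destination makes the table available in the Table Or View menu, and giving the destination columns the same names as the source columns keeps the Mappings step simple.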

Run the Project

You can now run the project. After the SSIS Task has finished executing, your database will be populated with Google Data Catalog data.