Access Live Google Data Catalog Data in TIBCO Data Virtualization

Ready to get started?

Learn more:

TIBCO DV Adapters


Use the CData TIBCO DV Adapter for Google Data Catalog to create a Google Data Catalog data source in TIBCO Data Virtualization Studio and gain access to live Google Data Catalog data from your TDV Server.

TIBCO Data Virtualization (TDV) is an enterprise data virtualization solution that orchestrates access to multiple and varied data sources. When paired with the CData TIBCO DV Adapter for Google Data Catalog, you get federated access to live Google Data Catalog data directly within TIBCO Data Virtualization. This article walks through deploying an adapter and creating a new data source based on Google Data Catalog.

With built-in optimized data processing, the CData TIBCO DV Adapter offers unmatched performance for interacting with live Google Data Catalog data. When you issue complex SQL queries to Google Data Catalog, the adapter pushes supported SQL operations, like filters and aggregations, directly to Google Data Catalog. Its built-in dynamic metadata querying allows you to work with and analyze Google Data Catalog data using native data types.

Deploy the Google Data Catalog TIBCO DV Adapter

  1. In a console, navigate to the bin folder in the TDV Server installation directory. If there is a current version of the adapter installed, you will need to undeploy it.

    .\server_util.bat -server localhost -user admin -password ******** -undeploy -version 1 -name GoogleDataCatalog
    
  2. Extract the CData TIBCO DV Adapter to a local folder and deploy the JAR file (tdv.googledatacatalog.jar) to the server from the extract location.

    .\server_util.bat -server localhost -user admin -password ******** -deploy -package /PATH/TO/tdv.googledatacatalog.jar
    

You may need to restart the server to ensure the new JAR file is loaded properly, which can be accomplished by running the composite.bat script located at: C:\Program Files\TIBCO\TDV Server <version>\bin. Note that reauthenticating to the TDV Studio is required after restarting the server.

Sample Restart Call

.\composite.bat monitor restart

Authenticate with Google Data Catalog Using OAuth

Since Google Data Catalog authenticates using the OAuth protocol and TDV Studio does not support browser-based authentication internally, you will need to create and run a simple Java application to retrieve the OAuth tokens. Once retrieved, the tokens are used to connect to Google Data Catalog directly from the adapter.

The following code sample shows how to authenticate with Google Data Catalog. You will simply need to execute the Java application with the tdv.googledatacatalog.jar file in the class path.

GoogleDataCatalogOAuth oauth = new GoogleDataCatalogOAuth();  
oauth.generateOAuthSettingsFile("InitiateOAuth=GETANDREFRESH;" + 
                                  "ProjectId=YourProjectId;" + 
                                  "OAuthSettingsLocation=C:\googledatacatalog\OAuthSettings.txt;");

Once you deploy the adapter and authenticate, you can create a new data source for Google Data Catalog in TDV Studio.

Create a Google Data Catalog Data Source in TDV Studio

With the CData TIBCO DV Adapter for Google Data Catalog, you can easily create a data source for Google Data Catalog and introspect the data source to add resources to TDV.

Create the Data Source

  1. Right-click on the folder you wish to add the data source to and select New -> New Data Source.
  2. Scroll until you find the adapter (e.g. Google Data Catalog) and click Next.
  3. Name the data source (e.g. CData Google Data Catalog Source).
  4. Fill in the required connection properties.

    Google Data Catalog uses the OAuth authentication standard. Authorize access to Google APIs on behalf on individual users or on behalf of users in a domain.

    Before connecting, specify the following to identify the organization and project you would like to connect to:

    • OrganizationId: The ID associated with the Google Cloud Platform organization resource you would like to connect to. Find this by navigating to the cloud console.

      Click the project selection drop-down, and select your organization from the list. Then, click More -> Settings. The organization ID is displayed on this page.

    • ProjectId: The ID associated with the Google Cloud Platform project resource you would like to connect to.

      Find this by navigating to the cloud console dashboard and selecting your project from the Select from drop-down. The project ID will be present in the Project info card.

    When you connect, the OAuth endpoint opens in your default browser. Log in and grant permissions to the application to completes the OAuth process. For more information, refer to the OAuth section in the Help documentation.

    NOTE: Set the OAuthSettingsLocation property in the DV Adapter to the same value you used when performing the OAuth authentication (see above).

  5. Click Create & Close.

Introspect the Data Source

Once the data source is created, you can introspect the data source by right-clicking and selecting Open. In the dashboard, click Add/Remove Resources and select the Tables, Views, and Stored Procedures to include as part of the data source. Click Next and Finish to add the selected Google Data Catalog tables, views, and stored procedures as resources.

After creating and introspecting the data source, you are ready to work with Google Data Catalog data in TIBCO Data Virtualization just like you would any other relational data source. You can create views, query using SQL, publish the data source, and more.