Back Up Hugging Face data to SQL Server through SSIS

Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Effortlessly backup data to SQL Server by utilizing the CData API Driver for ADO.NET. In this article, we will employ an SSIS workflow to populate a database with Hugging Face data data.

This article illustrates using the Hugging Face ADO.NET Data Provider within a SQL Server SSIS workflow for the direct transfer of Hugging Face data to a Microsoft SQL Server database. It's worth noting that the identical process detailed below is applicable to any CData ADO.NET Data Providers, enabling the direct connection of SQL Server with remote data through SSIS.

  1. Open Visual Studio and create a new Integration Services project.
  2. Add a new Data Flow task from the toolbox onto the Control Flow screen.
  3. In the Data Flow screen, add an ADO.NET Source and an OLE DB Destination from the toolbox.

  4. Add a new connection and select .NET Providers\CData ADO.NET Provider for Hugging Face.
  5. In the connection manager, enter the connection details for Hugging Face data.

    HuggingFace Hub uses token-based authentication to enable access to its API. The API provides access to machine learning models, datasets, spaces, papers, and other resources on the HuggingFace Hub platform.

    Using API Key Authentication

    To authenticate to HuggingFace Hub, you will need to provide an API Key (Access Token). To obtain your access token:

    1. Log in to your HuggingFace account at https://huggingface.co
    2. Navigate to Settings > Access Tokens
    3. Click "New token" to create a new access token
    4. Select the appropriate permissions (read or write)
    5. Copy the token value

    After obtaining your access token, set the following connection properties:

    • AuthScheme: Set this to APIKey.
    • APIKey: Set this to your HuggingFace access token.

    Example connection string

    Profile=C:\profiles\HuggingFace.apip;ProfileSettings='APIKey=hf_xxxxxxxxxxxxxxxxxxxx';
    
  6. Open the DataReader editor and set the following information:

    • ADO.NET connection manager: In the Connection Managers menu, select the Data Connection you just created.
    • Data access mode: Select 'SQL command'.
    • SQL command text: In the DataReader Source editor, open the Component Properties tab and enter a SELECT command, such as the one below:
      SELECT ,  FROM Collections WHERE  = ''
  7. Close the DataReader editor and drag the arrow below the DataReader Source to connect it to the OLE DB Destination.
  8. Open the OLE DB Destination and enter the following information in the Destination Component Editor.

    • Connection manager: Add a new connection. Enter your server and database information here. In this example, SQLExpress is running on a separate machine.
    • Data access mode: Set your data access mode to "table or view" and select the table or view to populate in your database.
  9. Configure any properties you wish on the Mappings screen.

  10. Close the OLE DB Destination Editor and run the project. After the SSIS task has finished executing, your database will be populated with data obtained from Hugging Face data.

Ready to get started?

Connect to live data from Hugging Face with the API Driver

Connect to Hugging Face