Create Datasets from Hugging Face in Domo Workbench and Build Visualizations of Hugging Face Data in Domo
Domo helps you manage, analyze, and share data across your entire organization, enabling decision makers to identify and act on strategic opportunities. Domo Workbench provides a secure, client-side solution for uploading your on-premise data to Domo. The CData ODBC Driver for Hugging Face links Domo Workbench to operational Hugging Face data. You can build datasets from Hugging Face data using standard SQL queries in Workbench and then create real-time visualizations of Hugging Face data in the Domo service.
The CData ODBC Drivers offer unmatched performance for interacting with live Hugging Face data in Domo due to optimized data processing built into the driver. When you issue complex SQL queries from Domo to Hugging Face, the driver pushes supported SQL operations, like filters and aggregations, directly to Hugging Face and utilizes the embedded SQL Engine to process unsupported operations (often SQL functions and JOIN operations) client-side. With built-in dynamic metadata querying, you can visualize and analyze Hugging Face data using native Domo data types.
Connect to Hugging Face as an ODBC Data Source
If you have not already, first specify connection properties in an ODBC DSN (data source name). This is the last step of the driver installation. You can use the Microsoft ODBC Data Source Administrator to create and configure ODBC DSNs.
The Hugging Face Hub uses token-based authentication to control access to its API. The API provides access to machine learning models, datasets, spaces, papers, and other resources on the Hugging Face Hub platform.
Using API Key Authentication
To authenticate to the Hugging Face Hub, you need to provide an API Key (Access Token). To obtain your access token:
- Log in to your Hugging Face account at https://huggingface.co
- Navigate to Settings > Access Tokens
- Click "New token" to create a new access token
- Select the appropriate permissions (read or write)
- Copy the token value
After obtaining your access token, set the following connection properties:
- AuthScheme: Set this to APIKey.
- APIKey: Set this to your Hugging Face access token.
Example connection string
Profile=C:\profiles\HuggingFace.apip;ProfileSettings='APIKey=hf_xxxxxxxxxxxxxxxxxxxx';
When you configure the DSN, you may also want to set the Max Rows connection property. This will limit the number of rows returned, which is especially helpful for improving performance when designing reports and visualizations.
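As a rough illustration of how these properties fit together, the Python sketch below assembles an ODBC-style connection string from the properties listed above plus a row limit. It is a sketch under stated assumptions, not the driver's documented format: the `build_connection_string` helper and the `HF_TOKEN` environment variable are hypothetical names introduced here, and the `MaxRows` key spelling is an assumption based on the Max Rows property mentioned above. Check the driver's help documentation for the exact property names it accepts.

```python
import os


def build_connection_string(properties):
    """Join key/value pairs into an ODBC-style 'Key=Value;' string.

    Hypothetical helper for illustration; it is not part of any driver.
    """
    parts = []
    for key, value in properties.items():
        text = str(value)
        # ODBC convention: a value containing ';' or braces is wrapped in
        # braces, and any closing brace inside the value is doubled.
        if any(ch in text for ch in ";{}"):
            text = "{" + text.replace("}", "}}") + "}"
        parts.append(f"{key}={text}")
    return ";".join(parts) + ";"


# Read the token from an environment variable (assumed name) so that it is
# not hard-coded; the fallback is the placeholder used in this article.
token = os.environ.get("HF_TOKEN", "hf_xxxxxxxxxxxxxxxxxxxx")

connection_string = build_connection_string({
    "AuthScheme": "APIKey",   # from the property list above
    "APIKey": token,          # from the property list above
    "MaxRows": 1000,          # assumed key spelling for the Max Rows property
})
```

Keeping the token in an environment variable rather than in the string itself avoids committing the secret to scripts or version control.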
After creating a DSN, you will need to create a dataset for Hugging Face in Domo Workbench using the Hugging Face DSN and build a visualization in the Domo service based on the dataset.
Build a Dataset for Hugging Face Data
You can follow the steps below to build a dataset based on a table in Hugging Face in Domo Workbench using the CData ODBC Driver for Hugging Face.
- Open Domo Workbench and, if you have not already, add your Domo service server to Workbench. In the Accounts submenu, click Add New, type in the server address (e.g., domain.domo.com), and click through the wizard to authenticate.
- In the DataSet Jobs submenu, click Add New.
- Name the dataset job (e.g., ODBC Hugging Face Collections), select ODBC Connection Provider as the transport method, and click through the wizard.
- In the newly created DataSet Job, navigate to Source and click to configure the settings.
- Select System DSN for the Connection Type.
- Select the previously configured DSN (CData API Sys) for the System DSN.
- Click to validate the configuration.
- Below the settings, set the Query to a SQL query:
SELECT * FROM Collections
NOTE: By connecting to Hugging Face data using an ODBC driver, you simply need to know SQL in order to get your data, circumventing the need to know Hugging Face-specific APIs or protocols.
- Click preview.
- Check over the generated schema, add any transformations, then save and run the dataset job.
Once the dataset job has run, the dataset is accessible from the Domo service, allowing you to build visualizations, reports, and more based on Hugging Face data.
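Before pasting a query into a DataSet Job, it can be useful to check it against the DSN from a script. The following Python sketch shows one way to do that with a small `preview_query` helper that works on any DB-API connection. The helper names are hypothetical, `pyodbc` is a third-party package that would need to be installed separately, and the demo at the bottom uses an in-memory SQLite table with made-up columns as a stand-in, since the actual columns of the Collections table are not described here.

```python
import sqlite3


def preview_query(connection, sql, limit=5):
    """Run sql on a DB-API connection; return (column names, first rows)."""
    cursor = connection.cursor()
    try:
        cursor.execute(sql)
        columns = [col[0] for col in cursor.description]
        rows = [tuple(row) for row in cursor.fetchmany(limit)]
    finally:
        cursor.close()
    return columns, rows


def open_dsn(dsn_name):
    """Open a configured ODBC DSN with pyodbc (imported lazily)."""
    import pyodbc  # third-party; only needed when a real DSN is used
    return pyodbc.connect(f"DSN={dsn_name}")


# Stand-in demo: an in-memory SQLite table named Collections.
# The Id and Title columns are invented for illustration only.
demo = sqlite3.connect(":memory:")
demo.execute("CREATE TABLE Collections (Id TEXT, Title TEXT)")
demo.executemany(
    "INSERT INTO Collections VALUES (?, ?)",
    [("1", "Example A"), ("2", "Example B")],
)
columns, rows = preview_query(demo, "SELECT * FROM Collections", limit=1)
print(columns, rows)
```

Against the real driver, the same helper could be called as `preview_query(open_dsn("CData API Sys"), "SELECT * FROM Collections")`, using the DSN name from the steps above.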
Create Data Visualizations
With the DataSet Job saved and run in Domo Workbench, we are ready to build visualizations of the Hugging Face data in the Domo service.
- Navigate to the Data Center.
- In the data warehouse, select the ODBC data source and drill down to our new dataset.
- With the dataset selected, choose to create a visualization.
- In the new card:
  - Drag a Dimension to the X Value.
  - Drag a Measure to the Y Value.
  - Choose a Visualization.
With the CData ODBC Driver for Hugging Face, you can build custom datasets based on Hugging Face data using only SQL in Domo Workbench and then build and share visualizations and reports through the Domo service.