Replicate DataRobot Data from PowerShell

Ready to get started?

Download for a free trial:

Download Now

Learn more:

DataRobot ODBC Driver

The DataRobot ODBC Driver is a powerful tool that allows you to connect with live DataRobot data, directly from any applications that support ODBC connectivity.

Access DataRobot like you would a database - access Projects, Models, Predictions, etc. through a standard ODBC Driver interface.



Write a quick PowerShell script to query DataRobot data. Use connectivity to the live data to replicate DataRobot data to SQL Server.



The CData ODBC Driver for DataRobot enables out-of-the-box integration with Microsoft's built-in support for ODBC. The ODBC driver instantly integrates connectivity to the real DataRobot data with PowerShell.

You can use the .NET Framework Provider for ODBC built into PowerShell to quickly automate integration tasks like replicating DataRobot data to other databases. This article shows how to replicate DataRobot data to SQL Server in 5 lines of code.

You can also write PowerShell code to download DataRobot data. See the examples below.

Create an ODBC Data Source for DataRobot

If you have not already, first specify connection properties in an ODBC DSN (data source name). This is the last step of the driver installation. You can use the Microsoft ODBC Data Source Administrator to create and configure ODBC DSNs.

To connect to DataRobot, the following connection properties are required: User, Password, and PredictionInstance. DataRobotKey may also be required depending on your type of DataRobot predictions instance. If using the Predictions API, DataFile is required. The CSV DataFile should include a header row as the first row of the datafile. APIKey is not required, but can be supplied. If not supplied, the driver will handle obtaining an APIKey.

User, DataRobotKey, and APIKey are the credentials for the DataRobot account.

ProjectID, DataFile, and ModelId are the parameters for the project, dataset, and model type.

Connect to DataRobot

The code below shows how to use the DSN to initialize the connection to DataRobot data in PowerShell:

$conn = New-Object System.Data.Odbc.OdbcConnection $conn.ConnectionString = "DSN=CData DataRobot Source x64"

Back Up DataRobot Data to SQL Server

After you enable caching, you can use the code below to replicate data to SQL Server.

Set the following connection properties to configure the caching database:

  • CacheProvider: The name of the ADO.NET provider. This can be found in the Machine.config for your version of .NET. For example, to configure SQL Server, enter System.Data.SqlClient.

  • CacheConnection: The connection string of properties required to connect to the database. Below is an example for SQL Server:

    Server=localhost;Database=RSB;User Id=sqltest;Password=sqltest;

The SQL query in the example can be used to refresh the entire cached table, including its schema. Any already existing cache is deleted.

$conn.Open() # Create and execute the SQL Query $SQL = "CACHE DROP EXISTING SELECT * FROM " + $Predictions $cmd = New-Object System.Data.Odbc.OdbcCommand($sql,$conn) $count = $cmd.ExecuteNonQuery() $conn.Close()

The driver gives you complete control over the caching functionality. See the help documentation for more caching commands and usage examples. See the help documentation for steps to replicate to other databases.

Other Operations

To retrieve DataRobot data in PowerShell, call the Fill method of the OdbcDataAdapter method. To execute data manipulation commands, initialize the OdbcCommand object and then call ExecuteNonQuery. Below are some more examples commands to DataRobot through the .NET Framework Provider for ODBC:

Retrieve DataRobot Data

$sql="SELECT Id, Prediction1Value from Predictions" $da= New-Object System.Data.Odbc.OdbcDataAdapter($sql, $conn) $dt= New-Object System.Data.DataTable $da.Fill($dt) $dt.Rows | foreach { $dt.Columns | foreach ($col in dt{ Write-Host $1[$_] } }