Replicate Lakebase Data to Multiple Databases via the CData Sync CLI
Always-on applications rely on automatic failover capabilities and real-time access to data. CData Sync for Lakebase integrates live Lakebase data into your mirrored databases, always-on cloud databases, and other databases such as your reporting server: Automatically synchronize with remote Lakebase data from Windows or any machine running Java.
You can use Sync's command-line interface (CLI) to easily control almost all aspects of the replication. You can use the CLI to replicate Lakebase data to one or many databases without any need to change your configuration.
Connect to Lakebase Data
You can save connection strings and other settings like email notifications in XML configuration files.
The following example shows how to replicate to SQLite.
Windows
<?xml version="1.0" encoding="UTF-8" ?> <CDataSync><DatabaseType>SQLite</DatabaseType> <DatabaseProvider>System.Data.SQLite</DatabaseProvider> <ConnectionString>DatabricksInstance=lakebase;Server=127.0.0.1;Port=5432;Database=my_database;InitiateOAuth=GETANDREFRESH;</ConnectionString> <ReplicateAll>False</ReplicateAll> <NotificationUserName></NotificationUserName> <DatabaseConnectionString>Data Source=C:\my.db</DatabaseConnectionString> <TaskSchedulerStartTime>09:51</TaskSchedulerStartTime> <TaskSchedulerInterval>Never</TaskSchedulerInterval> </CDataSync>
Java
<?xml version="1.0" encoding="UTF-8" ?> <CDataSync><DatabaseType>SQLite</DatabaseType><DatabaseProvider>org.sqlite.JDBC</DatabaseProvider> <ConnectionString>DatabricksInstance=lakebase;Server=127.0.0.1;Port=5432;Database=my_database;InitiateOAuth=GETANDREFRESH;</ConnectionString> <ReplicateAll>False</ReplicateAll> <NotificationUserName></NotificationUserName> <DatabaseConnectionString>Data Source=C:\my.db</DatabaseConnectionString> </CDataSync>To connect to Databricks Lakebase, start by setting the following properties:
- DatabricksInstance: The Databricks instance or server hostname, provided in the format instance-abcdef12-3456-7890-abcd-abcdef123456.database.cloud.databricks.com.
- Server: The host name or IP address of the server hosting the Lakebase database.
- Port (optional): The port of the server hosting the Lakebase database, set to 5432 by default.
- Database (optional): The database to connect to after authenticating to the Lakebase Server, set to the authenticating user's default database by default.
OAuth Client Authentication
To authenicate using OAuth client credentials, you need to configure an OAuth client in your service principal. In short, you need to do the following:
- Create and configure a new service principal
- Assign permissions to the service principal
- Create an OAuth secret for the service principal
For more information, refer to the Setting Up OAuthClient Authentication section in the Help documentation.
OAuth PKCE Authentication
To authenticate using the OAuth code type with PKCE (Proof Key for Code Exchange), set the following properties:
- AuthScheme: OAuthPKCE.
- User: The authenticating user's user ID.
For more information, refer to the Help documentation.
Configure Replication Queries
Sync enables you to control replication with standard SQL. The REPLICATE statement is a high-level command that caches and maintains a table in your database. You can define any SELECT query supported by the Lakebase API. The statement below caches and incrementally updates a table of Lakebase data:
REPLICATE Orders;
You can specify a file containing the replication queries. This enables you to use the same replication queries to replicate to several databases.
Run Sync
After you have configured the connection strings and replication queries, you can run Sync with the following command-line options:
Windows
LakebaseSync.exe -g MySQLiteConfig.xml -f LakebaseSync.sql
Java
java -Xbootclasspath/p:c:\sqlitejdbc.jar -jar LakebaseSync.jar -g MySQLiteConfig.xml -f LakebaseSync.sql