The CData JDBC Driver for ADLS integrates Azure Data Lake Storage data with the wizards and analytics in IDEs such as Aqua Data Studio. This article explains how to connect to Azure Data Lake Storage data through the connection manager and run queries against it.
Configuring the JDBC Data Source
In the connection manager, set the connection properties for a new JDBC data source and save it. Azure Data Lake Storage data is then available to the Aqua Data Studio tools.
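- In Aqua Data Studio, select Register Server from the Server menu.
- In the Register Server form, select the 'Generic - JDBC' connection.
- Set the following JDBC connection properties:
- Name: Any name, such as the name of the data source.
- Driver Location: Click the Browse button and select the cdata.jdbc.adls.jar file in the lib folder of the installation directory.
- Driver: Enter cdata.jdbc.adls.ADLSDriver as the class name.
- URL: Enter the JDBC URL, which starts with jdbc:adls: and is followed by the connection properties as a semicolon-separated list of name=value pairs. See the help documentation for details on the connection properties. A typical connection string is: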
jdbc:adls:Schema=ADLSGen2;Account=myAccount;FileSystem=myFileSystem;AccessKey=myAccessKey;
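The structure of that connection string can be sketched in Java. This is only an illustration of the jdbc:adls: prefix followed by semicolon-terminated name=value pairs. The class and method names (AdlsJdbcUrl, build, sampleGen2Properties) are hypothetical and not part of the driver, and the values are the placeholders from the sample above.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of the CData driver.
final class AdlsJdbcUrl {
    // Prefix taken from the article: the JDBC URL starts with jdbc:adls:
    static final String PREFIX = "jdbc:adls:";

    // Appends each name=value pair followed by a semicolon, in insertion order.
    static String build(Map<String, String> properties) {
        StringBuilder url = new StringBuilder(PREFIX);
        for (Map.Entry<String, String> e : properties.entrySet()) {
            url.append(e.getKey()).append('=').append(e.getValue()).append(';');
        }
        return url.toString();
    }

    // Placeholder values copied from the sample connection string.
    static Map<String, String> sampleGen2Properties() {
        Map<String, String> p = new LinkedHashMap<>();
        p.put("Schema", "ADLSGen2");
        p.put("Account", "myAccount");
        p.put("FileSystem", "myFileSystem");
        p.put("AccessKey", "myAccessKey");
        return p;
    }
}
```

Calling AdlsJdbcUrl.build(AdlsJdbcUrl.sampleGen2Properties()) reproduces the sample string. The sketch does not escape values that themselves contain a semicolon or an equals sign.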
Authenticating to a Gen 1 DataLakeStore Account
Gen 1 uses OAuth 2.0 in Azure AD for authentication.
For this, an Active Directory web application is required. You can create one as follows:
- Sign in to your Azure Account through the Azure portal.
- Select "Azure Active Directory".
- Select "App registrations".
- Select "New application registration".
- Provide a name and URL for the application. Select Web app for the type of application you want to create.
- Select "Required permissions" and change the required permissions for this app. At a minimum, "Azure Data Lake" and "Windows Azure Service Management API" are required.
- Select "Key" and generate a new key. Add a description, a duration, and take note of the generated key. You won't be able to see it again.
To authenticate against a Gen 1 DataLakeStore account, the following properties are required:
- Schema: Set this to ADLSGen1.
- Account: Set this to the name of the account.
- OAuthClientId: Set this to the application Id of the app you created.
- OAuthClientSecret: Set this to the key generated for the app you created.
- TenantId: Set this to the tenant Id. See the TenantId property in the help documentation for more information on how to acquire this.
- Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.
Authenticating to a Gen 2 DataLakeStore Account
To authenticate against a Gen 2 DataLakeStore account, the following properties are required:
- Schema: Set this to ADLSGen2.
- Account: Set this to the name of the account.
- FileSystem: Set this to the file system which will be used for this account.
- AccessKey: Set this to the access key which will be used to authenticate the calls to the API. See the AccessKey property in the help documentation for more information on how to acquire this.
- Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.
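Before registering the server, it can help to check that a property set is complete. The following is a minimal sketch of such a check, based only on the two property lists above. The class AdlsRequiredProperties and its missing method are hypothetical names, and Directory is treated as optional because it falls back to the root directory when not specified.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;

// Hypothetical pre-flight check, not part of the CData driver.
final class AdlsRequiredProperties {
    // Property names taken from the Gen 1 and Gen 2 lists in this article.
    static final List<String> GEN1 = Arrays.asList(
        "Schema", "Account", "OAuthClientId", "OAuthClientSecret", "TenantId");
    static final List<String> GEN2 = Arrays.asList(
        "Schema", "Account", "FileSystem", "AccessKey");

    // Returns the required property names that are absent or blank.
    static List<String> missing(Map<String, String> properties) {
        String schema = properties.get("Schema");
        List<String> required;
        if ("ADLSGen1".equals(schema)) {
            required = GEN1;
        } else if ("ADLSGen2".equals(schema)) {
            required = GEN2;
        } else {
            throw new IllegalArgumentException("Schema must be ADLSGen1 or ADLSGen2");
        }
        List<String> missing = new ArrayList<>();
        for (String name : required) {
            String value = properties.get(name);
            if (value == null || value.trim().isEmpty()) {
                missing.add(name);
            }
        }
        return missing;
    }
}
```

An empty result means every required property for the chosen Schema has a value. It does not confirm that the credentials are valid.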
Querying Azure Data Lake Storage Data
Try running a query against the connected tables.
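To see which tables are available to query, the same connection can also be exercised outside Aqua Data Studio with the standard java.sql API. The sketch below assumes that cdata.jdbc.adls.jar is on the classpath and that the driver registers itself with DriverManager. The class name AdlsTableLister is hypothetical, and only standard JDBC metadata calls are used, so no table names are assumed.

```java
import java.sql.Connection;
import java.sql.DatabaseMetaData;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the CData driver.
final class AdlsTableLister {
    // Opens a connection for the given JDBC URL and returns the table names
    // reported by the driver's metadata.
    static List<String> listTables(String jdbcUrl) throws SQLException {
        List<String> names = new ArrayList<>();
        try (Connection conn = DriverManager.getConnection(jdbcUrl)) {
            DatabaseMetaData meta = conn.getMetaData();
            // null catalog, schema, and types mean "do not filter"; "%" matches any name.
            try (ResultSet rs = meta.getTables(null, null, "%", null)) {
                while (rs.next()) {
                    names.add(rs.getString("TABLE_NAME"));
                }
            }
        }
        return names;
    }
}
```

Any name returned can then be used in a SELECT statement in the Aqua Data Studio query window. If the driver jar is not on the classpath, DriverManager throws an SQLException reporting that no suitable driver was found.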