A PostgreSQL Interface for Amazon Athena Data (MySQL Remoting via JDBC)
Use the Remoting features of the Amazon Athena JDBC Driver to create a PostgreSQL entry-point for data access.
A vast number of PostgreSQL clients are available, from standard drivers to BI and analytics tools, which makes PostgreSQL a popular interface for data access. Using the remoting features of our JDBC Drivers, you can create PostgreSQL entry-points that you can connect to from any standard client.
To access Amazon Athena data as a PostgreSQL database, use the Remoting feature of the CData JDBC Driver for Amazon Athena and the MySQL foreign data wrapper (FDW) from EnterpriseDB. In this article, we install the FDW and query Amazon Athena data from PostgreSQL Server.
About Amazon Athena Data Integration
CData provides the easiest way to access and integrate live data from Amazon Athena. Customers use CData connectivity to:
- Authenticate securely using a variety of methods, including IAM credentials, access keys, and Instance Profiles, catering to diverse security needs and simplifying the authentication process.
- Streamline their setup and quickly resolve issues with detailed error messaging.
- Enhance performance and minimize strain on client resources with server-side query execution.
Users frequently integrate Athena with tools like Tableau, Power BI, and Excel to run in-depth analytics from the applications they already use.
To learn more about unique Amazon Athena use cases with CData, check out our blog post: https://www.cdata.com/blog/amazon-athena-use-cases.
Getting Started
Configure the Connection to Amazon Athena
Follow the steps below to configure the driver's MySQL daemon to use the credentials and other connection properties needed to connect to Amazon Athena. The MySQL daemon exposes Amazon Athena data as a MySQL database named CDataAmazonAthena. Add connection properties to the databases section of the configuration file for the daemon. The configuration file for the daemon is located in the lib subfolder of the installation directory for the driver.
Below is a typical connection string:
[databases]
amazon athena = "AWSAccessKey='a123';AWSSecretKey='s123';AWSRegion='IRELAND';Database='sampledb';S3StagingDirectory='s3://bucket/staging/';"
Additionally, create a user in the users section.
You can find all of the configuration options for the MySQL daemon in the help documentation.
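Quoting mistakes are easy to make when the connection string is written by hand. As a minimal sketch, the Python helper below assembles the [databases] entry shown above from a dictionary of properties. The function names build_connection_string and databases_entry are hypothetical, and rejecting values that contain a single quote or semicolon is an assumption made here, since this article does not cover the driver's escaping rules.

```python
def build_connection_string(props):
    """Join properties into the Key='value'; form used in the example above."""
    parts = []
    for key, value in props.items():
        # Assumption: escaping rules are not covered here, so reject
        # characters that would break the Key='value'; structure.
        if "'" in value or ";" in value:
            raise ValueError(f"unsupported character in value for {key!r}")
        parts.append(f"{key}='{value}';")
    return "".join(parts)


def databases_entry(name, props):
    """Format one line for the [databases] section of the daemon config."""
    return f'{name} = "{build_connection_string(props)}"'


# Placeholder values copied from the example in this article.
props = {
    "AWSAccessKey": "a123",
    "AWSSecretKey": "s123",
    "AWSRegion": "IRELAND",
    "Database": "sampledb",
    "S3StagingDirectory": "s3://bucket/staging/",
}
entry = databases_entry("amazon athena", props)

print("[databases]")
print(entry)
```

The printed lines can be pasted into the daemon's configuration file. Keeping the properties in a dictionary makes it harder to drop a closing quote or a trailing semicolon.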
Start the Remoting Service
Follow the steps below to enable the MySQL Remoting feature of the CData JDBC Driver for Amazon Athena.
The driver creates a default configuration suitable for testing: Simply start the service to connect to Amazon Athena data.
- Start the MySQL Remoting Service with the following command:
java -jar cdata.jdbc.amazonathena.jar -f cdata.jdbc.amazonathena.remoting.ini
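Before moving on to PostgreSQL, it can help to confirm that the service is accepting connections. The sketch below is one way to check this from Python using only the standard library. The function name daemon_is_listening is hypothetical, and the default host and port (127.0.0.1 and 3309) are taken from the CREATE SERVER statement used later in this article; adjust them if your daemon is configured differently.

```python
import socket


def daemon_is_listening(host="127.0.0.1", port=3309, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds, else False."""
    try:
        # create_connection raises OSError (refused, timed out, unreachable)
        # when nothing is accepting connections at the address.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    print("remoting service reachable:", daemon_is_listening())
```

This only confirms that something is listening on the port. It does not verify the credentials or the connection to Amazon Athena.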
Build and Install the MySQL Foreign Data Wrapper
The Foreign Data Wrapper can be installed as an extension to PostgreSQL, without recompiling PostgreSQL.
If pgxn is available for your operating system, you can install with the following:
pgxn install mysql_fdw USE_PGXS=1
Otherwise, follow the steps below to build it yourself:
- Install the MySQL C client library and obtain the source for the EnterpriseDB FDW for MySQL (from GitHub, for example).
- Build the FDW. Add the pg_config and mysql_config executables to your PATH:
env PATH=/usr/local/pgsql/bin:/usr/local/mysql/bin:$PATH make USE_PGXS=1
- Install the FDW:
make USE_PGXS=1 install
To complete the installation, make the libmysqlclient library loadable in the environment where PostgreSQL runs; for example, by adding its directory to the library search path.
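As a rough check that the client library can be found, the sketch below asks Python's standard ctypes.util.find_library to locate it. The function name find_mysql_client is hypothetical. This is only a heuristic: the lookup runs in your shell's environment rather than in the PostgreSQL server's, so a result here does not guarantee that the FDW can load the library, and None does not prove that it cannot.

```python
from ctypes.util import find_library


def find_mysql_client():
    """Return the name or path resolved for libmysqlclient, or None."""
    # find_library takes the name without the "lib" prefix or file suffix.
    return find_library("mysqlclient")


if __name__ == "__main__":
    located = find_mysql_client()
    if located is None:
        print("libmysqlclient was not found by the system lookup")
    else:
        print("libmysqlclient resolved to:", located)
```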
Query Amazon Athena Data as a PostgreSQL Database
After you have installed the extension, follow the steps below to start executing queries to Amazon Athena data:
- Log into your database.
- Load the extension for the database:
postgres=# CREATE EXTENSION mysql_fdw;
- Create a server object for Amazon Athena data:
postgres=# CREATE SERVER AmazonAthena FOREIGN DATA WRAPPER mysql_fdw OPTIONS (host '127.0.0.1', port '3309');
- Create a user mapping for the username and password of a user known to the MySQL daemon:
postgres=# CREATE USER MAPPING FOR postgres SERVER AmazonAthena OPTIONS (username 'admin', password 'test');
- Create the local schema:
postgres=# CREATE SCHEMA AmazonAthena_db;
- Import all the tables in the Amazon Athena database you defined in the daemon configuration file:
postgres=# IMPORT FOREIGN SCHEMA "AmazonAthena" FROM SERVER AmazonAthena INTO AmazonAthena_db;
You can now execute read/write commands to Amazon Athena:
postgres=# SELECT * FROM AmazonAthena_db."customers";
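If you repeat this setup across environments, the statements above can be generated from a few parameters. The sketch below is one way to do that in Python. The helper names sql_literal and fdw_setup_statements are hypothetical, and the code assumes that the server, schema, and role names are trusted, simple identifiers, because they are inserted without quoting.

```python
def sql_literal(value):
    """Quote a value as a SQL string literal, doubling embedded quotes."""
    return "'" + str(value).replace("'", "''") + "'"


def fdw_setup_statements(server, remote_schema, local_schema,
                         host, port, pg_user, username, password):
    """Return the setup statements from this article, in order."""
    # Assumption: server, local_schema, and pg_user are trusted identifiers.
    return [
        "CREATE EXTENSION mysql_fdw;",
        f"CREATE SERVER {server} FOREIGN DATA WRAPPER mysql_fdw "
        f"OPTIONS (host {sql_literal(host)}, port {sql_literal(port)});",
        f"CREATE USER MAPPING FOR {pg_user} SERVER {server} "
        f"OPTIONS (username {sql_literal(username)}, "
        f"password {sql_literal(password)});",
        f"CREATE SCHEMA {local_schema};",
        f'IMPORT FOREIGN SCHEMA "{remote_schema}" FROM SERVER {server} '
        f"INTO {local_schema};",
    ]


# Values taken from the statements shown in this article.
statements = fdw_setup_statements(
    server="AmazonAthena",
    remote_schema="AmazonAthena",
    local_schema="AmazonAthena_db",
    host="127.0.0.1",
    port=3309,
    pg_user="postgres",
    username="admin",
    password="test",
)

for statement in statements:
    print(statement)
```

The output can be saved to a file and run with your usual PostgreSQL client. Option values go through sql_literal so that a password containing a single quote does not break the statement.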