ODBC
HDFS Logo

HDFS ODBC Driver

SQL-based access to HDFS through ODBC for seamless integration with BI, ETL, and reporting with just a few clicks.

download buy now

Other Technologies

The HDFS ODBC Driver is a powerful tool that allows you to connect with live data from HDFS, directly from any applications that support ODBC connectivity. Access HDFS data like you would a database - read, write, and update HDFS HDFSData, etc. through a standard ODBC Driver interface.

ODBC architecture

HDFS ODBC Connectivity Features

  • SQL access to Hadoop Distributed File System data
  • Use SQL Stored Procedures to perform actions like creating files, appending data, setting permission, and more
  • Connect to live Apache HDFS data, for real-time data access with the Apache HDFS ODBC Driver
  • Full support for data aggregation and complex JOINs in SQL queries
  • Generate table schema automatically based on existing Apache HDFS data or manually for greater control of the content you need
  • Seamless integration with leading BI, reporting, and ETL tools and with custom applications via the HDFS Connector.

Target Service, API

The driver connects to Hadoop Distributed File System. Big data file storage.

Schema, Data Model

Models HDFS directories and files as tables. Supports various file formats.

Key Objects

Files, Directories, and Metadata. Hadoop file system access.

Operations

Read and write operations on HDFS. File format parsing. WebHDFS support.

Authentication

Kerberos or simple authentication. NameNode connection required.

Start a 30-day Free Trial Today

Robust ODBC Access to Apache HDFS

Full-featured and consistent SQL access to any supported data source through ODBC

  • Standards-Compliant ODBC

    ODBC 3.8 compliant native protocol based drivers.

  • Unicode, 32-bit and 64-bit support

    Support 32-bit and 64-bit applications and unicode ODBC APIs.

  • Cross-platform ODBC (Windows/Linux/Mac)

    Available for Microsoft Windows, Unix, and Mac (macOS / OS X) platforms.

  • Certified ODBC Compatibility*

    Our drivers undergo extensive testing and are certified to be compatible with leading analytics and reporting applications like Tableau, Microsoft Excel, and many more.

    * Please visit our knowledge base to access how-to articles.
  • Embedded Remoting Capabilities

    Our exclusive Remoting feature allows hosting the ODBC connection on a server to enable connections from various clients on any platform (Java, .NET, C++, PHP, Python, etc.), using any standards based technology (ODBC, JDBC, etc.). ODBC Remoting is enabled using the popular MySQL and SQL (TDS) wire protocol server.

  • String, Date, Numeric SQL Functions

    The driver includes a library of 50 plus functions that can manipulate column values into the desired result. Popular examples include Regex, JSON, and XML processing functions.

  • Collaborative Query Processing

    Our drivers enhance the data source's capabilities by additional client-side processing, when needed, to enable analytic summaries of data such as SUM, AVG, MAX, MIN, etc.

  • Easily Customizable and Configurable

    The data model exposed by our ODBC Drivers can easily be customized to add or remove tables/columns, change data types, etc. without requiring a new build. These customizations are supported at runtime using human-readable schema files that are easy to edit.

  • Enterprise-class Secure Connectivity

    Includes standard Enterprise-class security features such as TLS/ SSL data encryption for all client-server communications.

  • Replication and Caching

    Our replication and caching commands make it easy to copy data to local and cloud data stores such as Oracle, SQL Server, Google Cloud SQL, etc. The replication commands include many features that allow for intelligent incremental updates to cached data.

See what you can do with HDFS ODBC Driver

Server
SQL
SQL Server Linked-Server

Connect to HDFS from SQL Server as Linked-Server. You can query HDFS from familiar Transact-SQL without need of setting up another application.

Informatica
Data Integration & ETL

Integrate HDFS to your system through popular ETL / EAI tools such as Informatica PowerCenter. Standard SQL and Stored Procedures enable fast delivery of your integration projects.

Reporting & BI
Reporting & BI

Connect HDFS from any ODBC compatible tools:

Desktop
MS Access / Excel / PowerQuery
MS Access / Excel / PowerQuery

Your familiar tools can connect to HDFS using CData ODBC.

Connect to HDFS — empower every team


  • Easiest way to connect to HDFS
  • Dashboard / Reports that connect with HDFS
  • Data drives business, but IT teams are overwhelmed by accessing numerous siloed sources like HDFS. CData ODBC Drivers offer the fastest, easiest way to connect HDFS to BI tools like SAP BusinessObjects, Crystal Reports, Qlik, or Access.

Integrate HDFS to your system:

  • Custom application integration with SQL
  • SQL Server Linked Server
  • Informatica PowerCenter

Need to integrate your internal systems with HDFS? CData ODBC Drivers let you do it easily—access HDFS as a SQL Server Linked Server or connect via tools like Informatica PowerCenter using an SQL-based driver, no extra middleware required.

  • Data Client of your choice
  • Excel, MS Access Linked-table, PowerQuery

Still exporting and importing CSVs manually or with scripts? CData ODBC Drivers give you live access to HDFS — no middleware or files needed. Use your preferred tools and stay in control of your HDFS data.

Management & Data Consumer
IT Department
IT Pro, Infrastructure

Frequently Asked HDFS ODBC Driver Questions

Learn more about HDFS ODBC drivers for data and analytics integration

Not natively. However, CData offers an ODBC driver for HDFS that allows you to connect to HDFS data from any ODBC-compliant application, just like you would access a traditional database. This can be useful for tasks like:

  • Accessing HDFS from applications: Connect to HDFS data in popular tools and applications including Microsoft Excel, Tableau, Power BI, and many others.
  • Real-time data: You can work with live HDFS data within these applications, enabling tasks like reporting and analysis.
  • Connecting systems: Build data integrations between HDFS and other systems.

The HDFS ODBC driver is an ODBC 3.8 compliant native protocol-based driver with comprehensive ANSI SQL-92 support. This means that virtually any application that can connect to data via ODBC, can use the CData ODBC driver for real-time integration. Download a fully functional free trial of the HDFS ODBC driver today to get started.

Yes, the CData ODBC driver for HDFS provides universal ODBC data connectivity for HDFS. The HDFS ODBC driver offers a simple SQL-based layer of abstraction that simplifies real-time data access for users and applications, enabling them to communicate with HDFS using a standardized set of functions. Virtually any application on any platform (including Windows, macOS, or *nix) can use the CData ODBC driver for real-time integration.

Connectivity to HDFS via ODBC is easy. First, download and install the HDFS ODBC driver. The download page includes separate installers for Windows, macOS, and Unix, so please select the HDFS ODBC driver that matches the platform where you intend to use the driver.

Once the installation is complete, navigate to the ODBC driver documentation page. Here, you'll find a wealth of information about the installed driver. The step-by-step instructions for creating a DSN and using it to connect to HDFS via ODBC are just the beginning. The documentation also provides extensive configuration details for using the HDFS ODBC driver with all your favorite applications and development tools, ensuring you have all the support you need.

All of the CData ODBC drivers, including the HDFS ODBC driver are available for download online. To get started, download a fully functional free trial of the HDFS ODBC driver today.

To install the HDFS driver, simply download one of the HDFS ODBC driver installers available online. The download page includes separate Windows, macOS, and Unix installers, so please select the HDFS ODBC driver that matches the platform where you intend to use the driver. The installers are comprehensive setup utilities that will install all the components required to use the HDFS ODBC driver on your system.

The HDFS ODBC driver is available in both 32-bit and 64-bit binaries. The HDFS driver installer will allow you to install both options, and you can select the ODBC driver binary that is appropriate for the system where you intend to use the driver.

Absolutely. The HDFS ODBC driver seamlessly supports pyODBC connectivity. For example, check out this article on our online knowledgebase that demonstrates how you can connect to HDFS from PyCharm using pyODBC and the HDFS ODBC driver.

However, besides ODBC support, we also provide native Python data connectivity for HDFS. To learn more, check out the HDFS Python Connector online.

Download the HDFS ODBC Driver today!

Get the latest HDFS ODBC Drivers for Windows, Linux, and macOS. Install in minutes and connect your HDFS data to Power BI, Tableau, and more.