The Hive ADO.NET Data Provider enables user to easily connect to Hive data from .NET applications. Rapidly create and deploy powerful .NET applications that integrate with Apache Hive-compatible distributions.
Hive .NET Connectivity Features
- Maps SQL to HiveQL, enabling direct standard SQL-92 access to Apache Hive
- Enables SQL-92 capabilities on Apache Hive NoSQL data.
- Flexible NoSQL flattening - automatic schema generation, flexible querying etc.
- Connect to live Apache Hive data, for real-time data access with the Apache Hive JDBC Driver
- Full support for data aggregation and complex JOINs in SQL queries
- Secure connectivity through modern cryptography, including TLS 1.2, SHA-256, ECC, etc.
- Seamless integration with leading BI, reporting, and ETL tools and with custom applications via the Hive Connector.
Target Service, API
The driver connects to Apache Hive. Data warehouse on Hadoop.
Schema, Data Model
Models Hive databases and tables. Supports partitioned and bucketed tables.
Key Objects
Databases, Tables, Partitions, and Views. Hive metastore access.
Operations
HiveQL queries through SQL interface. Read and write operations. UDF support.
Authentication
Kerberos, LDAP, or no authentication. HiveServer2 connection.
See what you can do with Hive ADO.NET provider
Use Hive from SQL Server Analysis Service (SSAS) multi-dimensional cubes. Keep your analytical data modeling and access to any source including cloud and on-premises.
The Hive ADO.NET Provider allows developers to build applications that connect to Hive using familiar SQL and Entity Framework. Integrate Hive to your mission -critical applications or create easy side-by-side applications.
You can connect from ADO.NET compliant low-code development tools:
You can connect Hive from .NET-based reporting and analytics tools:
Standard ADO.NET Access to Hive
The Apache Hive ADO.NET Provider offers the most natural way to access Hive data from any .NET application. Simply use Apache Hive Data Provider objects to connect and access data just as you would access any traditional database. You will be able to use the Apache Hive Data Provider through Visual Studio Server Explorer, in code through familiar classes, and in data controls like DataGridView, GridView, DataSet, etc.
The CData ADO.NET Provider for Apache Hive hides the complexity of accessing data and provides additional powerful security features, smart caching, batching, socket management, and more.
Working with DataAdapters, DataSets, DataTables, etc.
The Apache Hive Data Provider has the same ADO.NET architecture as the native .NET data providers for SQL Server and OLEDB, including: HiveConnection, HiveCommand, HiveDataAdapter, HiveDataReader, HiveDataSource, HiveParameter, etc. Because of this you can now access Hive data in an easy, familiar way.
For example:
using (HiveConnection conn = new HiveConnection("...")) {
string select = "SELECT * FROM HiveData";
HiveCommand cmd = new HiveCommand(select, conn);
HiveDataAdapter adapter = new HiveDataAdapter(cmd);
using (adapter) {
DataTable table = new DataTable();
adapter.Fill(table);
...
}
}
More Than Read-Only: Full Update/CRUD Support
Apache Hive Data Provider goes beyond read-only functionality to deliver full support for Create, Read, Update, and Delete operations (CRUD). Your end-users can interact with the data presented by the Apache Hive Data Provider as easily as interacting with a database table.
using (HiveConnection connection = new HiveConnection(connectionString)) {
HiveDataAdapter dataAdapter = new HiveDataAdapter(
"SELECT Id, Where FROM HiveData", connection);
dataAdapter.UpdateCommand = new HiveCommand(
"UPDATE HiveData SET Where = @Where " +
"WHERE Id = @ID", connection);
dataAdapter.UpdateCommand.Parameters.AddWithValue("@Where", "Where");
dataAdapter.UpdateCommand.Parameters.AddWithValue("@Id", "80000173-1387137645");
DataTable HiveDataTable = new DataTable();
dataAdapter.Fill(HiveDataTable);
DataRow firstrow = HiveDataTable.Rows[0];
firstrow["Where"] = "New Location";
dataAdapter.Update(HiveDataTable);
}
ADO.NET Provider Performance
With traditional approaches to remote access, performance bottlenecks can spell disaster for applications. Regardless if an application is created for internal use, a commercial project, web, or mobile application, slow performance can rapidly lead to project failure. Accessing data from any remote source has the potential to create these problems. Common issues include:
- Network Connections - Slow network connections and latency issues are common in mobile applications.
- Service Delays - Delays due to service interruptions, resulting in server hardware or software updates.
- Large Data - Intentional or unintentional requests for large amounts of data.
- Disconnects - Complete loss of network connectivity.
The CData ADO.NET Provider for Apache Hive solves these issues by supporting powerful smart caching technology that can greatly improve the performance and dramatically reduce application bottlenecks.
Smart Caching
Smart caching is a configurable option that works by storing queried data into a local database. Enabling smart caching creates a persistent local cache database that contains a replica of data retrieved from the remote source. The cache database is small, lightweight, blazing-fast, and it can be shared by multiple connections as persistent storage.
Caching with our ADO.NET Providers is highly configurable, including options for:
- Auto Cache - Maintain an automatic local cache of data on all requests. The provider will automatically load data into the cache database each time you execute a SELECT query. Each row returned by the query will be inserted or updated as necessary into the corresponding table in the cache database.
- Explicit Cache - Cache only on demand. Developers decide exactly what data gets stored in the cache and when it is updated. Explicit caching provides full control over the cache contents by using explicit execution of CACHE statements.
- No Cache - All requests access only live data and no local cache file is created.
This powerful caching functionality increases application performance and allows applications to disconnect and continue limited functioning without writing code for additional local storage and/or data serialization/deserialization.
More information about ADO.NET Provider caching and best caching practices is available in the included help files.
Visual Studio Integration & Server Explorer
Working with the new Apache Hive ADO.NET Provider is easy. As a fully-managed .NET Data Provider, the Apache Hive Data Provider integrates seamlessly with the Visual Studio development environment as well as any .NET application.
As an ADO.NET Data Provider, Apache Hive ADO.NET Provider can be used to access and explore Apache Hive data directly from the Visual Studio Server Explorer.
It's easy. As a standard ADO.NET adapter, developers can connect the Server Explorer to Apache Hive ADO.NET Provider just like connecting to any standard database.
- Add a new Data Connection from the Server Explorer and select the Apache Hive Data Source
- Configure the basic connection properties to access your Apache Hive account data.
Explore all of the data available! Apache Hive ADO.NET Provider makes it easy to access live Apache Hive data from Visual Studio.
Developer Integration: Databind to Hive
Connecting Web, Desktop, and Mobile .NET applications with Apache Hive is just like working with SQL Server. It is even possible to integrate Apache Hive ADO.NET Provider into applications without writing code.
Developers are free to access the Apache Hive ADO.NET Provider in whatever way they like best. Either visually through the Visual Studio Winforms or Webforms designers, or directly through code.
- Developers can connect the Apache Hive Data Source directly to form components by configuring the object's smart
tags.
- Add a new Data Connection from the Server Explorer and select the Apache Hive Data Source. Then, select the
feed, view, or services you would like to connect the object to.
Done! It's just like connecting to SQL Server.
Popular ADO Videos:
