We are proud to share our inclusion in the 2024 Gartner Magic Quadrant for Data Integration Tools. We believe this recognition reflects the differentiated business outcomes CData delivers to our customers.
Get the Report →Visualize HDFS Data in Sisense
Create an ElastiCube in Sisense app with access to HDFS data.
Sisense lets you join, analyze, and picture data to make more intelligent business decisions and craft effective strategies. The CData JDBC Driver for HDFS makes it easy to integrate with HDFS data in Sisense. This article shows how to create an ElastiCube that connects to HDFS data and use the ElastiCube to visualize HDFS data in Sisense.
Configure the Connection to HDFS
Before creating the ElastiCube, note the installation location for the JAR file for the JDBC Driver (typically C:\Program Files\CData\CData JDBC Driver for HDFS 20XX\lib) or copy the jar file (cdata.jdbc.hdfs.HDFS.jar) to a new folder in the Sisense JDBC driver directory (typically C:\ProgramData\Sisense\DataConnectors\jdbcdrivers).
- In the Data page of the Sisense application, create a new ElastiCube (or open an existing one).
- In the Model Editor, click "+ Data" to open the Add Data dialog box.
- Click Generic JDBC to open the JDBC settings.
- Set the connection string property to the JDBC URL for HDFS, adding required properties.
In order to authenticate, set the following connection properties:
- Host: Set this value to the host of your HDFS installation.
- Port: Set this value to the port of your HDFS installation. Default port: 50070
Built-in Connection String Designer
For assistance in constructing the JDBC URL, use the connection string designer built into the HDFS JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.hdfs.jar
Fill in the connection properties and copy the connection string to the clipboard.
When you configure the JDBC URL, you may also want to set the Max Rows connection property. This will limit the number of rows returned, which is especially helpful for improving performance when designing reports and visualizations.
A typical example follows:
jdbc:hdfs:Host=sandbox-hdp.hortonworks.com;Port=50070;Path=/user/root;User=root;
- Set the JDBC JARs folder property to the location of the CData JDBC Driver JAR file (see above).
- Set the driver's class name to the class name for the JDBC Driver: cdata.jdbc.hdfs.HDFSDriver
- Leave the username and password properties blank.
- Click Next.
Add HDFS Data to an ElastiCube
Once you are connected to HDFS, you can add views to your ElastiCubes.
- From the Tables list, select the tables and/or views you wish to work with.
- (Optional) Click "+" to customize the data you want to import with SQL.
- Click Done.
- Click Build to build the ElastiCube for analytics.
Visualize HDFS Data
With HDFS tables added to your ElastiCube, you can perform analytics on your HDFS data.
- Navigate to the Analytics page of the Sisense application
- Select a Dashboard (or create a new one)
- Select your Data Source and click Create
- Click "+ Select Data" and choose fields to add to your visualization.
With the CData JDBC Driver for HDFS, you can access HDFS data right in Sisense for powerful visualization and analytics. Download a free, 30-day trial and start working with HDFS data in Sisense today!