Excel Spreadsheet Automation with the QUERY Formula

The Phoenix Excel Add-In is a powerful tool that allows you to connect with Apache Phoenix, directly from Microsoft Excel.

Use Excel to read, write, and update HBase through Apache Phoenix. Perfect for mass imports / exports / updates, data cleansing & de-duplication, Excel based data analysis, and more!

Pull data, automate spreadsheets, and more with the QUERY formula.

The CData Excel Add-In for Phoenix provides formulas that can query Phoenix data. The steps below show how to automate the following task: search Phoenix data for a user-specified value, then organize the results into an Excel spreadsheet.

The syntax of the CDATAQUERY formula is the following: =CDATAQUERY(Query, [Connection], [Parameters], [ResultLocation])

This article uses three of these inputs:

  • Query: The declaration of the Phoenix data records you want to retrieve, written in standard SQL.
  • Connection: Either the connection name, such as ApachePhoenixConnection1, or a connection string. The connection string consists of the required properties for connecting to Phoenix data, separated by semicolons.

    Connect to Apache Phoenix via the Phoenix Query Server. Set the Server and Port (if different from the default port) properties to connect to Apache Phoenix. The Server property will typically be the host name or IP address of the server hosting Apache Phoenix.

    Authenticating to Apache Phoenix

    By default, no authentication is used (plain). If authentication is configured for your server, set AuthScheme to NEGOTIATE and, if necessary, set the User and Password properties to authenticate through Kerberos.

  • ResultLocation: The cell that the output of results should start from.
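To make the connection string format concrete, here is a minimal Python sketch that joins property names and values with semicolons, as described above. The helper name build_connection_string and the sample values (localhost, 8765, myuser, mypassword) are assumptions for illustration only; they are not part of the add-in.

```python
def build_connection_string(properties):
    """Join name/value pairs as 'Name=Value', separated by semicolons.

    Hypothetical helper for illustration; the add-in itself only
    needs the resulting string.
    """
    parts = []
    for name, value in properties.items():
        # Skip properties that were left unset.
        if value is None or value == "":
            continue
        parts.append(f"{name}={value}")
    return ";".join(parts)


# Sample values are placeholders, not defaults taken from the add-in.
connection = build_connection_string({
    "Server": "localhost",
    "Port": 8765,
    "AuthScheme": "NEGOTIATE",
    "User": "myuser",
    "Password": "mypassword",
})
print(connection)
```

The resulting string can be pasted into the Connection input, or split across cells and reassembled with & as in the formula later in this article.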

Pass Spreadsheet Cells as Inputs to the Query

The procedure below results in a spreadsheet that organizes all the formula inputs in the first column.

  1. Define cells for the formula inputs. In addition to the connection inputs, add an input for the filter criterion used to search Phoenix data, such as Id.
  2. In another cell, write the formula, referencing the cell values from the user input cells defined above. Single quotes are used to enclose values such as addresses that may contain spaces.

     =CDATAQUERY("SELECT * FROM MyTable WHERE Id = '"&B3&"'","Server="&B1&";Port="&B2&";Provider=ApachePhoenix",B4)

  3. Change the filter to change the data.
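To see what the & concatenation in the formula produces, the following Python sketch rebuilds the same two strings from sample inputs. The function name expand_formula_inputs and the sample cell values are assumptions for illustration; only the string templates come from the formula above.

```python
def expand_formula_inputs(server, port, filter_value):
    """Rebuild the two strings the formula assembles with &.

    server, port, and filter_value stand in for cells B1, B2, and B3.
    """
    # Mirrors "SELECT * FROM MyTable WHERE Id = '"&B3&"'"
    query = "SELECT * FROM MyTable WHERE Id = '" + str(filter_value) + "'"
    # Mirrors "Server="&B1&";Port="&B2&";Provider=ApachePhoenix"
    connection = (
        "Server=" + str(server) + ";Port=" + str(port) + ";Provider=ApachePhoenix"
    )
    return query, connection


# Placeholder inputs; a value containing a space shows why the
# single quotes around the filter value matter.
query, connection = expand_formula_inputs("localhost", 8765, "A 12")
print(query)
print(connection)
```

Changing only the filter value changes the query text and leaves the connection string untouched, which is why editing the filter cell is enough to retrieve different data.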