We are proud to share our inclusion in the 2024 Gartner Magic Quadrant for Data Integration Tools. We believe this recognition reflects the differentiated business outcomes CData delivers to our customers.
Get the Report →How to pipe Parquet Data to CSV in PowerShell
Use standard PowerShell cmdlets to access Parquet tables.
The CData Cmdlets Module for Parquet is a standard PowerShell module offering straightforward integration with Parquet. Below, you will find examples of using our Parquet Cmdlets with native PowerShell cmdlets.
Creating a Connection to Your Parquet Data
Connect to your local Parquet file(s) by setting the URI connection property to the location of the Parquet file.
$conn = Connect-Parquet -URI "$URI"
Selecting Data
Follow the steps below to retrieve data from the SampleTable_1 table and pipe the result into to a CSV file:
Select-Parquet -Connection $conn -Table SampleTable_1 | Select -Property * -ExcludeProperty Connection,Table,Columns | Export-Csv -Path c:\mySampleTable_1Data.csv -NoTypeInformation
You will notice that we piped the results from Select-Parquet into a Select-Object cmdlet and excluded some properties before piping them into an Export-Csv cmdlet. We do this because the CData Cmdlets append Connection, Table, and Columns information onto each "row" in the result set, and we do not necessarily want that information in our CSV file.
The Connection, Table, and Columns are appended to the results in order to facilitate piping results from one of the CData Cmdlets directly into another one.