Convert Parquet to TSV

Add your Parquet data and automatically convert it to TSV.

Parquet input options
This format does not have any input options.
TSV output options
This format does not have any output options.

Parquet

Apache Parquet is a columnar storage format optimized for use with big data processing frameworks. Its efficient compression and encoding schemes reduce storage size and improve read performance.

Parquet is designed to support complex nested data structures and enables efficient querying and manipulation of specific columns without reading the entire dataset.
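As a rough illustration of column-oriented access, the sketch below stores a small table as one list per column and reads only the columns it needs. It is a toy model in plain Python, not the actual Parquet file layout; the `table` data and the `select_columns` and `to_rows` helpers are made up for this example.

```python
# Toy column-oriented table: one list of values per column.
# This only models the idea; a real Parquet file is a binary format.
table = {
    "id": [1, 2, 3],
    "city": ["Oslo", "Lima", "Pune"],
    "temp_c": [4.5, 19.0, 31.2],
}


def select_columns(table, names):
    """Return only the requested columns; the other columns are never touched."""
    return {name: table[name] for name in names}


def to_rows(columns):
    """Turn column lists back into row tuples, as a row-based format like TSV needs."""
    return list(zip(*columns.values()))


subset = select_columns(table, ["id", "temp_c"])
rows = to_rows(subset)
print(rows)
```

Converting to TSV is essentially the `to_rows` step: values stored column by column are reassembled into rows before being written out.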

Compression

Parquet supports several compression codecs, such as Snappy, Gzip, and LZO. Compression reduces storage space and can improve the performance of data processing tasks.
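To give a feel for why compressing a single column works well, the following sketch compresses a column of repetitive values with Python's standard `gzip` module. This is only an approximation of the idea: Parquet applies its codecs to encoded column pages inside the file, and the `column_values` data here is invented for the example.

```python
import gzip

# A hypothetical column with many repeated values, as is common in real data.
column_values = ["Oslo"] * 1000

# Serialize the column as newline-separated bytes, then compress it.
raw = "\n".join(column_values).encode("utf-8")
compressed = gzip.compress(raw)

# Decompression restores the original bytes exactly (lossless).
restored = gzip.decompress(compressed)

print(len(raw), len(compressed), restored == raw)
```

Because similar values sit next to each other in a column, general-purpose codecs tend to find long repeated runs to shrink.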

TSV

Tab-separated values (TSV) is a simple and widely used format for storing tabular data. It is human-readable and easy to generate and parse.
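As a minimal sketch of how TSV can be generated and parsed, the example below uses Python's standard `csv` module with a tab as the delimiter. The `rows_to_tsv` helper and the sample data are assumptions made for illustration and are not part of the converter itself.

```python
import csv
import io


def rows_to_tsv(header, rows):
    """Write a header and rows as TSV text, using LF as the row delimiter."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


tsv_text = rows_to_tsv(["id", "city"], [(1, "Oslo"), (2, "Lima")])
print(tsv_text)

# Parsing reverses the process; every value comes back as a string.
parsed = list(csv.reader(io.StringIO(tsv_text), delimiter="\t"))
print(parsed)
```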

Row delimiter

The row delimiter is the character sequence that separates rows in the TSV data. This is usually a line feed (LF), or a carriage return followed by a line feed (CRLF).

We will automatically detect this and parse the rows correctly.
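One simple way such detection could work is to count both kinds of line endings and pick the more common one. The sketch below is an assumption about a possible approach, not the converter's actual logic, and `detect_row_delimiter` is a hypothetical helper.

```python
def detect_row_delimiter(text):
    """Guess whether rows end with CRLF or LF by counting both."""
    crlf_count = text.count("\r\n")
    # Every CRLF also contains an LF, so subtract those to get bare LFs.
    lf_count = text.count("\n") - crlf_count
    return "\r\n" if crlf_count > lf_count else "\n"


unix_style = "id\tcity\n1\tOslo\n"
windows_style = "id\tcity\r\n1\tOslo\r\n"

print(repr(detect_row_delimiter(unix_style)))
print(repr(detect_row_delimiter(windows_style)))

# str.splitlines() handles both delimiters, so rows can be split either way.
print(windows_style.splitlines())
```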

Value separator

The value separator is the character used to separate each value inside a row.

For TSV files, as the name implies, this is usually a tab character, but it can differ depending on the software used to generate the file.
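When the separator is not known in advance, one heuristic is to look for a candidate character that appears the same number of times on every row. The sketch below shows that idea under simplifying assumptions (no quoted values containing the separator); `detect_separator` and its candidate list are hypothetical.

```python
def detect_separator(text, candidates=("\t", ",", ";", "|")):
    """Pick the candidate that occurs equally often, and most often, on every row."""
    lines = [line for line in text.splitlines() if line]
    best, best_count = "\t", 0  # fall back to a tab when nothing matches
    for sep in candidates:
        counts = {line.count(sep) for line in lines}
        # A real separator should give the same count on every row.
        if len(counts) == 1:
            count = counts.pop()
            if count > best_count:
                best, best_count = sep, count
    return best


print(repr(detect_separator("a\tb\tc\n1\t2\t3\n")))
print(repr(detect_separator("a;b\n1;2\n")))
```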

Character encoding

Depending on what software you used to generate the TSV file, it might have a different character encoding.

If no character encoding is specified, we will automatically try to guess it, so you don't have to worry about it if you're unsure.
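Encoding guessing can be approached in several ways. The sketch below checks for a byte order mark, then tries UTF-8, and finally falls back to Latin-1, which accepts any byte sequence. This is an illustrative assumption rather than the converter's actual method, and `guess_decode` is a hypothetical helper.

```python
import codecs


def guess_decode(data):
    """Decode bytes with a best-effort guess; return (text, encoding_name)."""
    # A UTF-8 byte order mark identifies the encoding directly.
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):].decode("utf-8"), "utf-8-sig"
    # A UTF-16 BOM also tells the decoder which byte order to use.
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return data.decode("utf-16"), "utf-16"
    try:
        return data.decode("utf-8"), "utf-8"
    except UnicodeDecodeError:
        # Latin-1 maps every byte to a character, so this never fails,
        # but the result may be wrong for other legacy encodings.
        return data.decode("latin-1"), "latin-1"


print(guess_decode("café\tbar\n".encode("utf-8")))
print(guess_decode("café".encode("latin-1")))
```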
