Convert CSV to Avro

Add your CSV data and automatically convert it to Avro.

CSV input options
Avro output options

CSV

Comma-separated values (CSV) is a simple and widely used format for storing tabular data. It is human-readable and easy to generate and parse.

Row delimiter

The row delimiter is the character used to separate each row in the CSV data. This is usually a new line character (LF), or a carriage return plus a new line character (CRLF).

We will automatically detect this and parse the rows correctly.

Value separator

The value separator is the character used to separate each value inside a row.

For CSV files, as the name implies it is usually a comma, but it can be different depending on the software used to generate the CSV file, we support the following separators:

  • Comma ,
  • Tab \t
  • Pipe |
  • Hash #
  • Semicolon ;

Character encoding

Depending on what software you used to generate the CSV file, it might have a different character encoding

If no character encoding is specified we will automatically try to guess it, so you don't have to worry about it if you're unsure.

Avro

Apache Avro is a row-oriented remote procedure call and data serialization framework developed within Apache's Hadoop project. It uses JSON for defining data types and protocols, and serializes data in a compact binary format.

Avro is designed to support complex nested data structures and enables efficient querying and manipulation of specific columns without reading the entire dataset.

Convert CSV