[Avg. reading time: 3 minutes]

Common Data Formats

CSV (Comma-Separated Values)

A simple text-based format where each row is a record and columns are separated by commas.

Example:

name,age,city
Rachel,30,New York
Phoebe,25,San Francisco

Use Cases:

  • Data exchange between systems
  • Lightweight storage

Pros:

  • Human-readable
  • Easy to generate and parse

Cons:

  • No support for nested or complex structures
  • No schema enforcement
  • Inefficient for very large data

TSV (Tab-Separated Values)

Like CSV but uses tabs instead of commas.

Example:

name    age    city
Rachel   30     New York
Phoebe     25     San Francisco

Use Cases:

Similar to CSV but avoids issues with commas in data

Pros:

  • Easy to read and parse
  • Handles data with commas

Cons:

  • Same as CSV: no schema, no nested data

#bigdata #dataformat #csv #parquet #arrow Ver 5.5.3

Last change: 2025-10-15