About 22,500 results
Open links in new tab
  1. File Format | Parquet

    Jul 7, 2024 · The format is explicitly designed to separate the metadata from the data. This allows splitting columns into multiple files, as well as having a single metadata file reference multiple …

  2. Apache Parquet Explained: A Guide for Data Professionals

    Feb 10, 2025 · Learn how to use Apache Parquet with practical code examples. This guide covers its features, schema evolution, and comparisons with CSV, JSON, and Avro.

  3. Parquet File Format – Everything You Need to Know!

    May 14, 2025 · Learn everything you need to know about the Parquet file format. With the amount of data growing exponentially in the last few years, one of the biggest challenges has become finding …

  4. Download Parquet Sample Data - TimeStored.com

    QStudio is a great Windows Parquet File Viewer. QStudio is a free SQL Editor that allows easily querying parquet/h2/json/csv/tsv/duckdb files. Would query the file above, alternatively within …

  5. Sample Parquet Files - Download Free Parquet Datasets

    Download free sample Parquet files for testing and development. High-quality columnar data samples for big data and analytics testing.

  6. What’s Really Inside a Parquet File? And why it matters

    Sep 30, 2025 · Every day, data engineers casually call df.to_parquet() or its equivalent, moving on without much thought. But what actually happens in that instant? What gets written to disk, and why …

  7. Introduction to Java Parquet (Formerly Parquet MR) - Baeldung

    Mar 25, 2026 · Learn to use Apache Parquet in Java 17, understanding Example API, Avro models, column projection, predicate pushdown, and ZSTD compression for efficient analytics.

  8. How to Work with Parquet Files in Python – A Guide with Examples

    Mar 5, 2026 · In this tutorial, you learned how to write and read Parquet files, use column pruning to reduce I/O, apply compression codecs, and filter large files efficiently using predicate pushdown.

  9. What Is Parquet? Columnar File Format vs CSV, Avro & ORC

    Apache Parquet is a columnar storage format built for fast analytics. Learn how Parquet compression, schema evolution, and predicate pushdown work — and when to choose Parquet over CSV, Avro, …

  10. What is Parquet? - Databricks

    Apache Parquet is designed to be a common interchange format for both batch and interactive workloads. It is similar to other columnar-storage file formats available in Hadoop, namely RCFile …