About 50 results
Open links in new tab
  1. Apache Spark™ - Unified Engine for large-scale data analytics

    Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

  2. Overview - Spark 4.1.1 Documentation

    Spark Connect is a new client-server architecture introduced in Spark 3.4 that decouples Spark client applications and allows remote connectivity to Spark clusters.

  3. Documentation | Apache Spark

    Hands-On Exercises Hands-on exercises from Spark Summit 2014. These let you install Spark on your laptop and learn basic concepts, Spark SQL, Spark Streaming, GraphX and MLlib. Hands-on …

  4. Quick Start - Spark 4.1.1 Documentation - Apache Spark

    Quick Start Interactive Analysis with the Spark Shell Basics More on Dataset Operations Caching Self-Contained Applications Where to Go from Here This tutorial provides a quick introduction to using …

  5. Examples - Apache Spark

    Apache Spark ™ examples This page shows you how to use different Apache Spark APIs with simple examples. Spark is a great engine for small and large datasets. It can be used with single …

  6. Spark Release 3.5.0 - Apache Spark

    Spark Release 3.5.0 Apache Spark 3.5.0 is the sixth release in the 3.x series. With significant contributions from the open-source community, this release addressed over 1,300 Jira tickets. This …

  7. Useful Developer Tools | Apache Spark

    Apache Spark leverages GitHub Actions that enables continuous integration and a wide range of automation. Apache Spark repository provides several GitHub Actions workflows for developers to …

  8. PySpark Overview — PySpark 4.1.1 documentation - Apache Spark

    Jan 2, 2026 · PySpark Overview # Date: Jan 02, 2026 Version: 4.1.1 Useful links: Live Notebook | GitHub | Issues | Examples | Community | Stack Overflow | Dev Mailing List | User Mailing List …

  9. Spark SQL & DataFrames | Apache Spark

    Spark SQL is Spark's module for working with structured data, either within Spark programs or through standard JDBC and ODBC connectors.

  10. Spark SQL and DataFrames - Spark 4.1.1 Documentation

    Spark SQL, DataFrames and Datasets Guide Spark SQL is a Spark module for structured data processing. Unlike the basic Spark RDD API, the interfaces provided by Spark SQL provide Spark …