About 57 results
  1. From PyTorch DDP to Accelerate to Trainer, mastery of distributed ...

    Oct 21, 2022 · We’re on a journey to advance and democratize artificial intelligence through open source and open science.

  2. Distributed Training with JAX and Flax NNX: A Practical Guide to …

    Mar 26, 2025 · A Blog post by Douglas Jia on Hugging Face

  3. Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

    Aug 8, 2025 · To achieve the best performance, parameters of consecutive layers can be distributed in a specific fashion, minimizing the required communication. When working with pairs of linear layers, …

  4. Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

    Jan 23, 2025 · A Blog post by Junlin Zhou on Hugging Face

  5. Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

    Mar 10, 2026 · Distributed MoE (Mixture of Experts) support is the emerging differentiator. If you'd rather skip straight to the good part, here's the full comparison table (no reading required, we won't judge).

  6. Accelerate Large Model Training using PyTorch Fully Sharded Data …

    May 2, 2022

  7. Reproducing and Validating Distributed Muon: A Practical ...

    Dec 12, 2025 · A Blog post by Jen Wei on Hugging Face

  8. How to train a Language Model with Megatron-LM - Hugging Face

    Sep 7, 2022 · Training large language models in PyTorch requires more than a simple training loop. It is usually distributed across multiple devices, with many optimization techniques for a stable and …

  9. DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone …

    Feb 28, 2025 · With today’s buzz around large language models (LLMs) such as GPT-3, PaLM, and LLaMA, distributed training has become an essential technique for pushing beyond the limits of a …

  10. Finetuning Falcon 7b in a hybrid distributed fashion - Hugging Face

    Dec 31, 2024 · A Blog post by Bilel Cherif on Hugging Face