About 134,000 results
Open links in new tab
  1. Quantization (signal processing) - Wikipedia

    In mathematics and digital signal processing, quantization is the process of mapping input values from a large set (often a continuous set) to output values in a (countable) smaller set, often with a finite …

  2. What is Quantization - GeeksforGeeks

    Nov 6, 2025 · Quantization is a model optimization technique that reduces the precision of numerical values such as weights and activations in models to make them faster and more efficient. It helps …

  3. What Is Quantization? | How It Works & Applications

    Quantization is the process of mapping continuous infinite values to a smaller set of discrete finite values. In the context of simulation and embedded computing, it is about approximating real-world …

  4. Model Quantization: Concepts, Methods, and Why It Matters

    Nov 24, 2025 · Quantization has emerged as a crucial technique to address this challenge, enabling resource-intensive models to run on constrained hardware. The NVIDIA TensorRT and Model …

  5. What is quantization? - IBM

    Quantization is the process of reducing the precision of a digital signal, typically from a higher-precision format to a lower-precision format. This technique is widely used in various fields, including signal …

  6. Quantization from the ground up | ngrok blog

    Mar 25, 2026 · A complete guide to what quantization is, how it works, and how it's used to compress large language models

  7. What is Quantization and Why It Matters for AI Inference?

    Jul 20, 2025 · Among many optimization techniques to improve AI inference performance, quantization has become an essential method when deploying modern AI models into real-world services.