Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Lattice Centering and Coordination Number02:33

Lattice Centering and Coordination Number

9.9K
The structure of a crystalline solid, whether a metal or not, is best described by considering its simplest repeating unit, which is referred to as its unit cell. The unit cell consists of lattice points that represent the locations of atoms or ions. The entire structure then consists of this unit cell repeating in three dimensions. The three different types of unit cells present in the cubic lattice are illustrated in Figure 1.
Types of Unit Cells
Imagine taking a large number of identical...
9.9K
Bewley Lattice Diagram01:12

Bewley Lattice Diagram

842
The Bewley lattice diagram, developed by L. V. Bewley, effectively organizes the reflections occurring during transmission-line transients. It visually represents how voltage waves propagate and reflect within a transmission line, making it easier to understand the complex interactions that occur.
842
Trends in Lattice Energy: Ion Size and Charge02:54

Trends in Lattice Energy: Ion Size and Charge

24.2K
An ionic compound is stable because of the electrostatic attraction between its positive and negative ions. The lattice energy of a compound is a measure of the strength of this attraction. The lattice energy (ΔHlattice) of an ionic compound is defined as the energy required to separate one mole of the solid into its component gaseous ions. For the ionic solid sodium chloride, the lattice energy is the enthalpy change of the process:
24.2K
Hybridization of Atomic Orbitals II03:35

Hybridization of Atomic Orbitals II

33.7K
sp3d and sp3d 2 Hybridization
33.7K
Quantum Numbers02:43

Quantum Numbers

36.4K
It is said that the energy of an electron in an atom is quantized; that is, it can be equal only to certain specific values and can jump from one energy level to another but not transition smoothly or stay between these levels.
36.4K
Hybridization of Atomic Orbitals I03:24

Hybridization of Atomic Orbitals I

48.9K
The mathematical expression known as the wave function, ψ, contains information about each orbital and the wavelike properties of electrons in an isolated atom. When atoms are bound together in a molecule, the wave functions combine to produce new mathematical descriptions that have different shapes. This process of combining the wave functions for atomic orbitals is called hybridization and is mathematically accomplished by the linear combination of atomic orbitals. The new orbitals that...
48.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Endurance exercise improves age- and obesity-related cardiac dysfunction and locomotor impairment through muscle Gγ1 gene in Drosophila.

Life sciences·2026
Same author

Simple Guidance Mechanisms for Discrete Diffusion Models.

... International Conference on Learning Representations·2026
Same author

The Diffusion Duality.

Proceedings of machine learning research·2026
Same author

Calibrated Probabilistic Forecasts for Arbitrary Sequences.

Transactions on machine learning research·2026
Same author

BLOCK DIFFUSION: INTERPOLATING BETWEEN AU-TOREGRESSIVE AND DIFFUSION LANGUAGE MODELS.

... International Conference on Learning Representations·2026
Same author

PlantCAD2: A Long-Context DNA Language Model for Cross-Species Functional Annotation in Angiosperms.

bioRxiv : the preprint server for biology·2025
Same journal

Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift.

Proceedings of machine learning research·2026
Same journal

Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video.

Proceedings of machine learning research·2026
Same journal

Perspective: Machine Learning for Health Should Consider Social Drivers of Health.

Proceedings of machine learning research·2026
Same journal

Classifying Phonotrauma Severity from Vocal Fold Images with Soft Ordinal Regression.

Proceedings of machine learning research·2026
Same journal

Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?

Proceedings of machine learning research·2026
Same journal

Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential.

Proceedings of machine learning research·2026
See all related articles

Related Experiment Video

Updated: Sep 9, 2025

Generation and Coherent Control of Pulsed Quantum Frequency Combs
06:42

Generation and Coherent Control of Pulsed Quantum Frequency Combs

Published on: June 8, 2018

9.1K

QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks.

Albert Tseng1, Jerry Chee1, Qingyao Sun2

  • 1Department of Computer Science, Cornell University.

Proceedings of Machine Learning Research
|September 2, 2025
PubMed
Summary
This summary is machine-generated.

QuIP# is a novel post-training quantization method that achieves state-of-the-art extreme compression for large language models (LLMs) using advanced techniques. This weight-only approach significantly reduces model size while maintaining performance, enabling efficient deployment.

More Related Videos

Quantum State Engineering of Light with Continuous-wave Optical Parametric Oscillators
09:23

Quantum State Engineering of Light with Continuous-wave Optical Parametric Oscillators

Published on: May 30, 2014

14.6K
A Photonic System for Generating Unconditional Polarization-Entangled Photons Based on Multiple Quantum Interference
07:56

A Photonic System for Generating Unconditional Polarization-Entangled Photons Based on Multiple Quantum Interference

Published on: September 5, 2019

8.6K

Related Experiment Videos

Last Updated: Sep 9, 2025

Generation and Coherent Control of Pulsed Quantum Frequency Combs
06:42

Generation and Coherent Control of Pulsed Quantum Frequency Combs

Published on: June 8, 2018

9.1K
Quantum State Engineering of Light with Continuous-wave Optical Parametric Oscillators
09:23

Quantum State Engineering of Light with Continuous-wave Optical Parametric Oscillators

Published on: May 30, 2014

14.6K
A Photonic System for Generating Unconditional Polarization-Entangled Photons Based on Multiple Quantum Interference
07:56

A Photonic System for Generating Unconditional Polarization-Entangled Photons Based on Multiple Quantum Interference

Published on: September 5, 2019

8.6K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Computer Science

Background:

  • Post-training quantization (PTQ) is crucial for reducing the memory footprint of large language models (LLMs).
  • Existing PTQ methods face challenges in extreme compression regimes (e.g., ≤ 4 bits per weight).

Purpose of the Study:

  • Introduce QuIP#, a novel weight-only PTQ method for extreme LLM compression.
  • Achieve state-of-the-art results in extreme quantization by employing three innovative techniques.

Main Methods:

  • Utilize the randomized Hadamard transform for improved incoherence processing.
  • Employ vector quantization with hardware-efficient codebooks based on the E8 lattice for sub-Gaussian weights.
  • Incorporate fine-tuning to enhance fidelity to the original model.

Main Results:

  • QuIP# demonstrates superior performance compared to existing PTQ methods in extreme compression.
  • The method enables new scaling behaviors for PTQ.
  • QuIP# supports fast inference speeds.

Conclusions:

  • QuIP# represents a significant advancement in weight-only PTQ for LLMs.
  • The novel techniques employed lead to state-of-the-art results in extreme compression.
  • QuIP# offers a practical solution for deploying highly compressed LLMs efficiently.