ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers.

Junjie Yin¹, Jiahao Dong², Yingheng Wang³

¹Department of Computer Science, Johns Hopkins University.

Transactions on Machine Learning Research

August 21, 2025

Summary

This summary is machine-generated.

We introduce ModuLoRA, a memory-efficient algorithm for finetuning large language models (LLMs) in 2-, 3-, or 4-bit precision on a single GPU. By combining low-rank adapters with a user-specified weight quantizer, the method achieves competitive task performance with reduced memory usage.

Area of Science:

Artificial Intelligence
Machine Learning
Natural Language Processing

Background:

Large Language Models (LLMs) require substantial computational resources for finetuning.
Existing finetuning methods often necessitate high-end hardware, limiting accessibility.
Low-precision quantization offers a path to reduce memory footprints but presents finetuning challenges.

Purpose of the Study:

To develop a memory-efficient finetuning algorithm for LLMs.
To enable finetuning of LLMs with 65B parameters on consumer-grade GPUs.
To integrate arbitrary weight quantizers with low-rank adaptation for flexible finetuning.

Main Methods:

Proposed ModuLoRA (modular low-rank adaptation), a novel finetuning approach.
Implemented a quantization-agnostic backward pass that adaptively materializes low-precision weights from a black-box quantization module.
Integrated state-of-the-art 2-bit QuIP# and 3-bit OPTQ quantization methods.
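
The idea of a quantization-agnostic backward pass can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the toy uniform quantizer stands in for QuIP# or OPTQ, and the names `quantize`, `dequantize`, `forward`, and `backward` are hypothetical, not part of LLMTools. The sketch assumes a single linear layer y = W x + B (A x), where W is frozen and stored only as low-bit codes, and A and B are the trainable low-rank adapters. The full-precision W is rebuilt from the codes inside each pass and never kept between passes.

```python
import random

def quantize(w, bits):
    """Toy uniform quantizer (a stand-in, not QuIP# or OPTQ): floats -> b-bit codes."""
    flat = [v for row in w for v in row]
    lo, hi = min(flat), max(flat)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels or 1.0  # avoid a zero scale for constant matrices
    codes = [[round((v - lo) / scale) for v in row] for row in w]
    return codes, scale, lo

def dequantize(q):
    """Black-box interface used by the layer: codes -> approximate float weights."""
    codes, scale, lo = q
    return [[c * scale + lo for c in row] for row in codes]

def matvec(m, v):
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def matvec_t(m, v):
    """Compute m^T v without building the transpose."""
    return [sum(m[i][j] * v[i] for i in range(len(m))) for j in range(len(m[0]))]

def outer(u, v):
    return [[a * b for b in v] for a in u]

def forward(q, A, B, x):
    w = dequantize(q)                  # materialize frozen weights for this call only
    ax = matvec(A, x)                  # low-rank projection, length r
    y = [wx + bax for wx, bax in zip(matvec(w, x), matvec(B, ax))]
    return y, ax                       # w is dropped; only x and ax are needed later

def backward(q, A, B, x, ax, g):
    """g is dL/dy. Returns gradients for the adapters A, B and for the input x."""
    w = dequantize(q)                  # re-materialize instead of caching float weights
    btg = matvec_t(B, g)               # B^T g, length r
    grad_B = outer(g, ax)              # dL/dB = g (A x)^T
    grad_A = outer(btg, x)             # dL/dA = (B^T g) x^T
    grad_x = [a + b for a, b in zip(matvec_t(w, g), matvec_t(A, btg))]
    return grad_A, grad_B, grad_x

# Usage: a 3x4 layer with rank-2 adapters and 2-bit base weights.
random.seed(0)
W = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(3)]
q = quantize(W, bits=2)
A = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(2)]
B = [[0.0, 0.0] for _ in range(3)]     # LoRA-style zero init for B
x = [1.0, -0.5, 0.25, 2.0]
y, ax = forward(q, A, B, x)
grad_A, grad_B, grad_x = backward(q, A, B, x, ax, g=[1.0, 1.0, 1.0])
```

Because the layer only ever calls `dequantize`, swapping in a different quantizer would not change `forward` or `backward`, which is the sense in which such a backward pass is agnostic to the quantization scheme. No gradient is computed for W itself, since the base weights stay frozen.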

Main Results:

Successfully finetuned LLMs with 65B parameters using 2/3/4-bit precision on a single 24GB GPU.
Achieved competitive performance on text classification, natural language inference, and instruction following tasks.
Surpassed state-of-the-art ROUGE scores in summarization tasks, outperforming existing 4-bit and 8-bit methods.

Conclusions:

ModuLoRA significantly reduces memory requirements for LLM finetuning.
The method enables finetuning of low-precision (2-bit and 3-bit) LLMs for the first time.
Released ModuLoRA and low-precision models via the LLMTools library for broader accessibility.