Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Downsampling01:20

Downsampling

252
When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...
252
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

131
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
131
Upsampling01:22

Upsampling

309
Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...
309
Fineness Modulus01:19

Fineness Modulus

784
The fineness modulus (FM) of aggregate is a numerical index that measures the coarseness or fineness of the particles. It is calculated by adding the cumulative percentages of aggregate retained on each of a specified series of sieves and dividing the sum by 100.
Consider performing sieve analysis on sand through a set of ASTM sieves. The weight of aggregate retained in each sieve and pan placed at the bottom is recorded, as given in Column B of Table 1.
To determine the fineness modulus of...
784
Per-Unit Sequence Models01:26

Per-Unit Sequence Models

116
An ideal Y-Y transformer, grounded through neutral impedances, displays per-unit sequence networks akin to those of a single-phase ideal transformer when subjected to balanced positive- or negative-sequence currents. These currents do not produce neutral currents, and their associated voltage drops.
Zero-sequence currents, which are identical in magnitude and phase, generate a neutral current, resulting in voltage drops across the neutral impedance and the low-voltage winding. If the...
116
Improving Translational Accuracy02:07

Improving Translational Accuracy

2.7K
2.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Closed-Loop digital therapeutics empowered by deep reinforcement learning and wearable sensing for precision orthopedic rehabilitation: a simulation-based proof-of-concept study.

Frontiers in rehabilitation sciences·2026
Same author

Iron-based anodes facilitate concurrent mercury removal and bioenergy generation in constructed wetland-microbial fuel cells.

Bioresource technology·2026
Same author

Dynamic energy-aware fixed-point linear mapping multiplier for internet of things edge devices.

Scientific reports·2026
Same author

'Fuhui 631', a novel hybrid rice restorer line with multiple resistance genes, high combining ability, and superior grain quality developed with marker‑assisted breeding.

Molecular breeding : new strategies in plant improvement·2026
Same author

A pathogen lncRNA secreted into rice sequesters a host miRNA for virulence.

Nature·2026
Same author

Rapid and Specific Dual-Mode Recognition of Cathinone Derivatives Enabled by π-Conjugation-Tuned Multinoncovalent Interactions.

Analytical chemistry·2026
Same journal

Scaling Up Bayesian Neural Networks with Neural Networks.

Transactions on machine learning research·2026
Same journal

Tumor-anchored deep feature random forests for out-of-distribution detection in lung cancer segmentation.

Transactions on machine learning research·2026
Same journal

Still Competitive: Revisiting Recurrent Models for Irregular Time Series Prediction.

Transactions on machine learning research·2026
Same journal

Multi-Modal Foundation Models for Computational Pathology: A Survey.

Transactions on machine learning research·2026
Same journal

Sparse-Input Neural Network using Group Concave Regularization.

Transactions on machine learning research·2026
Same journal

Bayesian Neighborhood Adaptation for Graph Neural Networks.

Transactions on machine learning research·2026
See all related articles

Related Experiment Video

Updated: Sep 10, 2025

Generation and Coherent Control of Pulsed Quantum Frequency Combs
06:42

Generation and Coherent Control of Pulsed Quantum Frequency Combs

Published on: June 8, 2018

9.1K

ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers.

Junjie Yin1, Jiahao Dong2, Yingheng Wang3

  • 1Department of Computer Science, Johns Hopkins University.

Transactions on Machine Learning Research
|August 21, 2025
PubMed
Summary
This summary is machine-generated.

We introduce ModuLoRA, a memory-efficient algorithm for finetuning large language models (LLMs) using 2-4 bit precision on a single GPU. This method enables advanced low-precision finetuning, achieving competitive performance with reduced memory usage.

More Related Videos

Characterization of Anisotropic Leaky Mode Modulators for Holovideo
09:36

Characterization of Anisotropic Leaky Mode Modulators for Holovideo

Published on: March 19, 2016

8.0K
Shaping the Amplitude and Phase of Laser Beams by Using a Phase-only Spatial Light Modulator
08:39

Shaping the Amplitude and Phase of Laser Beams by Using a Phase-only Spatial Light Modulator

Published on: January 28, 2019

9.9K

Related Experiment Videos

Last Updated: Sep 10, 2025

Generation and Coherent Control of Pulsed Quantum Frequency Combs
06:42

Generation and Coherent Control of Pulsed Quantum Frequency Combs

Published on: June 8, 2018

9.1K
Characterization of Anisotropic Leaky Mode Modulators for Holovideo
09:36

Characterization of Anisotropic Leaky Mode Modulators for Holovideo

Published on: March 19, 2016

8.0K
Shaping the Amplitude and Phase of Laser Beams by Using a Phase-only Spatial Light Modulator
08:39

Shaping the Amplitude and Phase of Laser Beams by Using a Phase-only Spatial Light Modulator

Published on: January 28, 2019

9.9K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Natural Language Processing

Background:

  • Large Language Models (LLMs) require substantial computational resources for finetuning.
  • Existing finetuning methods often necessitate high-end hardware, limiting accessibility.
  • Low-precision quantization offers a path to reduce memory footprints but presents finetuning challenges.

Purpose of the Study:

  • To develop a memory-efficient finetuning algorithm for LLMs.
  • To enable finetuning of LLMs with 65B parameters on consumer-grade GPUs.
  • To integrate arbitrary weight quantizers with low-rank adaptation for flexible finetuning.

Main Methods:

  • Proposed ModuLoRA (modular low-rank adaptation), a novel finetuning approach.
  • Implemented a quantization-agnostic backward pass for adaptive low-precision weight materialization.
  • Integrated state-of-the-art 2-bit QuIP# and 3-bit OPTQ quantization methods.

Main Results:

  • Successfully finetuned LLMs with 65B parameters using 2/3/4-bit precision on a single 24GB GPU.
  • Achieved competitive performance on text classification, natural language inference, and instruction following tasks.
  • Surpassed state-of-the-art ROUGE scores in summarization tasks, outperforming existing 4-bit and 8-bit methods.

Conclusions:

  • ModuLoRA significantly reduces memory requirements for LLM finetuning.
  • The method enables finetuning of highly precise LLMs (2-bit, 3-bit) for the first time.
  • Released ModuLoRA and low-precision models via the LLMTools library for broader accessibility.