Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Language Development

Language Development

Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...

Reducing Line Loss

Reducing Line Loss

In a three-phase circuit, line loss is an indicator of energy dissipated as heat due to the resistance of transmission lines. To address this, incorporating transformers into the system—a step-up transformer at the source and a step-down transformer at the load—is a strategic solution. Two three-phase transformers are introduced to improve this.
With a step-up transformer at the source, the voltage is increased, thereby reducing the current in the transmission lines since power loss...

Language and Cognition

Language and Cognition

Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Polyphenols from Pulses: Recent Advances in Gut Health Benefits and Strategies to Elevate Their Concentrations.

Nutrients·2026

Same author

Genetic Differentiation in the SdhC Subunit Confers Intrinsic Resistance to SDHI Fungicides in Fusarium asiaticum.

Molecular plant pathology·2026

Same author

Remnant cholesterol level modified the effects of intensive systolic blood pressure lowering treatment in high-risk hypertensive patients: a post hoc analysis of the ESPRIT trial.

Hypertension research : official journal of the Japanese Society of Hypertension·2026

Same author

How to elevate the shelf-life of minor grains rich in unsaturated fatty acids: From rancidity mechanism to stability strategies.

Food chemistry·2026

Same author

Prolonged QT interval is associated with 90-day mortality in sepsis independent of onset timing and heightened in male patients: A cohort study.

The American journal of the medical sciences·2026

Same author

Corrigendum to "GelMA@APPA Microspheres Promote Chondrocyte Regeneration and Alleviate Osteoarthritis via Fgfr2 Activation" [2025 Feb:137:156176. doi: 10.1016/j.phymed.2024.156176/PHYMED-D-24-03041].

Phytomedicine : international journal of phytotherapy and phytopharmacology·2026

Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 13, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

LS-PRISM: A layer-selective pruning method via low-rank approximation and sparsification for efficient large language

Renshuai Tao¹, Hairong Chen¹, Yuzhe Guo¹

¹Beijing Jiaotong University, Beijing, 100044, China.

Neural Networks : the Official Journal of the International Neural Network Society

|August 1, 2025

Summary

This summary is machine-generated.

We developed LS-PRISM, a novel method for compressing Large Language Models (LLMs) by selectively pruning layers. This technique significantly reduces model size while maintaining high performance on NLP tasks.

Keywords:

Large language models (LLMs)Low-rank approximation Model compression Sparsification Unstructured pruning

More Related Videos

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

Lensless Fluorescent Microscopy on a Chip

Lensless Fluorescent Microscopy on a Chip

Published on: August 17, 2011

Related Experiment Videos

Last Updated: Sep 13, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

Lensless Fluorescent Microscopy on a Chip

Lensless Fluorescent Microscopy on a Chip

Published on: August 17, 2011

Area of Science:

Artificial Intelligence
Natural Language Processing
Machine Learning

Background:

Large Language Models (LLMs) achieve state-of-the-art performance in Natural Language Processing (NLP).
The substantial parameter count of LLMs poses deployment challenges for resource-constrained environments.
Existing compression methods often apply uniform compression across all layers, potentially impacting performance unevenly.

Purpose of the Study:

To introduce LS-PRISM, a novel Layer-Selective Pruning via low-Rank Approximation and Sparsification Method.
To efficiently compress LLMs while preserving performance on critical NLP benchmarks.
To provide a scalable solution for LLM deployment in resource-limited settings.

Main Methods:

LS-PRISM employs layer-selective low-rank approximation based on accuracy and loss impact.
Dynamic Rank Selection adaptively determines approximation ranks for optimal performance retention.
Unstructured pruning and optional LoRA fine-tuning further enhance model sparsification and performance recovery.

Main Results:

Significant reductions in parameter count and storage achieved.
Minimal degradation in accuracy observed across NLP benchmarks (BoolQ, RTE, ARC-Challenge).
Up to 12% parameter reduction demonstrated on a 2.5B parameter LLM with comparable performance.

Conclusions:

LS-PRISM offers an effective and scalable approach for compressing LLMs.
The method successfully balances model compression with performance preservation.
LS-PRISM is suitable for deploying LLMs in resource-constrained environments.