Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

11.9K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.9K
Language Development01:22

Language Development

454
Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
454
Reducing Line Loss01:18

Reducing Line Loss

194
In a three-phase circuit, line loss is an indicator of energy dissipated as heat due to the resistance of transmission lines. To address this, incorporating transformers into the system—a step-up transformer at the source and a step-down transformer at the load—is a strategic solution. Two three-phase transformers are introduced to improve this.
With a step-up transformer at the source, the voltage is increased, thereby reducing the current in the transmission lines since power loss...
194
Language and Cognition01:27

Language and Cognition

444
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
444
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

717
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
717
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

101
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
101

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Polyphenols from Pulses: Recent Advances in Gut Health Benefits and Strategies to Elevate Their Concentrations.

Nutrients·2026
Same author

Genetic Differentiation in the SdhC Subunit Confers Intrinsic Resistance to SDHI Fungicides in Fusarium asiaticum.

Molecular plant pathology·2026
Same author

Remnant cholesterol level modified the effects of intensive systolic blood pressure lowering treatment in high-risk hypertensive patients: a post hoc analysis of the ESPRIT trial.

Hypertension research : official journal of the Japanese Society of Hypertension·2026
Same author

How to elevate the shelf-life of minor grains rich in unsaturated fatty acids: From rancidity mechanism to stability strategies.

Food chemistry·2026
Same author

Prolonged QT interval is associated with 90-day mortality in sepsis independent of onset timing and heightened in male patients: A cohort study.

The American journal of the medical sciences·2026
Same author

Corrigendum to "GelMA@APPA Microspheres Promote Chondrocyte Regeneration and Alleviate Osteoarthritis via Fgfr2 Activation" [2025 Feb:137:156176. doi: 10.1016/j.phymed.2024.156176/PHYMED-D-24-03041].

Phytomedicine : international journal of phytotherapy and phytopharmacology·2026
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Sep 13, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

684

LS-PRISM: A layer-selective pruning method via low-rank approximation and sparsification for efficient large language

Renshuai Tao1, Hairong Chen1, Yuzhe Guo1

  • 1Beijing Jiaotong University, Beijing, 100044, China.

Neural Networks : the Official Journal of the International Neural Network Society
|August 1, 2025
PubMed
Summary
This summary is machine-generated.

We developed LS-PRISM, a novel method for compressing Large Language Models (LLMs) by selectively pruning layers. This technique significantly reduces model size while maintaining high performance on NLP tasks.

Keywords:
Large language models (LLMs)Low-rank approximationModel compressionSparsificationUnstructured pruning

More Related Videos

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.3K
Lensless Fluorescent Microscopy on a Chip
11:23

Lensless Fluorescent Microscopy on a Chip

Published on: August 17, 2011

17.8K

Related Experiment Videos

Last Updated: Sep 13, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

684
Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.3K
Lensless Fluorescent Microscopy on a Chip
11:23

Lensless Fluorescent Microscopy on a Chip

Published on: August 17, 2011

17.8K

Area of Science:

  • Artificial Intelligence
  • Natural Language Processing
  • Machine Learning

Background:

  • Large Language Models (LLMs) achieve state-of-the-art performance in Natural Language Processing (NLP).
  • The substantial parameter count of LLMs poses deployment challenges for resource-constrained environments.
  • Existing compression methods often apply uniform compression across all layers, potentially impacting performance unevenly.

Purpose of the Study:

  • To introduce LS-PRISM, a novel Layer-Selective Pruning via low-Rank Approximation and Sparsification Method.
  • To efficiently compress LLMs while preserving performance on critical NLP benchmarks.
  • To provide a scalable solution for LLM deployment in resource-limited settings.

Main Methods:

  • LS-PRISM employs layer-selective low-rank approximation based on accuracy and loss impact.
  • Dynamic Rank Selection adaptively determines approximation ranks for optimal performance retention.
  • Unstructured pruning and optional LoRA fine-tuning further enhance model sparsification and performance recovery.

Main Results:

  • Significant reductions in parameter count and storage achieved.
  • Minimal degradation in accuracy observed across NLP benchmarks (BoolQ, RTE, ARC-Challenge).
  • Up to 12% parameter reduction demonstrated on a 2.5B parameter LLM with comparable performance.

Conclusions:

  • LS-PRISM offers an effective and scalable approach for compressing LLMs.
  • The method successfully balances model compression with performance preservation.
  • LS-PRISM is suitable for deploying LLMs in resource-constrained environments.