Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies
  1. Home
  2. Fine: Fine-grained Neuron-level Model Editing For Reliable And Safe Llms.
  1. Home
  2. Fine: Fine-grained Neuron-level Model Editing For Reliable And Safe Llms.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Time-Aware Tranformer-Based Prediction Model for AECOPD.

Studies in health technology and informatics·2025
Same author

Establishment and validation of an immune-related genes diagnostic model and experimental validation of diagnostic biomarkers for autoimmune thyroiditis based on RNA-seq.

International immunopharmacology·2025
Same author

Establishment and Validation of Sandell-Kolthoff Method for Saliva Iodine Determination.

Biological trace element research·2024
Same author

Preparation and Application of a Novel Liquid Oxygen-Compatible Epoxy Resin of Fluorinated Glycidyl Amine with Low Viscosity.

Polymers·2024
Same author

Effect of progestin on thyroid function in female Wistar rats.

Frontiers in endocrinology·2024
Same author

Cryogenic Mechanical Properties and Stability of Polymer Films for Liquid Oxygen Hoses.

Polymers·2023
Same journal

Raising the Bar in Graph OOD Generalization: Invariant Learning beyond Explicit Environment Modeling.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

LoRASculpt: Harmonious Low-Rank Adaptation for Multimodal Large Language Models.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Linearly Solving Robust Rotation Estimation.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Adapting Dense Vision-Language Relationships for Multi-label Classification with Partial Label.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Forensics Adapter: Unleashing CLIP for Generalizable Face Forgery Detection.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

MoE-Enhanced Explainable Deep Manifold Transformation for Complex Data Embedding and Visualization.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions
07:34

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions

Published on: March 25, 2014

FiNE: Fine-grained Neuron-level Model Editing for Reliable and Safe LLMs.

Xun Yang, Haowen Pan, Xiaozhi Wang

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |May 27, 2026

    View abstract on PubMed

    Summary
    This summary is machine-generated.

    We introduce Fine-grained Neuron-level Editing (FiNE), a new method for improving Large Language Models (LLMs). FiNE enhances LLM reliability and safety by precisely editing internal network components, leading to more robust and accurate outputs.

    More Related Videos

    Laser Capture Microdissection of Enriched Populations of Neurons or Single Neurons for Gene Expression Analysis After Traumatic Brain Injury
    13:32

    Laser Capture Microdissection of Enriched Populations of Neurons or Single Neurons for Gene Expression Analysis After Traumatic Brain Injury

    Published on: April 10, 2013

    Related Experiment Videos

    A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions
    07:34

    A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions

    Published on: March 25, 2014

    Laser Capture Microdissection of Enriched Populations of Neurons or Single Neurons for Gene Expression Analysis After Traumatic Brain Injury
    13:32

    Laser Capture Microdissection of Enriched Populations of Neurons or Single Neurons for Gene Expression Analysis After Traumatic Brain Injury

    Published on: April 10, 2013

    Area of Science:

    • Artificial Intelligence
    • Machine Learning
    • Natural Language Processing

    Background:

    • Large Language Models (LLMs) require enhanced reliability and safety measures.
    • Existing editing methods like ROME may overlook internal network structures, causing brittle updates.
    • There is a need for more robust and fine-grained editing techniques for LLMs.

    Purpose of the Study:

    • To propose FiNE (Fine-grained Neuron-level Editing), a unified framework for LLM editing.
    • To address limitations of prior methods by focusing on internal Feed-Forward Network (FFN) structures.
    • To enhance both factual accuracy and safety of LLM outputs.

    Main Methods:

    • FiNE employs gradient-free localization and fine-grained parameter adjustment within FFNs.
    • A theoretically grounded contribution score identifies concept-relevant neurons, acting as an efficient alternative to gradient-based methods.
    • A composite loss function balances editing accuracy, model consistency, and output diversity.

    Main Results:

    • FiNE demonstrates superior precision, robustness, and efficiency in knowledge and safety editing tasks.
    • Experiments on KnowEdit and SafeEdit benchmarks show minimal disruption to the LLM's general behavior.
    • FiNE achieves state-of-the-art results on both factual correction and safety enhancement.

    Conclusions:

    • Fine-grained neuron editing is an effective and scalable approach for improving LLMs.
    • FiNE offers a unified framework for building more reliable and safer Large Language Models.
    • The proposed method advances the field of LLM interpretability and controllability.