Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Censoring Survival Data01:09

Censoring Survival Data

Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different reasons...
Types of Errors: Detection and Minimization01:12

Types of Errors: Detection and Minimization

Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Interpretable Sensor Change Detection via Conditional Cauchy-Schwarz Divergence.

Sensors (Basel, Switzerland)·2026
Same author

miR-199b-5p Regulates Immune-Mediated Allograft Rejection after Lung Transplantation Through the GSK3β and NF-κB Pathways.

Inflammation·2018
Same author

A signature based on survival-related genes identifies high-risk glioblastomas harboring immunosuppressive and aggressive ECM characteristics.

Zhong nan da xue xue bao. Yi xue ban = Journal of Central South University. Medical sciences·2018
Same author

Curcumin enhances anti-tumor immune response in tongue squamous cell carcinoma.

Archives of oral biology·2018
Same author

Analyses of risk factors for polycystic ovary syndrome complicated with non-alcoholic fatty liver disease.

Experimental and therapeutic medicine·2018
Same author

Metformin Therapy for Pulmonary Hypertension Associated with Heart Failure with Preserved Ejection Fraction versus Pulmonary Arterial Hypertension.

American journal of respiratory and critical care medicine·2018
Same journal

Research on a Regional Availability Evaluation Model for Road-Area High-Entropy Energy Based on Synergy Factors.

Entropy (Basel, Switzerland)·2026
Same journal

Atmospheric Turbulence Channel Modeling and Performance Analysis of a CO-ZP-OFDM Coherent Optical Communication System for UAV Air-to-Ground Scenarios.

Entropy (Basel, Switzerland)·2026
Same journal

Information Geometry and Asymptotic Theory for SMML Estimators.

Entropy (Basel, Switzerland)·2026
Same journal

Correlation Entropy and Power-Law Kinetics.

Entropy (Basel, Switzerland)·2026
Same journal

Research on the Contagion of Systemic Financial Risk Under the Impact of Climate Risks-From the Perspective of Complex Networks and Machine Learning.

Entropy (Basel, Switzerland)·2026
Same journal

The Statistical-Mechanical Meaning of the Wave Function of Quantum Mechanics.

Entropy (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Jun 27, 2026

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

RIB-Guard: A Risk-Aware Information Bottleneck Defense for Black-Box Large Language Models.

Muen Cai1, Yuan Shen2, Xiong Luo3

  • 1School of Computer Science and Engineering, University of Electronic Science and Technology of China, No. 2006, Xiyuan Ave, Chengdu 611731, China.

Entropy (Basel, Switzerland)
|June 26, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces RIB-Guard, a novel defense against large language model (LLM) jailbreaks in black-box settings. RIB-Guard enhances LLM security by learning a token-level masking policy for improved prompt protection.

Keywords:
information bottleneckjailbreak defenselarge language modelsprompt maskingreinforcement learning

Related Experiment Videos

Last Updated: Jun 27, 2026

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

  • Artificial Intelligence
  • Natural Language Processing
  • Cybersecurity

Background:

  • Large language models (LLMs) are susceptible to jailbreak attacks, particularly in black-box scenarios.
  • Existing defenses often require white-box access or rely on the model's inherent alignment, limiting their applicability.
  • Information bottleneck methods frame prompt protection as a compression task but face optimization challenges.

Purpose of the Study:

  • To develop a novel defense mechanism, RIB-Guard, for enhancing the security of black-box large language models against jailbreak attacks.
  • To address the limitations of existing information bottleneck defenses by enabling black-box optimization and model-agnostic safety guidance.
  • To create a framework that balances prompt compactness, utility preservation, and residual risk reduction.

Main Methods:

  • Proposed RIB-Guard, a safety-aware information bottleneck defense specifically designed for black-box LLMs.
  • Implemented a token-level masking policy learned via reinforcement learning using only black-box feedback.
  • Introduced an independent lightweight safety head to estimate residual jailbreak risk and provide model-agnostic safety guidance.

Main Results:

  • Demonstrated improved jailbreak robustness in direct single-turn harmful and benign prompt settings.
  • Showcased competitive preservation of benign utility alongside enhanced security.
  • Successfully extended information bottleneck-based prompt protection from white-box to black-box environments.

Conclusions:

  • RIB-Guard offers a significant advancement in defending black-box LLMs against jailbreak attacks.
  • The proposed framework provides a step towards safety-aware, information-theoretic front-end defenses for LLMs.
  • This research contributes to more secure and reliable deployment of large language models.