Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Problem-Solving: Tuning of a Guitar String

Problem-Solving: Tuning of a Guitar String

In the case of stringed instruments like the guitar, the elastic property that determines the speed of the sound produced is its linear mass density or the mass per unit length. This is simply called the linear density. If the string's linear density is constant along the string, then the linear density is simply the total mass divided by the total length.
The string's wave speed can be regulated by varying the linear density. Tension is the other property that determines the speed of...

Vision

Vision

Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.

Language

Language

Language is a unique communication system that uses words and systematic rules to organize and transmit information. Unlike other forms of communication, which may involve postures, movements, odors, or vocalizations, language relies on symbols and grammar. This makes human communication distinct from that of other species, who also communicate but do not use language in the same way humans do.
Corballis and Suddendorf (2007) and Tomasello and Rakoczy (2003) highlight the role of language in...

Color Vision

Color Vision

Color perception begins in the retina, the light-sensitive layer at the back of the eye. Two main theories explain how colors are seen: the trichromatic theory and the opponent-process theory. The trichromatic theory, proposed by Thomas Young in 1802 and extended by Hermann von Helmholtz in 1852, suggests that color vision is based on three types of cone receptors in the retina. These cones are sensitive to different but overlapping ranges of wavelengths corresponding to red, blue, and green.

Components of Language

Components of Language

Language, whether spoken, signed, or written, consists of specific components: lexicon and grammar. The lexicon is the vocabulary of a language, comprising its words. Grammar is the set of rules used to convey meaning through the lexicon. For example, English grammar adds “-ed” to most verbs to indicate past tense. Words are formed by combining phonemes, which are the basic sound units of a language. Different languages have different sets of phonemes (e.g., “ah” vs.

Language Development

Language Development

Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

The PTHR1/PKA/CREB1 axis promotes osteosarcoma progression by activating the PVT1/miR-590-3p/AXIN2 ceRNA network to induce epithelial-mesenchymal transition.

Biology direct·2026

Same author

The Effects of Different Organic Amendment Strategies on Soil Properties and Microbial Communities in Maize Monocropping.

Plants (Basel, Switzerland)·2026

Same author

Decoding the hidden resistome: Single-cell Raman detection of viable but non-culturable antibiotic-resistant bacteria in aquatic environments.

Journal of hazardous materials·2026

Same author

Task-aware cross-modal refinement and liquid fusion for text-visual grounding.

Frontiers in artificial intelligence·2026

Same author

Amphibian skin peptides: Diversity, biological functions, and research progress.

Toxicon : official journal of the International Society on Toxinology·2026

Same author

Analysis of the Oncogenic Role of Colony-Stimulating Factor 1 Receptor (CSF1R) in Pancreatic Adenocarcinoma.

Anti-cancer agents in medicinal chemistry·2026

Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 1, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models.

Jiaming Zhang, Xin Wang, Xingjun Ma

IEEE Transactions on Pattern Analysis and Machine Intelligence

|January 30, 2026

Summary

This summary is machine-generated.

Neural Augmentor framework for Multi-modal Adversarial Prompt Tuning (NAP-Tuning) enhances vision-language model security. It purifies features at the internal level, significantly improving adversarial robustness against attacks.

More Related Videos

Experience is Instrumental in Tuning a Link Between Language and Cognition: Evidence from 6- to 7- Month-Old Infants' Object Categorization

Experience is Instrumental in Tuning a Link Between Language and Cognition: Evidence from 6- to 7- Month-Old Infants' Object Categorization

Published on: April 19, 2017

Tuning Oxide Properties by Oxygen Vacancy Control During Growth and Annealing

Tuning Oxide Properties by Oxygen Vacancy Control During Growth and Annealing

Published on: June 9, 2023

Related Experiment Videos

Last Updated: Feb 1, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Experience is Instrumental in Tuning a Link Between Language and Cognition: Evidence from 6- to 7- Month-Old Infants' Object Categorization

Experience is Instrumental in Tuning a Link Between Language and Cognition: Evidence from 6- to 7- Month-Old Infants' Object Categorization

Published on: April 19, 2017

Tuning Oxide Properties by Oxygen Vacancy Control During Growth and Annealing

Tuning Oxide Properties by Oxygen Vacancy Control During Growth and Annealing

Published on: June 9, 2023

Area of Science:

Computer Vision
Natural Language Processing
Machine Learning Security

Background:

Vision-Language Models (VLMs) excel at joint visual-textual understanding but are vulnerable to adversarial attacks.
Existing defenses like Adversarial Prompt Tuning (AdvPT) improve robustness but can be enhanced.
Adversarial perturbations pose significant security risks to VLMs.

Purpose of the Study:

To introduce the Neural Augmentor framework for Multi-modal Adversarial Prompt Tuning (NAP-Tuning).
To enhance adversarial robustness in VLMs through multi-modal, multi-layer feature purification.
To develop an adaptive defense mechanism for identifying and rectifying adversarial perturbations.

Main Methods:

Developed a comprehensive multi-modal (text and visual) and multi-layer prompting framework (NAP-Tuning).
Implemented a Neural Augmentor approach with TokenRefiners for feature-level purification via residual connections.
Conducted experiments across various datasets and attack types, including AutoAttack.

Main Results:

NAP-Tuning significantly outperforms existing adversarial robustness methods.
Achieved substantial improvements over baselines under AutoAttack (32.3% on ViT-B16, 31.3% on ViT-B32).
Maintained competitive clean accuracy while enhancing adversarial defense.

Conclusions:

Internal feature-level intervention is effective for prompt tuning in adversarial robustness.
NAP-Tuning offers an adaptive defense by rectifying perturbations within embedding spaces.
This approach moves beyond input-side alignment for more robust VLMs.