Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Problem-Solving: Tuning of a Guitar String01:04

Problem-Solving: Tuning of a Guitar String

1.1K
In the case of stringed instruments like the guitar, the elastic property that determines the speed of the sound produced is its linear mass density or the mass per unit length. This is simply called the linear density. If the string's linear density is constant along the string, then the linear density is simply the total mass divided by the total length.
The string's wave speed can be regulated by varying the linear density. Tension is the other property that determines the speed of...
1.1K
Vision01:24

Vision

60.0K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
60.0K
Language01:16

Language

910
Language is a unique communication system that uses words and systematic rules to organize and transmit information. Unlike other forms of communication, which may involve postures, movements, odors, or vocalizations, language relies on symbols and grammar. This makes human communication distinct from that of other species, who also communicate but do not use language in the same way humans do.
Corballis and Suddendorf (2007) and Tomasello and Rakoczy (2003) highlight the role of language in...
910
Color Vision01:24

Color Vision

1.5K
Color perception begins in the retina, the light-sensitive layer at the back of the eye. Two main theories explain how colors are seen: the trichromatic theory and the opponent-process theory. The trichromatic theory, proposed by Thomas Young in 1802 and extended by Hermann von Helmholtz in 1852, suggests that color vision is based on three types of cone receptors in the retina. These cones are sensitive to different but overlapping ranges of wavelengths corresponding to red, blue, and green.
1.5K
Components of Language01:24

Components of Language

820
Language, whether spoken, signed, or written, consists of specific components: lexicon and grammar. The lexicon is the vocabulary of a language, comprising its words. Grammar is the set of rules used to convey meaning through the lexicon. For example, English grammar adds “-ed” to most verbs to indicate past tense. Words are formed by combining phonemes, which are the basic sound units of a language. Different languages have different sets of phonemes (e.g., “ah” vs.
820
Language Development01:22

Language Development

912
Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
912

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The PTHR1/PKA/CREB1 axis promotes osteosarcoma progression by activating the PVT1/miR-590-3p/AXIN2 ceRNA network to induce epithelial-mesenchymal transition.

Biology direct·2026
Same author

The Effects of Different Organic Amendment Strategies on Soil Properties and Microbial Communities in Maize Monocropping.

Plants (Basel, Switzerland)·2026
Same author

Decoding the hidden resistome: Single-cell Raman detection of viable but non-culturable antibiotic-resistant bacteria in aquatic environments.

Journal of hazardous materials·2026
Same author

Task-aware cross-modal refinement and liquid fusion for text-visual grounding.

Frontiers in artificial intelligence·2026
Same author

Amphibian skin peptides: Diversity, biological functions, and research progress.

Toxicon : official journal of the International Society on Toxinology·2026
Same author

Analysis of the Oncogenic Role of Colony-Stimulating Factor 1 Receptor (CSF1R) in Pancreatic Adenocarcinoma.

Anti-cancer agents in medicinal chemistry·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Feb 1, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.2K

NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models.

Jiaming Zhang, Xin Wang, Xingjun Ma

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |January 30, 2026
    PubMed
    Summary
    This summary is machine-generated.

    Neural Augmentor framework for Multi-modal Adversarial Prompt Tuning (NAP-Tuning) enhances vision-language model security. It purifies features at the internal level, significantly improving adversarial robustness against attacks.

    More Related Videos

    Experience is Instrumental in Tuning a Link Between Language and Cognition: Evidence from 6- to 7- Month-Old Infants' Object Categorization
    05:35

    Experience is Instrumental in Tuning a Link Between Language and Cognition: Evidence from 6- to 7- Month-Old Infants' Object Categorization

    Published on: April 19, 2017

    7.1K
    Tuning Oxide Properties by Oxygen Vacancy Control During Growth and Annealing
    06:44

    Tuning Oxide Properties by Oxygen Vacancy Control During Growth and Annealing

    Published on: June 9, 2023

    3.8K

    Related Experiment Videos

    Last Updated: Feb 1, 2026

    Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
    03:14

    Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

    Published on: December 6, 2024

    1.2K
    Experience is Instrumental in Tuning a Link Between Language and Cognition: Evidence from 6- to 7- Month-Old Infants' Object Categorization
    05:35

    Experience is Instrumental in Tuning a Link Between Language and Cognition: Evidence from 6- to 7- Month-Old Infants' Object Categorization

    Published on: April 19, 2017

    7.1K
    Tuning Oxide Properties by Oxygen Vacancy Control During Growth and Annealing
    06:44

    Tuning Oxide Properties by Oxygen Vacancy Control During Growth and Annealing

    Published on: June 9, 2023

    3.8K

    Area of Science:

    • Computer Vision
    • Natural Language Processing
    • Machine Learning Security

    Background:

    • Vision-Language Models (VLMs) excel at joint visual-textual understanding but are vulnerable to adversarial attacks.
    • Existing defenses like Adversarial Prompt Tuning (AdvPT) improve robustness but can be enhanced.
    • Adversarial perturbations pose significant security risks to VLMs.

    Purpose of the Study:

    • To introduce the Neural Augmentor framework for Multi-modal Adversarial Prompt Tuning (NAP-Tuning).
    • To enhance adversarial robustness in VLMs through multi-modal, multi-layer feature purification.
    • To develop an adaptive defense mechanism for identifying and rectifying adversarial perturbations.

    Main Methods:

    • Developed a comprehensive multi-modal (text and visual) and multi-layer prompting framework (NAP-Tuning).
    • Implemented a Neural Augmentor approach with TokenRefiners for feature-level purification via residual connections.
    • Conducted experiments across various datasets and attack types, including AutoAttack.

    Main Results:

    • NAP-Tuning significantly outperforms existing adversarial robustness methods.
    • Achieved substantial improvements over baselines under AutoAttack (32.3% on ViT-B16, 31.3% on ViT-B32).
    • Maintained competitive clean accuracy while enhancing adversarial defense.

    Conclusions:

    • Internal feature-level intervention is effective for prompt tuning in adversarial robustness.
    • NAP-Tuning offers an adaptive defense by rectifying perturbations within embedding spaces.
    • This approach moves beyond input-side alignment for more robust VLMs.