Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Visual System

Visual System

Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...

Force Classification

Force Classification

Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...

Visual Agnosia

Visual Agnosia

Visual agnosia is a condition characterized by the inability to recognize visually presented objects despite having normal vision. For instance, a person with visual agnosia can describe the shape and color of an object but cannot identify or name it. This impairment does not affect their visual field, acuity, color vision, brightness discrimination, language, or memory. An example of this condition in a social setting is someone at a dinner party asking for "that silver thing with a round...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Progressive Fusion of Multi-Scale Mamba Context and Local Detail Priors for Infrared Small Target Detection.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same author

Performance of Age-Adjusted Whole Genome Sequencing Telomere Length in Idiopathic Pulmonary Fibrosis.

American journal of respiratory and critical care medicine·2026

Same author

Publisher Correction: Whole genome sequence analysis of pulmonary function and COPD in 44,287 multi-ancestry participants.

Genome biology·2026

Same author

Optical Coherence Tomography Biomarkers Differentiate Epiretinal Membranes Secondary to Retinal Detachment from Idiopathic Epiretinal Membranes.

Journal of vitreoretinal diseases·2026

Same author

Arrhythmia Burden and Clinical Responses Under Continuous Monitoring in Heart Failure: Observations From the ALLEVIATE-HF Trial.

Journal of the American College of Cardiology·2026

Same author

Risk-Based Nurse-Managed Personalized Heart Failure Interventions: The ALLEVIATE-HF Trial.

Journal of the American College of Cardiology·2026

Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Multi-Branch Tree-based Fusion Neural Architecture Search with Zero-Cost Screen for Multi-Modal Classification.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 14, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

LearnMat: Semantic-Aware Self-Supervision Fine-Grained Visual Recognition.

Shuaiheng Li, Qing Cai, Fan Zhang

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

|March 12, 2026

Summary

This summary is machine-generated.

This study introduces LearnMat, a novel self-supervised learning framework for fine-grained visual recognition. LearnMat effectively filters irrelevant patterns and extracts subtle discriminative features, significantly improving recognition accuracy.

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Related Experiment Videos

Last Updated: Mar 14, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Area of Science:

Computer Vision
Machine Learning
Artificial Intelligence

Background:

Self-supervised learning (SSL) shows promise for fine-grained visual recognition (FGVR).
Existing SSL methods struggle with irrelevant patterns and subtle differences crucial for FGVR.
Current approaches are predominantly unimodal, neglecting the potential of vision-language models (VLMs).

Purpose of the Study:

To develop a novel self-supervised learning framework, LearnMat, for enhanced FGVR.
To address limitations of existing methods in handling irrelevant features and capturing subtle discriminative details.
To explore the untapped potential of VLMs in self-supervised FGVR.

Main Methods:

Proposed the LearnMat framework with two key modules: Semantic Awareness Module (SAM) and Insight Extraction Module (IEM).
SAM utilizes a vision-language-grounded semantic distillation strategy with generic textual attributes for semantic constraints and robustness.
IEM employs gradient-based signals to highlight subtle differences, localize discriminative regions, and mitigate intra-class variation and inter-class similarity.

Main Results:

LearnMat effectively filters irrelevant feature interference during training.
The framework successfully extracts more important and subtle discriminative features.
Experiments demonstrated significant performance improvements over state-of-the-art methods on multiple FGVR datasets.

Conclusions:

LearnMat offers a robust and effective approach to self-supervised FGVR.
The proposed framework enhances fine-grained discrimination by focusing on critical subtle differences.
LearnMat represents a significant advancement in leveraging VLMs for self-supervised FGVR tasks.