



Updated: Mar 14, 2026


LearnMat: Semantic-Aware Self-Supervision Fine-Grained Visual Recognition.

Shuaiheng Li, Qing Cai, Fan Zhang

    IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society | March 12, 2026
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces LearnMat, a novel self-supervised learning framework for fine-grained visual recognition. LearnMat effectively filters irrelevant patterns and extracts subtle discriminative features, significantly improving recognition accuracy.


    Area of Science:

    • Computer Vision
    • Machine Learning
    • Artificial Intelligence

    Background:

    • Self-supervised learning (SSL) shows promise for fine-grained visual recognition (FGVR).
    • Existing SSL methods struggle with irrelevant patterns and subtle differences crucial for FGVR.
    • Current approaches are predominantly unimodal, neglecting the potential of vision-language models (VLMs).

    Purpose of the Study:

    • To develop a novel self-supervised learning framework, LearnMat, for enhanced FGVR.
    • To address limitations of existing methods in handling irrelevant features and capturing subtle discriminative details.
    • To explore the untapped potential of VLMs in self-supervised FGVR.

    Main Methods:

    • Proposed the LearnMat framework with two key modules: Semantic Awareness Module (SAM) and Insight Extraction Module (IEM).
    • SAM utilizes a vision-language-grounded semantic distillation strategy with generic textual attributes for semantic constraints and robustness.
    • IEM employs gradient-based signals to highlight subtle differences, localize discriminative regions, and mitigate intra-class variation and inter-class similarity.
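
The two ideas named in the methods above can be illustrated with a small toy sketch. This is not the authors' implementation: the summary does not give the SAM or IEM formulas, so the example below assumes a generic distillation loss (KL divergence between two softmax distributions over text-attribute similarities) and a generic gradient-times-input saliency for a linear scorer. All function names, embeddings, and weights are hypothetical and chosen only for illustration.

```python
import math


def softmax(scores, temperature=1.0):
    """Numerically stable softmax over a list of floats."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def attribute_distribution(image_embedding, attribute_embeddings, temperature=1.0):
    """Softmax over similarities between one image embedding and each
    text-attribute embedding (assumed form of a semantic target)."""
    sims = [dot(image_embedding, attr) for attr in attribute_embeddings]
    return softmax(sims, temperature)


def kl_divergence(teacher, student, eps=1e-12):
    """KL(teacher || student) for two discrete distributions;
    used here as a stand-in distillation loss."""
    return sum(t * math.log((t + eps) / (s + eps)) for t, s in zip(teacher, student))


def gradient_times_input(weights, features):
    """Saliency for a linear score w.x: the gradient of the score with
    respect to x_i is w_i, so the saliency of feature i is |w_i * x_i|."""
    return [abs(w * x) for w, x in zip(weights, features)]


if __name__ == "__main__":
    # Hypothetical 2-D embeddings for three generic textual attributes.
    attributes = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
    teacher = attribute_distribution([2.0, 0.1], attributes)
    student = attribute_distribution([1.0, 0.5], attributes)
    print("distillation loss:", kl_divergence(teacher, student))

    # Hypothetical classifier weights and region features.
    saliency = gradient_times_input([0.1, -2.0, 0.5], [1.0, 0.8, 0.2])
    print("most salient feature index:", saliency.index(max(saliency)))
```

In this toy setting, minimizing the KL term would pull the student's attribute distribution toward the teacher's, and the saliency list ranks input features by how strongly they drive the score. A real vision model would need automatic differentiation rather than the closed-form gradient used here.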

    Main Results:

    • LearnMat effectively filters irrelevant feature interference during training.
    • The framework successfully extracts more important and subtle discriminative features.
    • Experiments demonstrated significant performance improvements over state-of-the-art methods on multiple FGVR datasets.

    Conclusions:

    • LearnMat offers a robust and effective approach to self-supervised FGVR.
    • The proposed framework enhances fine-grained discrimination by focusing on critical subtle differences.
    • LearnMat represents a significant advancement in leveraging VLMs for self-supervised FGVR tasks.