Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

ReCoTR: Reducing Semantic Cognitive Shift via Dual-Consensus Token Compression for Remote Sensing Image-Text

Jirui Huang, Yaxiong Chen, Chuang Du

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |May 11, 2026
    PubMed
    Summary
    This summary is machine-generated.

    Related Concept Videos

    You might also read

    Related Articles

    Articles linked to this work by shared authors, journal, and citation graph.

    Sort by
    Same author

    Total Synthesis of Nineteen-Carbon Picrotoxane Terpenoid Picrodendrin L.

    Journal of the American Chemical Society·2026
    Same author

    Physicochemical Dual Cross-Linked Multifunctional Conductive Organohydrogel Sensors for Fireworks Burn Wound Healing and Intelligent Real-Time Monitoring.

    Polymer science & technology (Washington, D.C.)·2026
    Same author

    Precision detection of miniature dam cracks with a multi-scale enhanced YOLO framework for UAV inspection.

    Scientific reports·2026
    Same author

    Reconstruction-Contrast Coupling Learning for Open-Set Semi-Supervised Hyperspectral Image Classification.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same author

    From Bio-Interface Materials to Neural Integration: The Next-Generation Brain-Machine Interfaces Powered by Hydrogels.

    Advanced materials (Deerfield Beach, Fla.)·2026
    Same author

    Noise fingerprint-based infrared fixed pattern noise removal.

    Applied optics·2026
    Same journal

    Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    GoP-based Quality Enhancement on Video Compression.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    See all related articles

    This study introduces ReCoTR, a novel framework for remote sensing (RS) data retrieval. ReCoTR enhances vision-language model (VLM) performance by addressing semantic shifts in RS images, improving urban governance and environmental monitoring.

    Area of Science:

    • Remote Sensing
    • Computer Vision
    • Artificial Intelligence

    Background:

    • Vision-language models (VLMs) show promise for remote sensing (RS) data analysis.
    • Semantic shift in RS images challenges VLM transferability and performance.
    • Existing methods struggle with region-level bias and background noise in RS data.

    Purpose of the Study:

    • To develop an enhanced CLIP-based framework, ReCoTR, for cross-modal retrieval in remote sensing.
    • To address semantic shift, region-level granularity bias, and contextual semantic drift in RS imagery.
    • To improve the accuracy and robustness of semantic understanding for large-scale RS datasets.

    Main Methods:

    • Proposed ReCoTR framework utilizing a Dual Consensus Token Evaluation (DCTE) module.

    Related Experiment Videos

  • DCTE employs a mixture-of-experts strategy to fuse inter-modal semantic consensus and intra-modal structural consistency.
  • Introduced Semantic Confidence Token Compression (SCTC) module to filter and aggregate semantically relevant tokens, reducing noise.
  • Main Results:

    • ReCoTR demonstrated superior performance on bidirectional image-text retrieval tasks across three benchmark RS datasets.
    • The framework effectively mitigates region-level granularity bias and contextual semantic drift.
    • ReCoTR shows robustness in handling background noise and improving semantic alignment in RS data.

    Conclusions:

    • ReCoTR significantly enhances cross-modal retrieval for remote sensing data.
    • The proposed DCTE and SCTC modules effectively address key challenges in RS image-text understanding.
    • ReCoTR offers a robust solution for applications in urban governance, environmental monitoring, and disaster response.