Global and Local Visual-Textual Alignment for Open Vocabulary Object Detection.

Hao Wang, Tong Jia, Shizhuo Deng

    IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society | June 11, 2026
    Summary
    This summary is machine-generated.

    This study introduces a novel Global and Local Visual-Textual Alignment method to improve open vocabulary object detection by effectively aligning image and text features. The approach enhances detector performance on unseen categories without requiring extensive pre-training data.

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Artificial Intelligence

    Background:

    • Vision-Language Models (VLMs) like CLIP are increasingly integrated into object detection frameworks, enabling open vocabulary object detection.
    • Current methods often rely either on pre-training with large, uncurated image-text pairs or on knowledge distillation. These approaches incur computational overhead and neglect global information alignment.
    • Existing approaches struggle with effective and efficient alignment between visual and textual features in the semantic space for open-set scenes.

    Purpose of the Study:

    • To propose a novel Global and Local Visual-Textual Alignment method for open vocabulary object detection.
    • To address the limitations of current pre-training and knowledge distillation methods by integrating global and local feature alignment.
    • To improve the ability of object detectors to perceive unseen objects by enhancing visual-textual feature alignment.

    Main Methods:

    • Developed a unified learning paradigm integrating global image-caption alignment and local region-prompt alignment.
    • Global alignment applies contrastive learning between whole-image and caption representations obtained from CLIP's components.
    • Local alignment focuses on matching region embeddings from CLIP's image encoder with textual token prompts from its text encoder, coupled with a prompt tuning strategy.
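The two alignment objectives listed above can be sketched in a few lines of plain Python. This is an illustrative sketch only, not the authors' implementation: the summary does not state the exact loss, so a CLIP-style symmetric contrastive (InfoNCE) loss is assumed for the global term and a softmax cross-entropy over cosine similarities is assumed for the local term. The function names, the temperature of 0.07, and the toy embeddings are all hypothetical, and the prompt tuning strategy is not modeled.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def similarity_matrix(a, b, temperature):
    """Cosine similarity of every row in a against every row in b, divided by temperature."""
    a = [l2_normalize(v) for v in a]
    b = [l2_normalize(v) for v in b]
    return [[sum(x * y for x, y in zip(u, w)) / temperature for w in b] for u in a]


def log_softmax(row):
    """Numerically stable log-softmax over one row of logits."""
    m = max(row)
    lse = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - lse for x in row]


def global_contrastive_loss(image_embs, caption_embs, temperature=0.07):
    """Assumed global term: symmetric InfoNCE where image i and caption i form the positive pair."""
    logits = similarity_matrix(image_embs, caption_embs, temperature)
    n = len(logits)
    image_to_text = -sum(log_softmax(logits[i])[i] for i in range(n)) / n
    columns = [list(c) for c in zip(*logits)]
    text_to_image = -sum(log_softmax(columns[i])[i] for i in range(n)) / n
    return 0.5 * (image_to_text + text_to_image)


def local_region_prompt_loss(region_embs, prompt_embs, labels, temperature=0.07):
    """Assumed local term: cross-entropy of each region against the category prompt embeddings."""
    logits = similarity_matrix(region_embs, prompt_embs, temperature)
    return -sum(log_softmax(row)[y] for row, y in zip(logits, labels)) / len(labels)


def predict_categories(region_embs, prompt_embs):
    """Assign each region to the prompt with the highest cosine similarity."""
    logits = similarity_matrix(region_embs, prompt_embs, 1.0)
    return [max(range(len(row)), key=row.__getitem__) for row in logits]


# Toy embeddings (hypothetical values, 3-dimensional for readability).
images = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
captions = [[0.9, 0.1, 0.0], [0.1, 0.9, 0.0]]
regions = [[0.8, 0.2, 0.0], [0.0, 0.1, 0.9]]
prompts = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

total = global_contrastive_loss(images, captions) + local_region_prompt_loss(
    regions, prompts, labels=[0, 2]
)
print(predict_categories(regions, prompts))  # [0, 2]
```

Because a region is classified by comparing it against text embeddings rather than against a fixed classifier head, a category unseen during training can be added at inference time by appending its prompt embedding to `prompts`. How the two terms are weighted in the actual method is not given in the summary; the unweighted sum above is a placeholder.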

    Main Results:

    • The proposed method was implemented on Faster R-CNN and evaluated on OV-COCO and OV-LVIS benchmarks.
    • Achieved clear improvements over existing methods on detecting novel categories.
    • Demonstrated favorable performance compared to state-of-the-art approaches in open vocabulary object detection.

    Conclusions:

    • The Global and Local Visual-Textual Alignment method effectively enhances open vocabulary object detection by unifying global and local feature alignment strategies.
    • The approach offers a parameter-efficient way to adapt VLMs like CLIP for downstream object detection tasks.
    • This method provides a promising direction for improving the perception of unseen objects in computer vision systems.