Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Visual System

Visual System

Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...

Source Transformation

Source Transformation

Source transformation is a fundamental technique employed in circuit analysis, offering a valuable tool for simplifying complex electrical circuits. This technique involves the replacement of either a voltage source in series with a resistor by a current source in parallel with a resistor, or vice versa. The key concept here is that when the original sources are deactivated (turned off), the equivalent resistance at the circuit's end terminals remains the same.
It is essential to note that when...

Vision

Vision

Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.

Types Of Transformers

Types Of Transformers

Transformers can provide desired voltages to a circuit by modifying the number of turns in the secondary windings.
If the ratio of the number of turns in the secondary winding to that of the primary winding is greater than one, then the transformer is said to be a step-up transformer. In a step-up transformer, the voltage at the secondary winding is greater than the voltage applied at the primary winding.
However, if this ratio is less than one, the transformer is said to be a step-down...

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

Transformers in Distribution System

Transformers in Distribution System

Transformers in distribution systems can be broadly categorized into distribution substation transformers and other distribution transformers. They are crucial for stepping down high transmission voltages to levels suitable for distribution and end-user applications.
Distribution substation transformers come in various ratings and typically use mineral oil for insulation and cooling. To prevent moisture and air from entering the oil, some transformers use an inert gas like nitrogen to fill the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Study on a Novel Dental Composite Resin with Fluorinated Polyurethane Monomer and Modified Polyether Ether Ketone Fillers.

International dental journal·2026

Same author

Towards a universal JPEG lossless recompression foundation model for pathology images: A transformer context modeling approach.

Medical image analysis·2026

Same author

Improving protein and protein interactions using pseudo-dimers derived from monomeric proteins.

Nature communications·2026

Same author

CRA5 a high-fidelity compressed reanalysis atmospheric dataset for weather and climate research.

Scientific data·2026

Same author

Penetration testing of a quantum key distribution system as a black box.

National science review·2026

Same author

A Commentary on "Disuse Bone Loss in Fusion Constructs After Multilevel Lumbar Fusion: A Computed Tomography Hounsfield Unit Analysis".

Neurospine·2026

Same journal

HardFlow: Hard-Constrained Sampling for Flow-Matching Models Via Trajectory Optimization.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Industrial Brain: Self-Evolving Neuro-Symbolic Autonomy with Causal Resilience for Cyber-Physical Systems.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Adaptive Hardness-Driven Dictionary Distillation for Incomplete Streaming View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Achieving Text-based Person Retrieval with Any Granularity.

IEEE transactions on pattern analysis and machine intelligence·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 23, 2025

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

TransVG++: End-to-End Visual Grounding With Language Conditioned Vision Transformer.

Jiajun Deng, Zhengyuan Yang, Daqing Liu

IEEE Transactions on Pattern Analysis and Machine Intelligence

|July 19, 2023

Summary

This summary is machine-generated.

This study introduces Transformer-based frameworks for visual grounding, simplifying complex fusion mechanisms. The improved TransVG++ model, utilizing Vision Transformer, achieves state-of-the-art results by enhancing vision-language fusion.

More Related Videos

VisualEyes: A Modular Software System for Oculomotor Experimentation

VisualEyes: A Modular Software System for Oculomotor Experimentation

Published on: March 25, 2011

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

Related Experiment Videos

Last Updated: Jul 23, 2025

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

VisualEyes: A Modular Software System for Oculomotor Experimentation

VisualEyes: A Modular Software System for Oculomotor Experimentation

Published on: March 25, 2011

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Traditional visual grounding methods rely on complex, manually-designed fusion mechanisms.
These heuristic designs can lead to overfitting and suboptimal performance on diverse datasets.

Purpose of the Study:

To develop simplified yet effective Transformer-based frameworks for visual grounding.
To improve multi-modal fusion and reasoning capabilities in visual grounding models.
To establish new state-of-the-art performance on visual grounding tasks.

Main Methods:

Proposed TransVG, using Transformers for multi-modal correspondence and direct box coordinate regression.
Introduced TransVG++, a purely Transformer-based framework leveraging Vision Transformer (ViT).
Developed Language Conditioned Vision Transformer for intermediate vision-language fusion, reusing ViT components.

Main Results:

Demonstrated that simple Transformer encoder layers outperform complex fusion modules in TransVG.
TransVG++ achieved significant performance gains by integrating ViT and intermediate fusion.
Reported state-of-the-art records across five prevalent visual grounding datasets.

Conclusions:

Transformer-based approaches offer a more effective and less complex alternative to traditional visual grounding methods.
The TransVG++ framework, with its integrated Vision Transformer, represents a significant advancement in visual grounding accuracy and efficiency.
The proposed methods pave the way for more robust and generalizable visual grounding systems.