Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

510
The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...
510
Perception of Sound Waves01:01

Perception of Sound Waves

4.8K
The human ear is not equally sensitive to all frequencies in the audible range. It may perceive sound waves with the same pressure but different frequencies as having different loudness. Moreover, the perception of sound waves depends on the health of an individual's ears, which decays with age. The health of one's ears may also be affected by regular exposure to loud noises.
The pitch of a sound depends on the frequency and the pressure amplitude of the source. Two sounds of the same...
4.8K
Design Example: Identifying the Locations of Monuments in the Field Using Global Positioning System Device01:30

Design Example: Identifying the Locations of Monuments in the Field Using Global Positioning System Device

191
Surveyors use Global Positioning System (GPS) technology to measure the precise location and elevation of points on Earth. In a recent survey, GPS receivers were used to determine the coordinates and elevations of two park monuments. The process involved careful mission planning, data collection, and correction to ensure accuracy. The survey began with mission planning to identify optimal satellite visibility and minimize Position Dilution of Precision (PDOP). A geodetic control point...
191
Auditory Perception01:17

Auditory Perception

653
The auditory system is essential for sound perception, utilizing various critical structures. When sound waves enter the outer ear, they travel through the ear canal and cause the eardrum to vibrate. These vibrations are then transmitted to the middle ear, where three tiny bones – the malleus, incus, and stapes – amplify the sound. This amplification is crucial, as it ensures that the sound vibrations are strong enough to be conveyed to the inner ear. These vibrations then reach the...
653
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

1.1K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
1.1K
Distance Measurements by Taping01:18

Distance Measurements by Taping

142
Tapes are essential in surveying for accurate, durable, and short-distance measurements. Made from lightweight, nylon-coated steel, they offer flexibility and strength for rugged outdoor use. The nylon coating protects against rust and wear, extending the tape's life. Standard lengths, around 30 meters, are marked in meters and millimeters for precision.Surveyors select tapes based on site conditions and accuracy needs. Lightweight, nylon-coated tapes are commonly used for ease of handling and...
142

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Dextromethorphan-bupropion-associated pharmacovigilance signals based on the FAERS database: An observational study.

Medicine·2026
Same author

Development of the authentication and authorization processes for the iAgree portal, a platform for patient-controlled data sharing across health systems.

JAMIA open·2026
Same author

Risk factors for severe <i>Chlamydia pneumoniae</i> pneumonia in children: a retrospective case-control study.

Frontiers in pediatrics·2026
Same author

Relative contributions of climate factors and air pollution to childhood allergic diseases in Chongqing, China.

BMC public health·2026
Same author

DNA methylation-mediated suppression of endocytosis confers resistance to duck hepatitis A virus type 3.

Microbiology spectrum·2026
Same author

Corrigendum to "Pharmacological effects of indole alkaloids from Alstonia scholaris (L.) R. Br. on pulmonary fibrosis in vivo" [J. Ethnopharmacol. 267 (2021) 113506].

Journal of ethnopharmacology·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Oct 9, 2025

Sound Source Localization Testing in Single-sided Deafness Following Bone Conduction Intervention
04:32

Sound Source Localization Testing in Single-sided Deafness Following Bone Conduction Intervention

Published on: December 20, 2024

508

Class-Aware Sounding Objects Localization via Audiovisual Correspondence.

Di Hu, Yake Wei, Rui Qian

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |December 23, 2021
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a novel framework for machines to identify and categorize sound-producing objects in complex scenes, improving audiovisual understanding. The method effectively localizes objects and filters out silent areas, enhancing machine perception capabilities.

    More Related Videos

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.2K
    Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind
    09:01

    Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

    Published on: March 27, 2013

    14.5K

    Related Experiment Videos

    Last Updated: Oct 9, 2025

    Sound Source Localization Testing in Single-sided Deafness Following Bone Conduction Intervention
    04:32

    Sound Source Localization Testing in Single-sided Deafness Following Bone Conduction Intervention

    Published on: December 20, 2024

    508
    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.2K
    Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind
    09:01

    Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

    Published on: March 27, 2013

    14.5K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Signal Processing

    Background:

    • Humans effortlessly link sounds to objects, a capability challenging for machines.
    • Current methods struggle with class-aware localization of sound-producing objects without category labels.

    Purpose of the Study:

    • To develop a framework for class-aware sounding object localization and recognition using only audio-visual correspondence.
    • To enable machines to identify and locate objects that generate sound in complex environments.

    Main Methods:

    • A two-stage, step-by-step learning framework utilizing coarse-grained audiovisual correspondence.
    • Establishing a category-representation object dictionary from visual features of sounding areas.
    • Employing category-level audiovisual consistency for fine-grained alignment.

    Main Results:

    • Superior performance in localizing and recognizing sound-producing objects.
    • Effective suppression of silent areas in audiovisual scenes.
    • Successful transfer of the learned network to unsupervised object detection.

    Conclusions:

    • The proposed framework significantly advances class-aware sounding object localization and recognition.
    • The method demonstrates robustness in complex audiovisual scenarios.
    • Potential applications in unsupervised object detection and enhanced machine perception.