Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Perception01:28

Perception

517
Perception is a fundamental psychological process that enables individuals to organize, interpret, and consciously experience sensory information. This process is crucial for understanding and interacting with the world around us. It includes both bottom-up and top-down processing, each playing a distinct role in how we perceive our environment.
Bottom-up processing begins at the sensory level, where receptors detect external environmental stimuli. These could include the tactile sensation of...
517
Parallel Processing01:20

Parallel Processing

186
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
186

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Rapid and non-targeted screening of multiple antibiotic fermentation residues in single-cell protein feed using HS-GC-IMS and chemometrics.

Food chemistry·2026
Same author

Integrative lipidomics profiles and explainable machine learning for species-specific identification of meat and bone meal.

Journal of chromatography. A·2026
Same author

CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection.

IEEE transactions on pattern analysis and machine intelligence·2025
Same author

FSDM: An efficient video super-resolution method based on Frames-Shift Diffusion Model.

Neural networks : the official journal of the International Neural Network Society·2025
Same author

JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation.

IEEE transactions on pattern analysis and machine intelligence·2025
Same author

Group Visual Relation Detection.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2025
Same journal

Benchmarking the Robustness of Autonomous Driving to Environmental Illusions: A Lane Perception Perspective.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Topology-Aware Representations via Test-Time Adaptation for Anomaly Segmentation.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

TraGraph-GS: Trajectory Graph-based Gaussian Splatting for Arbitrary Large-Scale Scene Rendering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

SWIFT: A Small-World Interaction Framework for Flow-Aware Trajectory Prediction in Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

HardFlow: Hard-Constrained Sampling for Flow-Matching Models Via Trajectory Optimization.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Industrial Brain: Self-Evolving Neuro-Symbolic Autonomy with Causal Resilience for Cyber-Physical Systems.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jul 27, 2025

Testing Sensory and Multisensory Function in Children with Autism Spectrum Disorder
09:13

Testing Sensory and Multisensory Function in Children with Autism Spectrum Disorder

Published on: April 22, 2015

16.6K

Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection.

Jing Tan, Yuhong Wang, Gangshan Wu

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |June 6, 2023
    PubMed
    Summary
    This summary is machine-generated.

    Temporal Perceiver offers a unified Transformer-based solution for detecting various generic boundaries in videos. This novel architecture achieves state-of-the-art results across multiple benchmarks, improving video understanding.

    More Related Videos

    Author Spotlight: Exploring the Link Between Time Perception of Visual Stimuli and Reading Skills
    09:27

    Author Spotlight: Exploring the Link Between Time Perception of Visual Stimuli and Reading Skills

    Published on: January 19, 2024

    1.3K
    Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments
    13:00

    Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments

    Published on: January 23, 2017

    10.0K

    Related Experiment Videos

    Last Updated: Jul 27, 2025

    Testing Sensory and Multisensory Function in Children with Autism Spectrum Disorder
    09:13

    Testing Sensory and Multisensory Function in Children with Autism Spectrum Disorder

    Published on: April 22, 2015

    16.6K
    Author Spotlight: Exploring the Link Between Time Perception of Visual Stimuli and Reading Skills
    09:27

    Author Spotlight: Exploring the Link Between Time Perception of Visual Stimuli and Reading Skills

    Published on: January 19, 2024

    1.3K
    Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments
    13:00

    Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments

    Published on: January 23, 2017

    10.0K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Artificial Intelligence

    Background:

    • Generic Boundary Detection (GBD) is crucial for long-form video understanding, segmenting videos into coherent units.
    • Existing GBD methods often use specialized deep networks for different boundary types (shot, event, scene), lacking a unified approach.
    • Previous models face challenges with computational complexity, particularly the quadratic relationship between attention operations and input frames.

    Purpose of the Study:

    • To introduce Temporal Perceiver, a unified Transformer-based architecture for detecting arbitrary generic boundaries in videos.
    • To reduce the computational complexity of attention mechanisms for video processing.
    • To enhance the generalization ability of GBD models across diverse datasets.

    Main Methods:

    • Developed Temporal Perceiver, a Transformer architecture utilizing a small set of latent feature queries as anchors.
    • Employed cross-attention blocks to compress video input into a fixed dimension, achieving linear complexity with respect to input frames.
    • Introduced boundary and context queries to capture temporal structure, along with an alignment loss to guide query learning and a sparse detection head for direct output.

    Main Results:

    • Achieved state-of-the-art performance on multiple GBD benchmarks (SoccerNet-v2, Kinetics-GEBD, TAPOS, MovieScenes, MovieNet) using single-stream RGB features.
    • Demonstrated significant improvements in average-mAP and average-f1 scores across all tested datasets.
    • A class-agnostic Temporal Perceiver model showed comparable accuracy and superior generalization capabilities compared to dataset-specific models.

    Conclusions:

    • Temporal Perceiver provides a generalized and efficient solution for various types of Generic Boundary Detection.
    • The proposed latent query mechanism effectively reduces computational complexity while maintaining high detection accuracy.
    • The model's strong performance and generalization highlight its potential as a foundational tool for advanced video understanding tasks.