Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Absolute Motion Analysis- General Plane Motion01:24

Absolute Motion Analysis- General Plane Motion

273
Visualize a drone, with its propellers spinning rapidly, hovering mid-air. The fascinating movements and operations of this drone can be comprehended by applying the principle of general plane motion.
As the drone's propellers rotate, an upward force is generated that counteracts the force of gravity, enabling the drone to lift off from the ground. This initial movement of the drone is along a straight path, representing a form of translational motion. In this phase, every point on the...
273
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

150
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
150
End Point Prediction: Gran Plot01:07

End Point Prediction: Gran Plot

590
A Gran plot is used to predict the equivalence volume or endpoint of a potentiometric or acid-base titration without reaching the endpoint. Typically, titration data is collected as a function of the titrant's volume up to a point less than the equivalence volume and then transformed into a linear format. The straight line is extended to the x-axis, indicating the necessary titrant volume to achieve the equivalence point.
For potentiometric titration, the Gran plot is created by plotting...
590
Relative Motion Analysis - Velocity01:24

Relative Motion Analysis - Velocity

436
A stroke engine has a slider-crank mechanism that converts rotational motion from the crank into linear motion of the slider or vice versa. This mechanism consists of three main parts: the crank, the connecting rod, and the slider.
When an external force is exerted, it sets the crank into a rotational movement. This, in turn, instigates the motion of the connecting rod, leading to what is referred to as a general plane motion. This process involves two key points - point A on the connecting rod...
436
Observational Learning01:12

Observational Learning

314
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
314
Associative Learning01:27

Associative Learning

579
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
579

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

H<sub>2</sub>S-mediated protein S-sulfhydration: a novel regulatory module in lipid metabolism.

Frontiers in cell and developmental biology·2026
Same author

Tension-band high-strength suture combined with absorbable screws with cortical penetration for treating Mayo type IIA olecranon fractures: finite element analysis, biomechanical testing, and clinical study.

Frontiers in bioengineering and biotechnology·2026
Same author

Targeting P2X receptor signaling for chronic visceral pain and beyond.

Neuropharmacology·2026
Same author

Therapeutic Effects of Shengdu Pingmu Formula on Loperamide-Induced Constipation in Rats via PI3K/AKT Signaling and Gut Microbiota Regulation.

Journal of visualized experiments : JoVE·2026
Same author

Diff-MomentFormer: Generative Diffusion-Augmented Transformer for End-to-End Joint Moment Estimation.

Sensors (Basel, Switzerland)·2026
Same author

Low-temperature topotactic conversion growth of porous CoFe<sub>2</sub>O<sub>4</sub> nanocubes for an advanced oxygen evolution reaction.

Chemical communications (Cambridge, England)·2026
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Sep 13, 2025

Trajectory Data Analyses for Pedestrian Space-time Activity Study
16:14

Trajectory Data Analyses for Pedestrian Space-time Activity Study

Published on: February 25, 2013

13.7K

Multi-Task Trajectory Prediction Using a Vehicle-Lane Disentangled Conditional Variational Autoencoder.

Haoyang Chen1, Na Li1, Hangguan Shan1

  • 1The College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China.

Sensors (Basel, Switzerland)
|July 30, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces MS-SLV, a new generative framework for autonomous driving trajectory prediction. It enhances multimodal prediction by integrating time-aware scene encoding and structured latent models for more accurate and contextually coherent forecasts.

Keywords:
autonomous drivingdeep learninggenerative modeltrajectory prediction

Frequently Asked Questions

More Related Videos

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research
07:15

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research

Published on: December 18, 2020

4.6K
Using a Virtual Reality Walking Simulator to Investigate Pedestrian Behavior
06:38

Using a Virtual Reality Walking Simulator to Investigate Pedestrian Behavior

Published on: June 9, 2020

5.0K

Related Experiment Videos

Last Updated: Sep 13, 2025

Trajectory Data Analyses for Pedestrian Space-time Activity Study
16:14

Trajectory Data Analyses for Pedestrian Space-time Activity Study

Published on: February 25, 2013

13.7K
Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research
07:15

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research

Published on: December 18, 2020

4.6K
Using a Virtual Reality Walking Simulator to Investigate Pedestrian Behavior
06:38

Using a Virtual Reality Walking Simulator to Investigate Pedestrian Behavior

Published on: June 9, 2020

5.0K

Area of Science:

  • Autonomous vehicle navigation and machine learning.
  • The intersection of computer vision and vehicle-lane disentangled CVAE.
  • Robotics and spatial-temporal data modeling.

Background:

Prior research has shown that integrating dynamic vehicle states with static high-definition (HD) maps is essential for modeling complex agent-scene interactions in autonomous driving. Conventional approaches typically rely on static scene encodings that fail to capture the evolving nature of spatial contexts during motion. These existing frameworks often utilize unstructured latent spaces, which restricts their capacity to generate diverse yet contextually coherent future paths. The inability to align map features with vehicle movement leads to suboptimal scene semantics. Such limitations hinder the effectiveness of multi-modal prediction systems in dense urban environments. The absence of structured latent models makes it difficult to separate the specific intentions of an agent from the physical constraints imposed by the road layout. This absence of evidence motivated a new approach to handle time-aware scene alignment and structured latent modeling.

Purpose Of The Study:

MS-SLV introduces a generative framework designed to enhance trajectory forecasting through a vehicle-lane disentangled Conditional Variational Autoencoder (CVAE). The system aims to align high-definition (HD) map features with vehicle motion to capture evolving scene semantics accurately. Researchers sought to create a structured latent model that explicitly separates agent-specific intent from scene-level constraints. This architecture incorporates an auxiliary lane prediction task to provide targeted supervision for improved scene understanding. By jointly predicting future trajectories and lane sequences, the framework enables more interpretable and scene-consistent forecasts. The study focuses on overcoming the limitations of unstructured latent variables in generative models for autonomous navigation. Engineers designed this multi-task approach to ensure that predicted paths remain within the boundaries of the high-definition (HD) map.

Main Methods:

The investigative process utilized the MS-SLV generative framework to process multimodal information for autonomous navigation. Engineers implemented a time-aware scene encoder to synchronize high-definition (HD) map features with the dynamic states of the vehicle. The experimental design featured a structured latent model to achieve disentanglement between agent intent and environmental constraints. To refine latent variable learning, the team integrated an auxiliary lane prediction task into the training pipeline. Evaluation of the proposed architecture was conducted using the comprehensive nuScenes dataset, a standard benchmark for autonomous driving research. This methodology allowed for the joint estimation of future trajectories and corresponding lane sequences. The researchers compared the performance of this vehicle-lane disentangled Conditional Variational Autoencoder (CVAE) against several state-of-the-art baselines.

Main Results:

MS-SLV achieved a 12.37% reduction in average displacement error (ADE) compared to existing state-of-the-art methods. The framework also demonstrated a 7.67% improvement in final displacement error (FDE) during extensive evaluations on the nuScenes dataset. Multi-modal prediction performance saw significant gains, with the top-5 Miss Rate (MR5) decreasing by 26%. The top-10 Miss Rate (MR10) showed an even greater reduction of 33% relative to the strongest baseline evaluated. The system successfully lowered the Off-Road Rate (ORR) by 3%, indicating superior scene consistency. These metrics confirm that the structured latent model effectively captures diverse and contextually appropriate future paths. The results highlight the effectiveness of aligning high-definition (HD) map features with temporal vehicle motion.

Conclusions:

The findings suggest that disentangling agent intent from scene constraints significantly enhances the reliability of trajectory forecasting in autonomous vehicles. This approach provides a pathway for developing more interpretable and scene-consistent navigation systems. The integration of auxiliary tasks like lane prediction offers a robust method for supervising scene understanding in generative models. Future research may build upon this time-aware scene encoding to handle even more complex agent-scene interactions. These advancements contribute to the safety and efficiency of self-driving technology by reducing displacement errors and miss rates. The study establishes MS-SLV as a high-performing framework for multi-task trajectory prediction. By addressing the gap in structured latent modeling, this work provides a foundation for more sophisticated agent-scene interaction models.

By explicitly separating agent-specific intent from scene-level constraints, the model generates diverse paths that remain contextually coherent within the high-definition (HD) map boundaries.

The study reported a 12.37% reduction in average displacement error (ADE) and a 7.67% reduction in final displacement error (FDE) when compared to other state-of-the-art methods on the nuScenes dataset.

This task provides targeted supervision for scene understanding, which improves latent variable learning and ensures that the predicted future trajectories and lane sequences are more interpretable and scene-consistent.

Existing methods often use static scene encodings that cannot capture evolving spatial contexts, whereas this encoder aligns high-definition (HD) map features with vehicle motion to capture changing scene semantics.

The authors state that MS-SLV significantly improves multi-modal forecasting, reducing the top-5 Miss Rate (MR5) and top-10 Miss Rate (MR10) by 26% and 33%, respectively, while also lowering the Off-Road Rate (ORR) by 3%.