DiMuS: Disentangled Multi-Signal Learning for Weakly Supervised Point-Based 3D Object Detection.

Wenbo Zhang, Yunzhi Zhuge, Lu Zhang

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

June 16, 2026

Summary

This summary is machine-generated.

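DiMuS is a framework for weakly supervised 3D object detection that combines several supervision signals (2D boxes, LLM-derived semantic priors, and 3D geometric alignment) to improve 3D box estimation. On the KITTI dataset it reaches 96.82% of fully supervised performance for car detection, reducing the need for expensive 3D annotations.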

Area of Science:

Computer Vision
Machine Learning
Robotics

Background:

Weakly supervised 3D object detection reduces reliance on costly 3D annotations.
Existing methods struggle with projection ambiguity and geometric inconsistency.
Common supervision signals, such as 2D projection constraints and heuristic priors, provide only limited guidance.
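
As a rough illustration of a 2D projection constraint and of the projection ambiguity mentioned above, the following minimal sketch projects a 3D box into the image and scores it against a 2D box label. It is not the method from the paper: the function names, the box convention (camera coordinates, geometric center, yaw about the vertical axis), and the intrinsic matrix `K` are all assumptions made for this example.

```python
import numpy as np

def box3d_corners(center, dims, yaw):
    """Return the 8 corners (8x3) of a 3D box in camera coordinates.

    Assumed convention (hypothetical, KITTI-like): x right, y down, z forward;
    dims = (height, width, length); center is the geometric center of the box;
    yaw rotates the box around the y axis.
    """
    h, w, l = dims
    # Corner offsets in the box frame: length along x, height along y, width along z.
    x = np.array([l, l, -l, -l, l, l, -l, -l]) / 2.0
    y = np.array([h, h, h, h, -h, -h, -h, -h]) / 2.0
    z = np.array([w, -w, -w, w, w, -w, -w, w]) / 2.0
    c, s = np.cos(yaw), np.sin(yaw)
    rot_y = np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])
    return (rot_y @ np.vstack([x, y, z])).T + np.asarray(center, dtype=float)

def project_to_2d_box(corners, K):
    """Project corners with pinhole intrinsics K; return the enclosing (x1, y1, x2, y2)."""
    uvw = (K @ corners.T).T
    uv = uvw[:, :2] / uvw[:, 2:3]  # perspective divide; assumes all corners have z > 0
    return np.array([uv[:, 0].min(), uv[:, 1].min(), uv[:, 0].max(), uv[:, 1].max()])

def iou_2d(a, b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union

def projection_loss(center, dims, yaw, K, box2d):
    """1 - IoU between the projected 3D box and the 2D box label."""
    return 1.0 - iou_2d(project_to_2d_box(box3d_corners(center, dims, yaw), K), box2d)

# Hypothetical intrinsics and a 2D label generated from a known box.
K = np.array([[700.0, 0.0, 600.0], [0.0, 700.0, 180.0], [0.0, 0.0, 1.0]])
label = project_to_2d_box(box3d_corners((0.0, 0.0, 20.0), (1.5, 1.6, 4.0), 0.0), K)
loss_true = projection_loss((0.0, 0.0, 20.0), (1.5, 1.6, 4.0), 0.0, K, label)
loss_scaled = projection_loss((0.0, 0.0, 40.0), (3.0, 3.2, 8.0), 0.0, K, label)
```

Under these assumptions, `loss_scaled` is as small as `loss_true`: a box twice as large and twice as far away projects to the same 2D box, so the 2D label alone cannot separate the two. This is one simple form of the ambiguity that additional signals are meant to resolve.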

Purpose of the Study:

To introduce DiMuS, a Disentangled Multi-Signal learning framework for enhanced 3D object detection.
To improve the accuracy of 3D box estimation (position, dimension, orientation) using complementary supervision.
To overcome limitations of existing weakly supervised methods.

Main Methods:

DiMuS integrates 2D boxes, LLM-derived semantic priors, and 3D geometric alignment.
Key components include Centerness-enhanced Projection Constraint (CPC), Semantic Prior Anchoring (SPA), and Rotation-aware Consistency Regularization (RCR).

An Adversarial Geometric Alignment (AGA) module refines boundaries using LiDAR point and box edge interactions.
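
The summary does not describe how the AGA module works internally, so the sketch below does not attempt to reproduce it. It only shows a basic geometric building block that point-to-box alignment generally relies on: testing which LiDAR points fall inside an oriented box in bird's-eye view. The function name, the 2D (x, z) layout, and the yaw convention are assumptions for illustration.

```python
import numpy as np

def points_in_box_bev(points_xz, center_xz, length, width, yaw):
    """Boolean mask of points inside an oriented rectangle in bird's-eye view.

    Assumed convention (hypothetical): points are (x, z) pairs, and yaw is the
    angle of the box's length axis measured from the x axis toward the z axis.
    """
    c, s = np.cos(yaw), np.sin(yaw)
    d = np.asarray(points_xz, dtype=float) - np.asarray(center_xz, dtype=float)
    # Express each offset in the box frame by projecting onto the box axes.
    along_length = c * d[:, 0] + s * d[:, 1]
    along_width = -s * d[:, 0] + c * d[:, 1]
    return (np.abs(along_length) <= length / 2.0) & (np.abs(along_width) <= width / 2.0)

# Two hypothetical points tested against a 4 m x 2 m box centered at (0, 10).
points = [[1.9, 10.9], [2.1, 10.0]]
mask = points_in_box_bev(points, (0.0, 10.0), 4.0, 2.0, 0.0)
inside_fraction = mask.mean()  # share of points that the box covers
```

A quantity such as `inside_fraction` gives a simple measure of how well a candidate box covers the observed points; an alignment objective would build on richer point-to-edge terms than this.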

Main Results:

DiMuS significantly outperforms previous weakly supervised methods on the KITTI dataset.
It reaches 96.82% of fully supervised performance for car detection.
It remains robust across object categories.
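
A figure such as "96.82% of fully supervised performance" is presumably a ratio of detection scores (for example average precision) between the weakly and fully supervised models. The summary does not state the underlying scores or the exact metric, so the helper below is a sketch of that arithmetic only, and the two input values are placeholders chosen solely to reproduce the ratio, not reported results.

```python
def relative_performance(weak_score: float, full_score: float) -> float:
    """Weakly supervised score expressed as a percentage of the fully supervised score."""
    if full_score <= 0:
        raise ValueError("full_score must be positive")
    return 100.0 * weak_score / full_score

# Placeholder scores (NOT from the paper), picked only to illustrate the ratio.
ratio = relative_performance(48.41, 50.0)
```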

Conclusions:

DiMuS effectively enhances distinct 3D properties (position, dimension, orientation) through disentangled learning.
The framework offers a robust and efficient alternative to fully supervised 3D object detection.
LLM-derived priors and geometric alignment contribute to superior performance in weakly supervised settings.