Deep M²CDL: Deep Multi-Scale Multi-Modal Convolutional Dictionary Learning Network.

Xin Deng, Jingyi Xu, Fangyuan Gao

IEEE Transactions on Pattern Analysis and Machine Intelligence

November 20, 2023

Summary

This summary is machine-generated.

This study introduces Deep M²CDL, a multi-scale, multi-modal convolutional dictionary learning model for interpretable image processing. It enhances representation ability for multi-modal image restoration and fusion tasks.

Area of Science:

Computer Vision
Machine Learning
Signal Processing

Background:

Network interpretability is crucial for multi-modal image processing due to complex cross-modal dependencies.
Existing multi-modal dictionary learning models are limited by single-layer and single-scale architectures, restricting their representational power.

Purpose of the Study:

To introduce a multi-scale, multi-modal convolutional dictionary learning (M²CDL) model for enhanced representation in image processing.
To propose a unified Deep M²CDL framework for multi-modal image restoration (MIR) and multi-modal image fusion (MIF) tasks.
To ensure network interpretability by aligning the Deep M²CDL architecture with its optimization steps.
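
For orientation, the generic single-modality convolutional dictionary learning objective can be written as below. This is the standard textbook form, not the M²CDL formulation itself, which this summary does not give. All symbols are assumptions introduced here: x is an image, d_k are K convolutional dictionary filters, z_k are the corresponding sparse feature maps, * denotes convolution, and λ weights the sparsity penalty.

```latex
\min_{\{d_k\},\,\{z_k\}} \;
  \frac{1}{2}\Big\| x - \sum_{k=1}^{K} d_k * z_k \Big\|_2^2
  + \lambda \sum_{k=1}^{K} \| z_k \|_1
```

A multi-scale, multi-modal model would extend this with additional dictionaries and feature maps per modality and per scale; how M²CDL does so is not stated here.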

Main Methods:

Developed a multi-layer M²CDL model for coarse-to-fine association of different image modalities.
Created a unified Deep M²CDL framework by unfolding the M²CDL model, ensuring interpretable network modules.
Learned dictionary and sparse feature priors directly through the network, avoiding handcrafted priors.
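
The idea of unfolding an optimization into network layers can be illustrated with a minimal sketch. It assumes 1-D signals, a single modality, and plain ISTA (gradient step followed by soft-thresholding) as the optimizer; it is not the Deep M²CDL architecture. The names `soft_threshold`, `synthesize`, `objective`, and `ista_layer` are hypothetical, and the filters, step size, and threshold are fixed here, whereas an unfolded network would typically learn them per layer.

```python
import numpy as np

def soft_threshold(v, theta):
    """Proximal operator of theta * ||.||_1 (element-wise shrinkage)."""
    return np.sign(v) * np.maximum(np.abs(v) - theta, 0.0)

def synthesize(filters, codes):
    """Reconstruct a signal as sum_k d_k * z_k using full 1-D convolution."""
    return sum(np.convolve(z, d, mode="full") for d, z in zip(filters, codes))

def objective(x, filters, codes, theta):
    """0.5 * ||x - sum_k d_k * z_k||^2 + theta * sum_k ||z_k||_1."""
    data = 0.5 * np.sum((synthesize(filters, codes) - x) ** 2)
    return data + theta * sum(np.abs(z).sum() for z in codes)

def ista_layer(x, filters, codes, step, theta):
    """One unfolded iteration: gradient step on the data term, then shrinkage."""
    residual = synthesize(filters, codes) - x
    new_codes = []
    for d, z in zip(filters, codes):
        # The adjoint of full convolution with d is valid-mode correlation with d.
        grad = np.correlate(residual, d, mode="valid")
        new_codes.append(soft_threshold(z - step * grad, step * theta))
    return new_codes

# Toy data (assumed for illustration): 3 random filters and sparse ground-truth codes.
rng = np.random.default_rng(0)
filters = [rng.standard_normal(5) for _ in range(3)]
true_codes = [np.zeros(32) for _ in range(3)]
true_codes[0][4], true_codes[1][10], true_codes[2][20] = 1.0, -2.0, 1.5
x = synthesize(filters, true_codes)

theta = 0.05
# Conservative step: 1 / sum_k ||d_k||_1^2 upper-bounds the Lipschitz constant.
step = 1.0 / sum(np.abs(d).sum() ** 2 for d in filters)

codes = [np.zeros(32) for _ in range(3)]
losses = []
for _ in range(50):  # 50 stacked "layers" sharing the same parameters
    codes = ista_layer(x, filters, codes, step, theta)
    losses.append(objective(x, filters, codes, theta))
```

Each call to `ista_layer` corresponds to one network layer, which is why every module of an unfolded network maps to a named optimization step. With the conservative step size used here, the objective does not increase from one layer to the next.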

Main Results:

The Deep M²CDL model demonstrated superior performance on various MIR and MIF tasks compared to state-of-the-art methods.
Quantitative and qualitative evaluations confirmed the effectiveness of the proposed model.
Visualizations of learned multi-modal sparse features and dictionary filters validated the network's interpretability.

Conclusions:

The proposed Deep M²CDL framework offers an interpretable and effective solution for multi-modal image processing tasks.
The multi-layer, multi-scale approach significantly improves representation ability.
Learned priors contribute to better performance and interpretability in MIR and MIF.