Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

108
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
108
Deconvolution01:20

Deconvolution

163
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
163
Associative Learning01:27

Associative Learning

407
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
407
Convolution Properties II01:17

Convolution Properties II

209
The important convolution properties include width, area, differentiation, and integration properties.
The width property indicates that if the durations of input signals are T1 and T2, then the width of the output response equals the sum of both durations, irrespective of the shapes of the two functions. For instance, convolving two rectangular pulses with durations of 2 seconds and 1 second results in a function with a width of 3 seconds.
The area property asserts that the area under the...
209
Convolution Properties I01:20

Convolution Properties I

153
Convolution computations can be simplified by utilizing their inherent properties.
The commutative property reveals that the input and the impulse response of an LTI (Linear Time-Invariant) system can be interchanged without affecting the output:
153
Convolution: Math, Graphics, and Discrete Signals01:24

Convolution: Math, Graphics, and Discrete Signals

266
In any LTI (Linear Time-Invariant) system, the convolution of two signals is denoted using a convolution operator, assuming all initial conditions are zero. The convolution integral can be divided into two parts: the zero-input or natural response and the zero-state or forced response, with t0 indicating the initial time.
To simplify the convolution integral, it is assumed that both the input signal and impulse response are zero for negative time values. The graphical convolution process...
266

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Variations in GHG fluxes in small- and medium-sized water bodies in different climate zones.

Journal of environmental management·2026
Same author

Author Correction: Complete defluorination of PFASs via photocatalytic reduction in water.

Nature communications·2026
Same author

Lysosome-derived methylated arginine is a signalling metabolite controlling the lipidome.

Nature cell biology·2026
Same author

Comprehensive multi-omics analysis of Mengding bud yellow tea in the intangible cultural heritage: insights into taste formation.

NPJ science of food·2026
Same author

Hybrid-integrated dual-wavelength semiconductor laser with 100 GHz stable frequency spacing.

Optics express·2026
Same author

Dietary fat alters goblet cell function and microbial bile acid metabolism to promote intestinal lipid absorption in mice.

Nature microbiology·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jul 10, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

559

Deep M2CDL: Deep Multi-Scale Multi-Modal Convolutional Dictionary Learning Network.

Xin Deng, Jingyi Xu, Fangyuan Gao

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |November 20, 2023
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces Deep M²CDL, a multi-scale, multi-modal convolutional dictionary learning model for interpretable image processing. It enhances representation ability for multi-modal image restoration and fusion tasks.

    More Related Videos

    Deep Neural Networks for Image-Based Dietary Assessment
    13:19

    Deep Neural Networks for Image-Based Dietary Assessment

    Published on: March 13, 2021

    9.2K
    Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
    08:20

    Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

    Published on: October 27, 2023

    1.5K

    Related Experiment Videos

    Last Updated: Jul 10, 2025

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    559
    Deep Neural Networks for Image-Based Dietary Assessment
    13:19

    Deep Neural Networks for Image-Based Dietary Assessment

    Published on: March 13, 2021

    9.2K
    Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
    08:20

    Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

    Published on: October 27, 2023

    1.5K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Signal Processing

    Background:

    • Network interpretability is crucial for multi-modal image processing due to complex cross-modal dependencies.
    • Existing multi-modal dictionary learning models are limited by single-layer and single-scale architectures, restricting their representational power.

    Purpose of the Study:

    • To introduce a multi-scale, multi-modal convolutional dictionary learning (M²CDL) model for enhanced representation in image processing.
    • To propose a unified Deep M²CDL framework for multi-modal image restoration (MIR) and multi-modal image fusion (MIF) tasks.
    • To ensure network interpretability by aligning the Deep M²CDL architecture with its optimization steps.

    Main Methods:

    • Developed a multi-layer M²CDL model for coarse-to-fine association of different image modalities.
    • Created a unified Deep M²CDL framework by unfolding the M²CDL model, ensuring interpretable network modules.
    • Learned dictionary and sparse feature priors directly through the network, avoiding handcrafted priors.

    Main Results:

    • The Deep M²CDL model demonstrated superior performance on various MIR and MIF tasks compared to state-of-the-art methods.
    • Quantitative and qualitative evaluations confirmed the effectiveness of the proposed model.
    • Visualizations of learned multi-modal sparse features and dictionary filters validated the network's interpretability.

    Conclusions:

    • The proposed Deep M²CDL framework offers an interpretable and effective solution for multi-modal image processing tasks.
    • The multi-layer, multi-scale approach significantly improves representation ability.
    • Learned priors contribute to better performance and interpretability in MIR and MIF.