Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Force Classification01:22

Force Classification

1.7K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
1.7K
Parallel Processing01:20

Parallel Processing

252
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
252
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

152
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
152

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Explainable Lightweight AI for the Identification of Right-Sided Cardiac Dysfunction in a Saudi Arabian Diabetic Cohort.

Journal of clinical medicine·2026
Same author

Design Guidelines for Anode Modifications to Eliminate Soft Short Circuits in Anode-Free Na Batteries Under High Current Density With Large-Capacity Plating, Demonstrated via Initiated Chemical Vapor Deposition (iCVD).

Angewandte Chemie (International ed. in English)·2026
Same author

Advancing Arabic automated essay scoring through cross-encoder BERT models and interpretable explanations.

Scientific reports·2026
Same author

Multimodal healthcare system for human activity recognition using multiple features and advanced ensemble classifier.

Digital health·2026
Same author

Aging in South Korea: the first penguin of the fastest aging nations.

The Gerontologist·2026
Same author

Toward intelligent rehabilitation: Multimodal human pose modeling with parametric meshes and graph-based temporal reasoning.

Digital health·2026
Same journal

DARUMA: a gateway to fast and easy prediction of intrinsically disordered regions.

PeerJ. Computer science·2026
Same journal

Alzheimer's disease detection using a quantum deep neural network with Haralick feature extraction and simulated annealing optimization.

PeerJ. Computer science·2026
Same journal

Network anomaly detection using Deep Autoencoder and parallel Artificial Bee Colony algorithm-trained neural network.

PeerJ. Computer science·2026
Same journal

An anomaly detection model for multivariate time series with anomaly perception.

PeerJ. Computer science·2026
Same journal

Retraction: A wormhole attack detection method for tactical wireless sensor networks.

PeerJ. Computer science·2026
Same journal

Evaluation of mental disorder with prioritization of its type by utilizing the bipolar complex fuzzy decision-making approach based on Schweizer-Sklar prioritized aggregation operators.

PeerJ. Computer science·2026
See all related articles

Related Experiment Video

Updated: Sep 18, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

650

Multimodal scene recognition using semantic segmentation and deep learning integration.

Aysha Naseer1, Mohammed Alnusayri2, Haifa F Alhasson3

  • 1Department of Computer Science, Air University, Islamabad, Pakistan.

Peerj. Computer Science
|June 26, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a multimodal deep learning approach combining red-green-blue (RGB) and depth data for improved indoor scene recognition. The novel method achieves high accuracy, enhancing scene understanding for applications in robotics and security.

Keywords:
Artificial intelligenceFeatures optimizationImage analysisMachine learningScene modelingSpatial pyramid poolingVoxel grid representation

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.0K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K

Related Experiment Videos

Last Updated: Sep 18, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

650
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.0K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • Indoor scene recognition is challenging due to complex scene compositions and the gap between high-level interpretation and low-level visual features.
  • Existing methods struggle with the intricate details and semantic understanding of generic indoor environments.
  • The need for robust and accurate scene recognition is critical for various applications.

Purpose of the Study:

  • To develop a novel multimodal deep learning technique for enhanced indoor scene recognition.
  • To overcome the limitations of traditional methods by integrating depth information with RGB data.
  • To improve the accuracy and robustness of semantic modeling and scene classification.

Main Methods:

  • A depth-aware segmentation methodology was employed to identify objects within images.
  • Convolutional Neural Networks (CNNs) and Spatial Pyramid Pooling (SPP) were utilized for feature analysis.
  • A multimodal approach combining red-green-blue (RGB) and depth image data was implemented.

Main Results:

  • The proposed method achieved 91.73% accuracy on the RGB-D scene dataset.
  • The technique demonstrated 90.53% accuracy on the NYU Depth v2 dataset.
  • Experimental findings validate the effectiveness of the multimodal approach in improving scene recognition.

Conclusions:

  • The multimodal deep learning technique significantly enhances indoor scene recognition accuracy and robustness.
  • Integrating depth information with RGB data provides a more comprehensive understanding of indoor scenes.
  • This approach has potential applications in robotics, sports analysis, and security systems.