Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Parallel Processing01:20

Parallel Processing

150
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
150
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

631
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
631
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

106
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
106
Visual System01:26

Visual System

574
Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...
574
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

4.1K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
4.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Antiosteoporotic activity of echinacoside in ovariectomized rats.

Phytomedicine : international journal of phytotherapy and phytopharmacology·2013
Same author

Low molecular weight fucoidan against renal ischemia-reperfusion injury via inhibition of the MAPK signaling pathway.

PloS one·2013
Same author

Promoting the activity of catalysts for the oxidation of water with bridged dinuclear ruthenium complexes.

Angewandte Chemie (International ed. in English)·2013
Same author

Tunable green oxygen barrier through layer-by-layer self-assembly of chitosan and cellulose nanocrystals.

Carbohydrate polymers·2013
Same author

Gastrointestinal effects of resistant starch, soluble maize fibre and pullulan in healthy adults.

The British journal of nutrition·2013
Same author

GeneExpressionSignature: an R package for discovering functional connections using gene expression signatures.

Omics : a journal of integrative biology·2013
Same journal

Correction: A method for supervoxel-wise association studies of age and other non-imaging variables from coronary computed tomography angiograms.

Scientific reports·2026
Same journal

Poly(bromophenol blue)/CoSn(OH)<sub>6</sub> cubic particles modified pencil graphite electrode for electrochemical determination of diphenhydramine.

Scientific reports·2026
Same journal

Dietary Chlorella, Spirulina, and acidifier modulate jejunal cytokine-related gene expression in broiler chickens.

Scientific reports·2026
Same journal

Perceived physical activity barriers in university students: associations with fatigue and eating behaviours.

Scientific reports·2026
Same journal

Refuge limitation structures habitat use in agricultural landscapes: evidence from Sunda pangolins.

Scientific reports·2026
Same journal

Lightweight stateless transaction verification with outsourced witness updates for UTXO blockchains.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jun 26, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

522

Multi-scale coupled attention for visual object detection.

Fei Li1, Hongping Yan2, Linsu Shi3

  • 1China Tower Corporation Limited, No.9 Dongran North Street, Beijing, 100195, China. lifei123457@chinatowercom.cn.

Scientific Reports
|May 16, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces the Multi-Scale Coupled Attention (MSCA) network for improved object detection. MSCA enhances deep neural networks by effectively processing multi-scale objects in complex scenes, boosting overall performance.

Keywords:
Attention mechanismDeep neural networksObject detectionSelf-attention learningTransformerYOLO

More Related Videos

Methods to Test Visual Attention Online
09:44

Methods to Test Visual Attention Online

Published on: February 19, 2015

11.9K
A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
12:39

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

7.6K

Related Experiment Videos

Last Updated: Jun 26, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

522
Methods to Test Visual Attention Online
09:44

Methods to Test Visual Attention Online

Published on: February 19, 2015

11.9K
A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
12:39

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

7.6K

Area of Science:

  • Computer Vision
  • Deep Learning
  • Artificial Intelligence

Background:

  • Deep neural networks have shown great success in object detection.
  • Evolving network structures is crucial for enhancing performance, especially in complex scenes with multi-scale objects.

Purpose of the Study:

  • To propose a novel network structure, Multi-Scale Coupled Attention (MSCA), for improved object detection.
  • To address the challenge of detecting multi-scale objects in complex environments.

Main Methods:

  • The proposed MSCA network integrates Multi-Scale Coupled Channel Attention (MSCCA) and Multi-Scale Coupled Spatial Attention (MSCSA) modules.
  • MSCCA performs linear self-attention learning on multi-scale channels.
  • MSCSA achieves nonlinear self-attention learning on multi-scale spatial grids.

Main Results:

  • The MSCA network was evaluated on two public datasets against 13 state-of-the-art models.
  • Experimental results, including ablation studies and performance analysis, demonstrated the effectiveness of the MSCA model.
  • The proposed network achieved superior performance in object detection tasks.

Conclusions:

  • The MSCA network offers an effective approach for enhancing object detection capabilities.
  • The modular design allows it to be integrated as a plugin for end-to-end learning models.
  • MSCA shows significant potential for applications requiring high-performance object detection.