



ESCAN: Efficient GPU sharing for cascade neural network inference.

Jianan Wang¹, Yang Shi¹, Zhaoyun Chen¹

  • ¹ National University of Defense Technology, Deya Road, Changsha, 410000, Hunan, China; Key Laboratory of Advanced Microprocessor Chips and Systems, Deya Road, Changsha, 410000, Hunan, China.

Neural Networks: the Official Journal of the International Neural Network Society | June 20, 2025
Summary
This summary is machine-generated.

ESCAN is a GPU-sharing framework for cascade neural networks that improves inference efficiency. It balances the gains from sharing a device against the computation wasted when samples exit early, which benefits low-latency inference services.

Keywords:
Cascade neural network; GPU sharing; Inference services; MPS; Resource allocation


Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Machine Learning

Background:

  • Cascading models balance efficiency and accuracy in industrial deployments.
  • Low-latency services demand optimized execution efficiency and device utilization.
  • Existing GPU sharing through NVIDIA's Multi-Process Service (MPS) struggles with cascade models because of their early exits and execution order.
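
To make the early-exit behavior concrete, here is a minimal sketch in plain Python of a two-stage cascade. It is only an illustration, not code from the paper: the names `cascade_predict`, `exit_ratio`, `small_model`, and `large_model`, the confidence threshold, and the toy stage functions are all assumptions. A request leaves the cascade at the first stage whose confidence meets the threshold, and the exit ratio is the share of requests that never reach the second stage.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical stage type: maps an input to (label, confidence in [0, 1]).
Stage = Callable[[float], Tuple[str, float]]


def cascade_predict(x: float, stages: List[Stage], threshold: float) -> Tuple[str, int]:
    """Run the stages in order and stop at the first confident one.

    Returns the predicted label and the index of the stage that produced it.
    Assumes `stages` is non-empty; the last stage always answers.
    """
    label = ""
    for index, stage in enumerate(stages):
        label, confidence = stage(x)
        if confidence >= threshold:
            return label, index  # early exit: later stages are skipped
    return label, len(stages) - 1


def exit_ratio(inputs: Sequence[float], stages: List[Stage], threshold: float) -> float:
    """Fraction of inputs answered by the first stage alone."""
    exits = sum(1 for x in inputs if cascade_predict(x, stages, threshold)[1] == 0)
    return exits / len(inputs)


def small_model(x: float) -> Tuple[str, float]:
    # Toy cheap stage: confident only far from the decision boundary at 0.5.
    return ("pos" if x >= 0.5 else "neg", abs(x - 0.5) * 2)


def large_model(x: float) -> Tuple[str, float]:
    # Toy expensive stage: always fully confident.
    return ("pos" if x >= 0.5 else "neg", 1.0)


if __name__ == "__main__":
    stages = [small_model, large_model]
    data = [0.0, 0.2, 0.45, 0.55, 0.8, 1.0]
    print(cascade_predict(0.45, stages, threshold=0.5))
    print(exit_ratio(data, stages, threshold=0.5))
```

Because the number of requests that reach the second stage depends on the data, the load on each stage is not known in advance, which is one reason a static sharing scheme can fit cascade models poorly.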

Purpose of the Study:

  • To address the challenges of applying GPU sharing to cascade neural networks.
  • To propose a framework that optimizes inference efficiency for cascade models.
  • To enhance device utilization and reduce latency in cloud-based inference services.

Main Methods:

  • Analyzed cascade neural network characteristics and device-sharing techniques.
  • Developed ESCAN, a GPU-sharing optimization framework for online inference.
  • Integrated exit-ratio-aware batch-parallel execution and resource allocation algorithms in PyTorch.
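
The idea of exit-ratio-aware resource allocation can be sketched with a toy cost model. This is not ESCAN's algorithm, which the summary does not describe in detail. The sketch assumes two stages that share one GPU, that a stage given a fraction f of the device runs in time t / f (idealized linear scaling), and that only a share (1 - exit_ratio) of requests reaches the second stage. The function names and the grid search are hypothetical.

```python
def expected_latency(f1: float, t1: float, t2: float, exit_ratio: float) -> float:
    """Expected per-request latency when stage 1 gets GPU fraction f1.

    Assumption: stage time scales as t / fraction, and stage 2 runs only
    for the (1 - exit_ratio) share of requests that did not exit early.
    """
    f2 = 1.0 - f1
    return t1 / f1 + (1.0 - exit_ratio) * t2 / f2


def best_split(t1: float, t2: float, exit_ratio: float, steps: int = 99) -> float:
    """Grid-search the stage-1 GPU fraction that minimizes expected latency."""
    candidates = [k / (steps + 1) for k in range(1, steps + 1)]
    return min(candidates, key=lambda f1: expected_latency(f1, t1, t2, exit_ratio))


if __name__ == "__main__":
    # Hypothetical full-device stage times: stage 2 is four times as costly.
    for ratio in (0.0, 0.5, 0.75):
        f1 = best_split(t1=1.0, t2=4.0, exit_ratio=ratio)
        print(ratio, f1, expected_latency(f1, 1.0, 4.0, ratio))
```

Under these assumptions, a higher exit ratio shifts the best split toward the first stage, since fewer requests ever need the second one. A real allocator would measure stage times on the device rather than assume linear scaling.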

Main Results:

  • ESCAN improves inference efficiency by an average of 19.53% compared to parallel execution.
  • Significantly enhances the efficiency of searching for computation resource allocation schemes.
  • Optimizes computational resource utilization through effective GPU-sharing.

Conclusions:

  • ESCAN provides an effective solution for GPU-sharing optimization in cascade neural networks.
  • Achieves a balance between device-sharing gains and early-exit computation wastage.
  • Delivers a low-latency, high-precision optimization for interactive online services using cascade models.
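
The balance between sharing gains and early-exit waste can be illustrated with a small back-of-the-envelope model. It is a sketch under stated assumptions, not the paper's cost model: both stages are assumed to start together on a shared GPU, sharing slows each stage by a hypothetical factor `slowdown`, and second-stage work is discarded for every request that exits at the first stage. All names and numbers are illustrative.

```python
def sequential_latency(t1: float, t2: float, exit_ratio: float) -> float:
    """Expected latency when stage 2 runs only after stage 1 fails to exit."""
    return t1 + (1.0 - exit_ratio) * t2


def parallel_latency(t1: float, t2: float, exit_ratio: float, slowdown: float) -> float:
    """Expected latency when both stages start together on a shared GPU.

    Exiting requests finish with stage 1; the rest wait for the slower stage.
    """
    exited = slowdown * t1
    not_exited = slowdown * max(t1, t2)
    return exit_ratio * exited + (1.0 - exit_ratio) * not_exited


def wasted_fraction(t1: float, t2: float, exit_ratio: float) -> float:
    """Share of total compute spent on stage-2 results that are thrown away."""
    return exit_ratio * t2 / (t1 + t2)


if __name__ == "__main__":
    # Hypothetical stage times and slowdown factors.
    for ratio, slowdown in ((0.5, 1.1), (0.9, 1.5)):
        print(
            ratio,
            sequential_latency(1.0, 4.0, ratio),
            parallel_latency(1.0, 4.0, ratio, slowdown),
            wasted_fraction(1.0, 4.0, ratio),
        )
```

In this toy model, running the stages in parallel helps when few requests exit early and the sharing slowdown is small, and it hurts when most requests exit, because most second-stage work is then wasted.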