Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Multi-task recurrent convolutional network with correlation loss for surgical video analysis.

Yueming Jin1, Huaxia Li1, Qi Dou1

  • 1Department of Computer Science and Engineering, The Chinese University of Hong Kong, China.

Medical Image Analysis
|October 23, 2019
PubMed
Summary

Related Concept Videos

Neural Circuits01:25

Neural Circuits

2.5K
Neural circuits and neuronal pools are two of the main structures found in the nervous system. Neural circuits are networks of neurons that work together to carry out a specific task or process. They consist of interconnected neurons and glial cells, which provide structural and metabolic support.
Neuronal pools are collections of nerve cells with similar functions and interact through chemical and electrical signals. These pools include both interneurons (the central neural circuit nodes that...
2.5K
Reducing Line Loss01:18

Reducing Line Loss

334
In a three-phase circuit, line loss is an indicator of energy dissipated as heat due to the resistance of transmission lines. To address this, incorporating transformers into the system—a step-up transformer at the source and a step-down transformer at the load—is a strategic solution. Two three-phase transformers are introduced to improve this.
With a step-up transformer at the source, the voltage is increased, thereby reducing the current in the transmission lines since power loss in...
334

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A retrieval-augmented framework enabling VLM spatial awareness for object-centric robot manipulation.

Science robotics·2026
Same author

HandBooster+: Boosting 3D Hand-Mesh Reconstruction From Data Synthesis to Progressive Multi-Hypothesis Aggregation.

IEEE transactions on pattern analysis and machine intelligence·2025
Same author

Visualization-Driven Illumination for Density Plots.

IEEE transactions on visualization and computer graphics·2024
Same author

Video Instance Shadow Detection Under the Sun and Sky.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2024
Same author

DreamStone: Image as a Stepping Stone for Text-Guided 3D Shape Generation.

IEEE transactions on pattern analysis and machine intelligence·2023
Same author

Kine-Appendage: Enhancing Freehand VR Interaction Through Transformations of Virtual Appendages.

IEEE transactions on visualization and computer graphics·2023
Same journal

Generative morphodynamic forecasting enables robust zero-shot volumetric medical segmentation.

Medical image analysis·2026
Same journal

ContiMorph: An unsupervised learning framework for cardiac motion tracking with time-continuous diffeomorphism.

Medical image analysis·2026
Same journal

MedP-CLIP: Medical CLIP with region-aware prompt integration.

Medical image analysis·2026
Same journal

Multi-organ guided diagnosis of mild cognitive impairment via hierarchical alignment and knowledge distillation.

Medical image analysis·2026
Same journal

SUDA: Simultaneous unsupervised knowledge distillation and adaptation of foundation models for efficient pathological image analysis.

Medical image analysis·2026
Same journal

Beyond the LUMIR challenge: The pathway to foundational registration models.

Medical image analysis·2026
See all related articles
This summary is machine-generated.

This study introduces a new multi-task deep learning model (MTRCNet-CL) that simultaneously detects surgical tools and recognizes surgical phases in videos. By leveraging the relationship between these tasks, the model significantly improves accuracy in both surgical tool presence detection and phase recognition.

Area of Science:

  • Computer Vision
  • Medical Image Analysis
  • Artificial Intelligence in Surgery

Background:

  • Surgical tool presence detection and surgical phase recognition are critical for operating room applications but are often addressed independently.
  • Existing methods fail to capitalize on the inherent correlation between tool presence and surgical phases, limiting performance.
  • Accurate surgical video analysis requires robust methods for understanding both instrument status and procedural progression.

Purpose of the Study:

  • To develop a novel multi-task learning framework that jointly addresses surgical tool presence detection and surgical phase recognition.
  • To exploit the correlation between these two tasks to enhance the performance of each individual task.
  • To introduce a new correlation loss function to effectively model the relationship between tool presence and phase identification.
Keywords:
Correlation lossDeep learningMulti-task learningSurgical video analysis

Related Experiment Videos

Main Methods:

  • A multi-task recurrent convolutional network with correlation loss (MTRCNet-CL) was designed with shared feature encoders and task-specific layers.
  • Long-short term memory (LSTM) was incorporated into the phase recognition branch to capture temporal dependencies.
  • A novel correlation loss was implemented to minimize prediction divergence between the tool presence and phase recognition branches.

Main Results:

  • The MTRCNet-CL model achieved superior performance on the Cholec80 dataset compared to state-of-the-art methods.
  • Significant improvements were observed in mean Average Precision (mAP) for tool presence detection (89.1% vs. 81.0%).
  • The F1 score for surgical phase recognition was also substantially enhanced (87.4% vs. 84.5%).

Conclusions:

  • Simultaneously learning correlated tasks like tool presence and phase recognition leads to mutual performance benefits.
  • The proposed MTRCNet-CL effectively leverages feature sharing and prediction correlation for improved surgical video analysis.
  • This approach offers a promising direction for advancing intelligent surgical systems and real-time operating room assistance.