Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Revolutionizing Community-Based Cervical Cancer Screening: Evaluating an Indigenous Artificial Intelligence-Enabled Portable Colposcope.

Journal of obstetrics and gynaecology of India·2026
Same author

A novel real-world data methodology for lymphoma outcome classification: the real-world Lugano study.

Journal of comparative effectiveness research·2026
Same author

Effectiveness of Cognitive Behavioral Therapy in Managing Physical and Psychological Distress in Women during the Perimenopausal Phase.

Journal of mid-life health·2025
Same author

Documentation of individual-level social determinants of health among adults diagnosed with cancer in the United States.

Scientific reports·2025
Same author

Friends Across Time: Multi-Scale Action Segmentation Transformer for Surgical Phase Recognition.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Clinical risk factors and blood protein biomarkers of 10-year pneumonia risk.

PloS one·2024
Same journal

Contactless robotic system for linear catheter advancement using magnetic actuation.

International journal of computer assisted radiology and surgery·2026
Same journal

Sound source localization for spatial mapping of surgical actions in dynamic scenes.

International journal of computer assisted radiology and surgery·2026
Same journal

ESD-VesNet: uncertainty-aware vessel segmentation network for endoscopic submucosal dissection with hard negative mining.

International journal of computer assisted radiology and surgery·2026
Same journal

Lean Unet: a compact model for image segmentation.

International journal of computer assisted radiology and surgery·2026
Same journal

Strain alignment: toward assessing mechanical plausibility of predicted displacement fields.

International journal of computer assisted radiology and surgery·2026
Same journal

Vascular geometry characterization for AI-based endovascular navigation.

International journal of computer assisted radiology and surgery·2026
See all related articles

Related Experiment Video

Updated: Aug 16, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

480

Surgical workflow recognition with temporal convolution and transformer for action segmentation.

Bokai Zhang1, Bharti Goel2, Mohammad Hasan Sarhan3

  • 1Johnson & Johnson MedTech, 1100 Olive Way, Suite 1100, Seattle, 98101, WA, USA. bzhang29@its.jnj.com.

International Journal of Computer Assisted Radiology and Surgery
|December 21, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces a novel Action Segmentation Temporal Convolutional Transformer (ASTCFormer) for surgical workflow recognition. The ASTCFormer method significantly improves accuracy by better capturing temporal context in surgical videos.

Keywords:
Action segmentationSurgical workflow recognitionTemporal convolutionalTransformer

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.9K
Author Spotlight: Revolutionizing Remote Surgery with Augmented Reality and Robotics for Enhanced Precision and Accessibility
07:46

Author Spotlight: Revolutionizing Remote Surgery with Augmented Reality and Robotics for Enhanced Precision and Accessibility

Published on: August 9, 2024

822

Related Experiment Videos

Last Updated: Aug 16, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

480
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.9K
Author Spotlight: Revolutionizing Remote Surgery with Augmented Reality and Robotics for Enhanced Precision and Accessibility
07:46

Author Spotlight: Revolutionizing Remote Surgery with Augmented Reality and Robotics for Enhanced Precision and Accessibility

Published on: August 9, 2024

822

Area of Science:

  • Computer Vision
  • Surgical Workflow Recognition
  • Machine Learning in Healthcare

Background:

  • Automatic surgical workflow recognition enhances surgeon training and enables context-aware systems for improved planning and decision-making.
  • Utilizing temporal information is critical for context recognition in surgical procedures.
  • Recurrent neural networks and transformers are current approaches for action recognition in surgical videos.

Purpose of the Study:

  • To design and implement a novel two-stage method for surgical workflow recognition.
  • To develop the Action Segmentation Temporal Convolutional Transformer (ASTCFormer) network for enhanced temporal awareness.
  • To improve the accuracy and effectiveness of surgical workflow recognition systems.

Main Methods:

  • A two-stage approach was employed, starting with R(2+1)D for video clip modeling.
  • The Action Segmentation Temporal Convolutional Transformer (ASTCFormer) network was proposed for full video modeling in the second stage.
  • ASTCFormer integrates action segmentation transformers (ASFormers) with temporal convolutional networks (TCNs) for temporally aware recognition.

Main Results:

  • The ASTCFormer method demonstrated superior performance compared to recurrent neural networks, multi-stage TCN, and ASFormer.
  • A relative improvement of [Formula: see text] in average segmental F1-score was achieved over the state-of-the-art ASFormer.
  • The proposed method attained state-of-the-art results on the Cholec80 dataset, comprising 207 surgical videos.

Conclusions:

  • Integrating temporal convolutional network (TCN) information into the ASFormer paradigm enhances the capture of temporal context.
  • This integration leads to significant improvements in surgical workflow recognition accuracy.
  • The ASTCFormer method represents aadvancement in automated surgical video analysis.