An automated approach for real-time informative frames classification in laryngeal endoscopy using deep learning
View abstract on PubMed
Summary
This summary is machine-generated.Artificial intelligence (AI) can automatically select informative frames from laryngoscopy videos. This AI system shows high accuracy and real-time capabilities, aiding in diagnosis and data management.
Area Of Science
- Otolaryngology
- Medical Imaging
- Artificial Intelligence
Background
- Laryngoscopy generates large video datasets, posing challenges for manual review and data extraction.
- Automated informative image selection can improve data management and diagnostic accuracy in laryngoscopy.
Purpose Of The Study
- To demonstrate the feasibility of an AI system for automatic, real-time selection of informative frames during laryngoscopy.
- To provide visual feedback to otolaryngologists during examinations.
Main Methods
- Deep learning models, including ResNet-50, were trained and tested on internal and external datasets of laryngoscopy images (white light and narrow band).
- The best-performing model's real-time performance was assessed using four testing videos.
Main Results
- The ResNet-50 model achieved high precision (95% internal, 97% external) and F1-scores (96% internal, 93% external).
- The model demonstrated excellent performance in identifying diagnostically relevant frames.
Conclusions
- The AI model shows excellent performance for identifying key frames in laryngoscopic videos.
- The system's accuracy and real-time capabilities make it promising for clinical use in quality control and AI-assisted tumor detection.

