An adaptive and lightweight YOLOv5 detection model for lung tumor in PET/CT images
- Tao Zhou 1,2, Xinyu Ye 3,4, Huiling Lu 5, Yujie Guo 1,2, Hongxia Wang 1,2, Yang Liu 1,2
- Tao Zhou 1,2, Xinyu Ye 3,4, Huiling Lu 5
- 1School of Computer Science and Engineering, North Minzu University, Yinchuan, 750021, China.
- 2Key Laboratory of Image and Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan, 750021, China.
- 3School of Computer Science and Engineering, North Minzu University, Yinchuan, 750021, China. yexinyubmd@163.com.
- 4Key Laboratory of Image and Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan, 750021, China. yexinyubmd@163.com.
- 5School of Medical Information and Engineering, Ningxia Medical University, Yinchuan, 750004, China.
- 0School of Computer Science and Engineering, North Minzu University, Yinchuan, 750021, China.
Related Experiment Videos
Contact us if these videos are not relevant.
Contact us if these videos are not relevant.
View abstract on PubMed
Summary
This summary is machine-generated.A novel Cross Modal YOLOv5 (CMYOLOv5) model enhances lung tumor detection by fusing multi-modal PET and CT images. This approach significantly improves the accuracy and feature representation of irregular tumors compared to existing methods.
Area Of Science
- Medical Imaging
- Artificial Intelligence
- Oncology
Background
- Multi-modal medical images are crucial for accurate tumor lesion detection.
- Current models often rely on single imaging modalities, limiting the comprehensive understanding of lesion characteristics like shape, size, and contrast.
- There's a need for models that can effectively integrate semantic information from multiple sources.
Purpose Of The Study
- To propose a Cross Modal YOLOv5 (CMYOLOv5) model for improved multi-modal lung tumor detection.
- To enhance the fusion of complementary information from PET and CT scans.
- To improve the feature representation capabilities for irregular lung tumors.
Main Methods
- Developed a dual-branch auxiliary network for PET and CT semantic extraction.
- Implemented Cross-modal Features Fusion (CFF) and Self-Adaptive Attention Fusion (AAF) for integrating multi-modal data.
- Utilized a Self-Adaptive Transformer (SAT) with deformable attention in the feature enhancement neck.
- Incorporated Reparameter Residual Blocks (RRB) and Reparameter Convolution (RC) for richer feature learning.
Main Results
- CMYOLOv5 achieved high performance metrics: 97.16% Precision, 96.41% Recall, 97.18% mAP, and 96.78% F1-score.
- The model demonstrated superior detection accuracy for irregular lung tumors.
- Achieved a processing speed (FPS) of 96.37 with a training time of 3912 seconds.
Conclusions
- The proposed CMYOLOv5 model effectively leverages multi-modal PET/CT data for superior lung tumor detection.
- The integration of advanced fusion techniques and Transformer architecture enhances the model's ability to capture complex lesion features.
- CMYOLOv5 outperforms existing advanced methods in detecting irregular lung tumors.
Related Experiment Videos
Contact us if these videos are not relevant.
Contact us if these videos are not relevant.

