Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Deconvolution01:20

Deconvolution

282
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
282

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ROM-Pose: restoring occluded mask image for 2D human pose estimation.

PeerJ. Computer science·2025
Same author

Detection of Cervical Foraminal Stenosis from Oblique Radiograph Using Convolutional Neural Network Algorithm.

Yonsei medical journal·2024
Same author

An Autoscaling System Based on Predicting the Demand for Resources and Responding to Failure in Forecasting.

Sensors (Basel, Switzerland)·2023
Same author

MTGEA: A Multimodal Two-Stream GNN Framework for Efficient Point Cloud and Skeleton Data Alignment.

Sensors (Basel, Switzerland)·2023
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Oct 2, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

538

Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning.

Hojun Lee1, Hyunjun Cho1, Jieun Park1

  • 1Department of Computer Science and Engineering, Dongguk University, Seoul 04620, Korea.

Sensors (Basel, Switzerland)
|February 26, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces novel methods for improved medical image captioning, enhancing text generation from both global and local visual features for more detailed descriptions.

Keywords:
deep learningmedical image captioningtransformer

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.0K
A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

2.0K

Related Experiment Videos

Last Updated: Oct 2, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

538
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.0K
A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

2.0K

Area of Science:

  • Medical Imaging
  • Artificial Intelligence
  • Computer Vision

Background:

  • Transformer-based models show promise in image captioning.
  • Current methods struggle to integrate global image features effectively for comprehensive descriptions.

Purpose of the Study:

  • To develop advanced methods for generating more accurate and detailed medical image captions.
  • To address limitations in current image captioning techniques by incorporating both global and local visual information.

Main Methods:

  • Proposed the Global-Local Visual Extractor (GLVE) to capture comprehensive visual features, including organ size and bone structure, alongside local details like lesion areas.
  • Introduced the Cross Encoder-Decoder Transformer (CEDT) to integrate multi-level encoder features into the decoding process for richer descriptions.

Main Results:

  • The proposed model demonstrated superior performance on the IU X-ray dataset compared to existing transformer-based baselines.
  • Achieved significant improvements in evaluation metrics: 5.6% in BLEU score, 0.56% in METEOR, and 1.98% in ROUGE-L.

Conclusions:

  • The novel GLVE and CEDT methods enhance medical image captioning by effectively utilizing both global and local visual features.
  • The proposed approach generates more detailed and accurate descriptions, outperforming previous transformer-based models in key metrics.