Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Nonconscious Mimicry

Nonconscious Mimicry

Nonconscious mimicry occurs when individuals alter their mannerisms to match the behaviors and expressions of those nearby, without intention.

The Anchoring-and-Adjustment Heuristic

The Anchoring-and-Adjustment Heuristic

In order to make good decisions, we use our knowledge and our reasoning. Often, this knowledge and reasoning is sound and solid. However, sometimes, we are swayed by biases or by others manipulating a situation. For example, let’s say you and three friends wanted to rent a house and had a combined target budget of $1,600. The realtor shows you only very run-down houses for $1,600 and then shows you a very nice house for $2,000. Might you ask each person to pay more in rent to get the...

Focusing of Light in the Eye

Focusing of Light in the Eye

Light rays enter the eye through the cornea, a transparent dome-shaped tissue that is the eye's outermost layer. The cornea bends or refracts, light rays traveling to the pupil. The shape of the cornea determines how much of the light is bent and whether the image will be focused correctly on the retina at the back of the eye. Once the light has passed through both refraction layers, it converges into a single focal point onto a small area. This is where photoreceptors start transforming...

Lampbrush Chromosomes

Lampbrush Chromosomes

Improving Translational Accuracy

Improving Translational Accuracy

The Photochemical Reaction Center

The Photochemical Reaction Center

Reaction centers are pigment-protein complexes that initiate energy conversion from photons to chemical entities. Therefore, photochemical reaction center is a more appropriate term that describes these complexes. The Nobel laureates Robert Emerson and William Arnold provided the first experimental evidence of photochemical reaction centers by demonstrating the participation of nearly 2,500 chlorophyll molecules for the release of just one molecule of oxygen. Despite thousands of photosynthetic...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

AVaTER: Fusing Audio, Visual, and Textual Modalities Using Cross-Modal Attention for Emotion Recognition.

Sensors (Basel, Switzerland)·2024

Same author

A systematic review on EEG-based neuromarketing: recent trends and analyzing techniques.

Brain informatics·2024

Same author

ViTab Transformer Framework for Predicting Induced Electric Field and Focality in Transcranial Magnetic Stimulation.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society·2023

Same author

Conv-ViT: A Convolution and Vision Transformer-Based Hybrid Feature Extraction Method for Retinal Disease Detection.

Journal of imaging·2023

Same author

CovTiNet: Covid text identification network using attention-based positional embedding feature fusion.

Neural computing & applications·2023

Same author

Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning.

Journal of imaging·2022

Same journal

Novel Parent Survey Measures Sensory Behaviors Incorporating Sensory Modality and Stimulus Intensity.

Heliyon·2026

Same journal

Corrigendum to "Short-term outcomes of robot-assisted minimally invasive surgery for brainstem hemorrhage: A case-control study" [Heliyon Volume 10, Issue 4, February 2024, Article e25912].

Heliyon·2026

Same journal

Retraction notice to "Rubidium zinc trioxide perovskite materials for photovoltaic solar cell applications: A first principle calculations" [Heliyon 10 (2024) e23818].

Heliyon·2026

Same journal

Retraction notice to "Experimental investigations of dual functional substrate integrated waveguide antenna with enhanced directivity for 5G mobile communications" [Heliyon 10 (2024) e36929].

Heliyon·2026

Same journal

Retraction notice to "Advancing higher education and its implication towards sustainable development: Moderate role of green innovation in BRI economies" [Heliyon 9 (2023) e19519].

Heliyon·2026

Same journal

Retraction notice to "Audit committee features and earnings management" [Heliyon 9 (2023) e20825].

Heliyon·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 13, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

Enhancing image caption generation through context-aware attention mechanism.

Ahatesham Bhuiyan¹, Eftekhar Hossain¹, Mohammed Moshiul Hoque²

¹Department of Electronics and Telecommunication Engineering, Chittagong University of Engineering and Technology, Chittagong, 4349, Bangladesh.

|September 16, 2024

Summary

This summary is machine-generated.

This study introduces a novel context-aware attention mechanism for Bengali image captioning, significantly improving accuracy. The AI model enhances scene understanding and human-computer interaction in low-resource languages.

Keywords:

Attention mechanism Computer vision Cross-domain transfer Encoder-decoder Image captioning Natural language processing

More Related Videos

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Related Experiment Videos

Last Updated: Jun 13, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Area of Science:

Artificial Intelligence
Computer Vision
Natural Language Processing

Background:

Image captioning research primarily focuses on high-resource languages like English.
Low-resource languages, such as Bengali, face unique challenges in generating coherent image descriptions.
Accurate object diagnosis and contextual understanding are crucial for effective image captioning.

Purpose of the Study:

To propose a context-aware attention mechanism for improved image captioning in Bengali.
To address the challenges of low-resource language captioning and domain knowledge transfer.
To enhance the accuracy of linking visual objects with corresponding Bengali words.

Main Methods:

Utilized ResNet-50 for image feature encoding to address vanishing gradients and complex features.
Implemented a bidirectional Gated Recurrent Unit (GRU) with an attention mechanism for caption decoding.
Developed a context-aware attention mechanism over semantic attention for precise object diagnosis.

Main Results:

Achieved significant performance improvements on three Bengali datasets (BAN-Cap, BanglaLekhaImageCaption, Bornon).
Demonstrated METEOR score increases of approximately 30%, 18%, and 45% over existing methods.
Showcased superior performance compared to state-of-the-art models in Bengali image captioning.

Conclusions:

The proposed context-aware, attention-based system significantly enhances Bengali image captioning.
The model effectively captures contextual dependencies for more accurate descriptions.
This research contributes to advancing AI capabilities in low-resource language applications.