Enhancing image caption generation through context-aware attention mechanism.

Ahatesham Bhuiyan (1), Eftekhar Hossain (1), Mohammed Moshiul Hoque (2)

  • (1) Department of Electronics and Telecommunication Engineering, Chittagong University of Engineering and Technology, Chittagong, 4349, Bangladesh.

Heliyon | September 16, 2024
Summary
This summary is machine-generated.

This study introduces a novel context-aware attention mechanism for Bengali image captioning, significantly improving accuracy. The AI model enhances scene understanding and human-computer interaction in low-resource languages.

Keywords:
Attention mechanism, Computer vision, Cross-domain transfer, Encoder-decoder, Image captioning, Natural language processing

Area of Science:

  • Artificial Intelligence
  • Computer Vision
  • Natural Language Processing

Background:

  • Image captioning research primarily focuses on high-resource languages like English.
  • Low-resource languages, such as Bengali, face unique challenges in generating coherent image descriptions.
  • Accurate object diagnosis and contextual understanding are crucial for effective image captioning.

Purpose of the Study:

  • To propose a context-aware attention mechanism for improved image captioning in Bengali.
  • To address the challenges of low-resource language captioning and domain knowledge transfer.
  • To enhance the accuracy of linking visual objects with corresponding Bengali words.

Main Methods:

  • Used ResNet-50 to encode image features; its residual connections mitigate vanishing gradients and help capture complex visual features.
  • Implemented a bidirectional Gated Recurrent Unit (GRU) with an attention mechanism for caption decoding.
  • Developed a context-aware attention mechanism over semantic attention for precise object diagnosis.
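
The attention step in the methods above can be illustrated with a minimal sketch. This is generic dot-product attention, not the authors' context-aware mechanism, whose details are not given in this summary. The names `attend`, `query`, and `region_features` are hypothetical: `region_features` stands in for per-region vectors from an image encoder such as ResNet-50, and `query` stands in for a decoder (GRU) hidden state.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, region_features):
    """Generic dot-product attention (an illustrative assumption,
    not the paper's context-aware attention).

    query: decoder state, a list of floats.
    region_features: one feature vector per image region.
    Returns (weights, context), where context is the weighted
    average of the region vectors.
    """
    # Similarity between the decoder state and each image region.
    scores = [sum(q * f for q, f in zip(query, feat))
              for feat in region_features]
    weights = softmax(scores)
    dim = len(region_features[0])
    # Weighted sum of region vectors, one dimension at a time.
    context = [sum(w * feat[d] for w, feat in zip(weights, region_features))
               for d in range(dim)]
    return weights, context


# Two toy "regions"; the query is aligned with the first one.
weights, context = attend([2.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
```

In a captioning decoder, a context vector of this kind would typically be combined with the previous word embedding before each decoding step, so that the region most relevant to the next word dominates the input.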

Main Results:

  • Achieved significant performance improvements on three Bengali datasets (BAN-Cap, BanglaLekhaImageCaption, Bornon).
  • Demonstrated METEOR score increases of approximately 30%, 18%, and 45% over existing methods.
  • Showcased superior performance compared to state-of-the-art models in Bengali image captioning.
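
For readers unfamiliar with the METEOR metric reported above, the sketch below shows how a score is computed from match counts, assuming the original formulation (a recall-weighted harmonic mean of unigram precision and recall, discounted by a fragmentation penalty). The function name and its count-based interface are hypothetical, and the summary does not state which METEOR variant or parameters the authors used.

```python
def meteor_from_counts(matches, hyp_len, ref_len, chunks):
    """METEOR score from precomputed alignment counts.

    Assumes the original parameterization: F-mean = 10PR / (R + 9P),
    penalty = 0.5 * (chunks / matches) ** 3.

    matches: unigrams aligned between the generated and reference caption
    hyp_len: number of unigrams in the generated caption
    ref_len: number of unigrams in the reference caption
    chunks:  number of contiguous, same-order runs of matched unigrams
    """
    if matches == 0:
        return 0.0
    precision = matches / hyp_len
    recall = matches / ref_len
    # Harmonic mean that weights recall nine times more than precision.
    f_mean = 10 * precision * recall / (recall + 9 * precision)
    # Fewer, longer chunks mean better word order and a smaller penalty.
    penalty = 0.5 * (chunks / matches) ** 3
    return f_mean * (1 - penalty)


# Same four matched words, in one contiguous run versus four scattered runs.
in_order = meteor_from_counts(4, 4, 4, 1)
scattered = meteor_from_counts(4, 4, 4, 4)
```

Because the penalty grows with the number of chunks, a caption containing the right words in the wrong order scores lower than one that preserves the reference ordering.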

Conclusions:

  • The proposed context-aware, attention-based system significantly enhances Bengali image captioning.
  • The model effectively captures contextual dependencies for more accurate descriptions.
  • This research contributes to advancing AI capabilities in low-resource language applications.