Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Steps in the Modeling Process01:14

Steps in the Modeling Process

375
Albert Bandura's theory of observational learning identifies four critical processes: attention, retention, motor reproduction, and reinforcement or motivation.
Attention is the first necessary component for observational learning. It involves focusing on what the model is doing and saying. For example, if you decide to take a drawing class to enhance your skills, you need to pay close attention to the instructor's words and hand movements. The characteristics of the model significantly...
375
Observational Learning01:12

Observational Learning

379
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
379

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Complete genome sequence of Brucella melitensis 133, an isolate of biovar 1 of sequence type 32.

Journal of bacteriology·2012
Same author

The role of prolactin receptor in GH signaling in breast cancer cells.

Molecular endocrinology (Baltimore, Md.)·2012
Same author

Green tea powder and Lactobacillus plantarum affect gut microbiota, lipid metabolism and inflammation in high-fat fed C57BL/6J mice.

Nutrition & metabolism·2012
Same author

[Effects of early application of Tuina treatment on quadriceps surface myoelectricity in patients after total knee arthroplasty: a randomized controlled trial].

Zhong xi yi jie he xue bao = Journal of Chinese integrative medicine·2012
Same author

A minimally invasive postero-medial approach with suture anchors for isolated tibial avulsion fracture of the posterior cruciate ligament.

The Knee·2012
Same author

The chloride channel/transporter Slc26a9 regulates the systemic arterial pressure and renal chloride excretion.

Journal of molecular medicine (Berlin, Germany)·2012
Same journal

Big Data-Driven Video Anomaly Detection Using VideoMAE for Visual Analytics in CCTV Surveillance.

Big data·2026
Same journal

Agentic Artificial Intelligence-Driven Explainable Deep Learning for Deciphering Noncoding Pathogenic Mechanisms of Delirium Through Genomic Big Data Integration.

Big data·2026
Same journal

Personalized Driven Instruction Through Explainable Agentic AI in Multicultural Higher Education Environments.

Big data·2026
Same journal

Big Data-Driven Explainable Agentic AI Decision Frameworks for Enterprise Innovation in FinTech Ecosystems.

Big data·2026
Same journal

An Edge-Enabled Low-Latency Cross-Lingual Speech-to-Text Framework for Efficient Human-Robot Interaction.

Big data·2026
Same journal

DS<sup>2</sup>PT: A Deep Two-Stage Patent Text Segmentation Framework Informed by Low-Latency Neural Network Characteristics.

Big data·2026
See all related articles

Related Experiment Video

Updated: Oct 14, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

685

Multilevel Attention Networks and Policy Reinforcement Learning for Image Caption Generation.

Zhibo Zhou1, Xiaoming Zhang2, Zhoujun Li1

  • 1School of Computer Science and Engineering, Beihang University, Beijing, China.

Big Data
|November 2, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces a novel Multilevel Attention Networks and Policy Reinforcement Learning model for advanced image captioning. The model generates accurate, natural language descriptions by effectively processing image details.

Keywords:
attention modelimage captioningmultimodalreinforcement learning

More Related Videos

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

4.4K

Related Experiment Videos

Last Updated: Oct 14, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

685
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

4.4K

Area of Science:

  • Artificial Intelligence
  • Computer Vision
  • Natural Language Processing

Background:

  • Large-scale multimodal data analysis is increasingly important.
  • Image captioning, generating text descriptions for images, is a key AI challenge.
  • Existing methods often focus on either global image features or specific object/region details.

Purpose of the Study:

  • To develop a novel model for image caption generation.
  • To improve the accuracy and naturalness of automatically generated image descriptions.
  • To leverage both global and local image characteristics effectively.

Main Methods:

  • Proposed a novel Multilevel Attention Networks and Policy Reinforcement Learning model.
  • Incorporated an object-attention network for global/local object details.
  • Utilized a region-attention network for global/local region features.
  • Employed policy reinforcement learning to address training and generation issues.

Main Results:

  • The model demonstrated superior performance compared to existing methods.
  • Achieved accurate and natural language image descriptions.
  • Validated through extensive experiments on MSCOCO and Flickr30k datasets.

Conclusions:

  • The proposed model effectively integrates multilevel attention and policy reinforcement learning.
  • This approach enhances image captioning by capturing comprehensive image details.
  • The model offers a significant advancement in automated image description generation.