



Deep gradient reinforcement learning for music improvisation in cloud computing framework.

Fadwa Alrowais¹, Munya A Arasi², Saud S Alotaibi³

¹Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

PeerJ Computer Science

February 3, 2025

Summary

This summary is machine-generated.

This study introduces an artificial intelligence (AI) model that uses reinforcement learning (RL) for real-time music improvisation. According to the authors, the model generates harmonically cohesive and aesthetically intriguing musical pieces and outperforms existing methods.

Keywords:

Cloud frameworks
Containerization
Gated recurrent units
Music improvisation
Reinforcement learning


Area of Science:

Music Technology
Artificial Intelligence
Computational Creativity

Background:

Real-time music improvisation presents challenges in creating dynamic and flexible compositions.
Artificial intelligence (AI) offers potential solutions for enhancing human creativity in music.
Reinforcement learning (RL) is explored as a method for developing interactive music creation systems.

Purpose of the Study:

To explore the use of reinforcement learning (RL) techniques for creating interactive and responsive music improvisation systems.
To develop an AI agent capable of navigating musical possibilities for real-time improvisation.
To generate aesthetically intriguing and harmonically cohesive musical improvisations.

Main Methods:

Utilized bi-directional gated recurrent units to identify melodic frameworks in musical data.
Transformed musical elements (notes, chords, rhythms) into a format suitable for RL input.
Employed a deep gradient-based reinforcement learning technique with a custom reward system.
Trained the RL agent on the Bach Chorales dataset within a containerized cloud environment.
Rendered improvised music in MIDI format.
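
The encoding and policy-gradient steps listed above can be illustrated with a small sketch. It is not the authors' implementation: the paper describes a deep model trained on the Bach Chorales dataset, whereas this example swaps in a tabular softmax policy trained with REINFORCE and a per-step baseline. The one-octave pitch range, the C major reward, and every function name here are assumptions made for illustration only.

```python
import math
import random

# Hypothetical action space: MIDI pitches C4..C5 (an assumption, not from the paper).
PITCHES = list(range(60, 73))
C_MAJOR = {0, 2, 4, 5, 7, 9, 11}  # pitch classes of the C major scale


def reward(prev_pitch, pitch):
    """Toy custom reward (assumed): favour in-scale notes, penalise large leaps."""
    in_scale = 1.0 if pitch % 12 in C_MAJOR else -1.0
    leap_penalty = 0.1 * max(0, abs(pitch - prev_pitch) - 4)
    return in_scale - leap_penalty


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train(episodes=3000, steps=8, lr=0.05, seed=0):
    """REINFORCE with a per-step baseline on a tabular policy theta[state][action]."""
    rng = random.Random(seed)
    n = len(PITCHES)
    theta = [[0.0] * n for _ in range(n)]
    baseline = [0.0] * steps  # running mean of the return-to-go at each step
    for _ in range(episodes):
        state, trajectory = 0, []  # state 0 encodes the starting pitch, MIDI 60
        for _ in range(steps):
            probs = softmax(theta[state])
            action = rng.choices(range(n), weights=probs)[0]
            r = reward(PITCHES[state], PITCHES[action])
            trajectory.append((state, action, probs, r))
            state = action  # the note just played becomes the next state
        g = 0.0
        for t in reversed(range(steps)):
            s, a, probs, r = trajectory[t]
            g += r  # undiscounted return-to-go
            advantage = g - baseline[t]
            baseline[t] += 0.05 * advantage
            for b in range(n):
                indicator = 1.0 if b == a else 0.0
                # Gradient of log softmax with respect to theta[s][b]
                theta[s][b] += lr * advantage * (indicator - probs[b])
    return theta


def improvise(theta, steps=8, start=60):
    """Greedy rollout: follow the most probable note from each state."""
    state, melody = start - PITCHES[0], []
    for _ in range(steps):
        state = max(range(len(PITCHES)), key=lambda a: theta[state][a])
        melody.append(PITCHES[state])
    return melody


if __name__ == "__main__":
    print(improvise(train()))
```

The melody is returned as a list of MIDI note numbers; writing an actual MIDI file, as the study does, would need an additional library and is left out of this sketch.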

Main Results:

The proposed AI model reports the following metric values: +0.15 for Pitch Frequency (PF), -0.43 for Standard Pitch Delay (SPD), -0.07 for Average Distance Between Peaks (ADP), and 0.0041 for Note Duration Gradient (NDG).
According to the authors, these values indicate superior performance compared to other music improvisation methods.
The model demonstrated the ability to generate harmonically cohesive and aesthetically intriguing improvisations.

Conclusions:

Reinforcement learning provides a viable approach for developing sophisticated AI-powered music improvisation systems.
The integration of deep learning techniques with RL enables the creation of novel and high-quality musical compositions.
The proposed method shows promise for advancing the field of computational creativity in music.