Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Elaborative Rehearsals01:07

Elaborative Rehearsals

133
Elaborative rehearsal is a crucial cognitive strategy that strengthens information encoding in long-term memory by making meaningful connections between new data and pre-existing knowledge. This approach contrasts with maintenance rehearsal, which involves simple repetition without delving into the significance of the information. While maintenance rehearsal might temporarily keep information active in short-term memory, it is less effective for long-term retention.
The effectiveness of...
133
Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

429
The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...
429
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

150
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
150
Introduction to Learning01:18

Introduction to Learning

533
Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...
533
Neural Circuits01:25

Neural Circuits

1.6K
Neural circuits and neuronal pools are two of the main structures found in the nervous system. Neural circuits are networks of neurons that work together to carry out a specific task or process. They consist of interconnected neurons and glial cells, which provide structural and metabolic support.
Neuronal pools are collections of nerve cells with similar functions and interact through chemical and electrical signals. These pools include both interneurons (the central neural circuit nodes that...
1.6K
Associative Learning01:27

Associative Learning

579
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
579

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Molecular typing and characterization of a novel genotype of EV-B93 isolated from Tibet, China.

PloS one·2020
Same author

Fecal microbiota transplantation from patients with autoimmune encephalitis modulates Th17 response and relevant behaviors in mice.

Cell death discovery·2020
Same author

Comparison of Histogram-Based Gaussian Analysis With and Without Noise Correction for the Characterization of Indeterminate Adrenal Nodules.

AJR. American journal of roentgenology·2020
Same author

Photo-induced specific intracellular release EGFR inhibitor from enzyme/ROS-dual sensitive nano-platforms for molecular targeted-photodynamic combinational therapy of non-small cell lung cancer.

Journal of materials chemistry. B·2020
Same author

SYNTHESIS OF 1,3,4-OXADIAZOLES AS SELECTIVE T-TYPE CALCIUM CHANNEL INHIBITORS.

Heterocycles·2020
Same author

BMP-2 Signaling and Mechanotransduction Synergize to Drive Osteogenic Differentiation via YAP/TAZ.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2020
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Sep 13, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.3K

Advancing deep learning for expressive music composition and performance modeling.

Man Zhang1

  • 1School of Mechanical Engineering, Yellow River Conservancy Technical University, Kaifeng, 475004, Henan, China. 2010830675@yrcti.edu.cn.

Scientific Reports
|July 31, 2025
PubMed
Summary
This summary is machine-generated.

This study compared deep learning models for AI music generation. Transformer models showed promise, but human compositions remain superior in expressiveness.

Keywords:
AI music generationDeep learningExpressive performance modelingGenerative adversarial networks (GANs)Harmonic consistencyLong short-term memory (LSTM)Music transcriptionPerplexityTransformer models

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.6K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

701

Related Experiment Videos

Last Updated: Sep 13, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.3K
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.6K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

701

Area of Science:

  • Artificial Intelligence
  • Music Information Retrieval
  • Machine Learning

Background:

  • AI music generation faces challenges in long-term structure and emotional nuance.
  • Deep learning models have advanced AI music composition and transcription.

Purpose of the Study:

  • To comparatively analyze Long Short-Term Memory (LSTM) networks, Transformer models, and Generative Adversarial Networks (GANs) for AI music.
  • To evaluate AI music generation and transcription using the MAESTRO dataset.

Main Methods:

  • Comparative analysis of LSTM, Transformer, and GAN architectures.
  • Dual evaluation framework: objective metrics (perplexity, harmonic consistency, rhythmic entropy) and subjective Mean Opinion Score (MOS) human evaluations.
  • Utilized the MAESTRO dataset for training and evaluation.

Main Results:

  • Transformer models achieved the best performance on objective metrics and MOS (4.3).
  • Objective metrics for Transformers: perplexity 2.87, harmonic consistency 79.4%.
  • Human compositions achieved the highest perceptual quality (MOS: 4.8).

Conclusions:

  • Transformer models demonstrate superior capabilities for expressive AI music generation.
  • Future AI music systems require emotion-aware modeling and human-AI collaboration.
  • Reinforcement learning is crucial for bridging the gap between AI and human music.