Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

1.2K
The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...
1.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A speech prediction model based on codec modeling and transformer decoding.

Computer speech & language·2026
Same author

A Molecular Trimming Strategy for Hypoxia-Tolerant Photosensitizers With Enhanced cGAS-STING Activation.

Angewandte Chemie (International ed. in English)·2026
Same author

Towards decoupling frontend enhancement and backend recognition in monaural robust ASR.

Computer speech & language·2026
Same author

Association of the single-point insulin sensitivity estimator with arterial stiffness: a cross-sectional analysis.

Frontiers in endocrinology·2026
Same author

The role of protein palmitoylation in disease pathogenesis and therapeutic innovation.

Annals of medicine·2026
Same author

Efficacy of SWIM technology combined with direct aspiration first pass technique for large vessel occlusion in acute ischemic stroke.

American journal of translational research·2026
Same journal

Sibilant differentiation before and after tongue cancer surgery: Acoustics, kinematics and the role of sensorimotor controla).

The Journal of the Acoustical Society of America·2026
Same journal

BioNet-A: Ultrasonic echo representation network for target discrimination using active SONAR.

The Journal of the Acoustical Society of America·2026
Same journal

Empty soft-drink cans and mass-loaded rods: Analogous homework problems from acoustic and mechanical domains.

The Journal of the Acoustical Society of America·2026
Same journal

Erratum: Statistical wave field theory: Anisotropic wave fields under Neumann's boundary condition [J. Acoust. Soc. Am. 159(3), 2265-2280 (2026)].

The Journal of the Acoustical Society of America·2026
Same journal

On the modification of tip leakage noise sources by porous treatment.

The Journal of the Acoustical Society of America·2026
Same journal

An educational opportunity: Acoustics in an empty room.

The Journal of the Acoustical Society of America·2026
See all related articles

Related Experiment Video

Updated: Mar 6, 2026

A Lightweight, Headphones-based System for Manipulating Auditory Feedback in Songbirds
10:13

A Lightweight, Headphones-based System for Manipulating Auditory Feedback in Songbirds

Published on: November 26, 2012

14.8K

Speaker-dependent multipitch tracking using deep neural networks.

Yuzhou Liu1, DeLiang Wang1

  • 1Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio 43210, USA.

The Journal of the Acoustical Society of America
|March 4, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces deep neural networks (DNNs) for accurate multipitch tracking in two-speaker audio. The novel methods improve speaker assignment and pitch estimation, outperforming existing techniques.

More Related Videos

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
09:44

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

6.0K
Sound Source Localization Testing in Single-sided Deafness Following Bone Conduction Intervention
04:32

Sound Source Localization Testing in Single-sided Deafness Following Bone Conduction Intervention

Published on: December 20, 2024

949

Related Experiment Videos

Last Updated: Mar 6, 2026

A Lightweight, Headphones-based System for Manipulating Auditory Feedback in Songbirds
10:13

A Lightweight, Headphones-based System for Manipulating Auditory Feedback in Songbirds

Published on: November 26, 2012

14.8K
Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
09:44

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

6.0K
Sound Source Localization Testing in Single-sided Deafness Following Bone Conduction Intervention
04:32

Sound Source Localization Testing in Single-sided Deafness Following Bone Conduction Intervention

Published on: December 20, 2024

949

Area of Science:

  • Signal Processing
  • Machine Learning
  • Speech Recognition

Background:

  • Multipitch tracking is crucial for analyzing simultaneous speech.
  • Accurate pitch estimation and speaker assignment remain significant challenges.

Purpose of the Study:

  • To develop advanced deep neural network (DNN) models for robust multipitch tracking.
  • To improve simultaneous speaker identification and pitch estimation accuracy.

Main Methods:

  • Utilized speaker-dependent and speaker-pair-dependent DNNs to model probabilistic pitch states.
  • Incorporated extensions like gender-pair-dependent DNNs and speaker adaptation.
  • Employed a factorial hidden Markov model (FHMM) with a junction tree algorithm for pitch track generation.

Main Results:

  • Proposed DNN-based methods significantly outperformed existing speaker-independent and speaker-dependent trackers.
  • Multi-ratio training ensured consistent performance across varying speaker energy levels.
  • Achieved substantial improvements in both pitch estimation and speaker assignment accuracy.

Conclusions:

  • Deep neural networks offer a powerful approach for complex multipitch tracking tasks.
  • The developed methods provide a significant advancement in simultaneous speech processing.
  • The system demonstrates robustness and high performance in realistic two-speaker scenarios.