


Shennong: A Python toolbox for audio speech features extraction.

Mathieu Bernard¹,², Maxime Poli³, Julien Karadayi³

¹Cognitive Machine Learning, PSL Research University, CNRS, EHESS, ENS, Inria, Paris, France. mathieu.bernard.2@cnrs.fr.

Behavior Research Methods

February 7, 2023

Summary

This summary is machine-generated.

Shennong is an open-source Python toolbox for extracting speech features from audio. It implements both well-established and state-of-the-art algorithms, and it integrates with existing Python machine learning workflows so that researchers and developers can run speech analyses with little setup.

Keywords:

Features extraction; Pitch estimation; Python; Software; Speech processing


Area of Science:

Speech Processing
Computational Linguistics
Machine Learning

Background:

Accurate audio speech feature extraction is crucial for various applications, including speech recognition and speaker identification.
Existing toolboxes may lack comprehensive algorithm implementations or user-friendly interfaces.

Purpose of the Study:

To introduce Shennong, an open-source Python toolbox for audio speech feature extraction.
To provide a reliable and extensible framework integrating state-of-the-art algorithms.
To demonstrate Shennong's utility through practical applications and benchmarks.

Main Methods:

Implementation of diverse algorithms: spectro-temporal filters (e.g., Mel-Frequency Cepstral Filterbank), predictive linear filters, pre-trained neural networks, pitch estimators, speaker normalization, and post-processing.
Development as a Python toolbox and command-line utility, built upon the Kaldi speech processing library.
Integration with the Python ecosystem for machine learning and speech modeling tools.
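
To make the idea of a mel-scale filterbank concrete, here is a minimal sketch in plain Python. It is not Shennong's implementation (which builds on Kaldi) and does not use Shennong's API; the function names `hz_to_mel`, `mel_to_hz`, `mel_filterbank`, and `log_mel_energies` are hypothetical, and the HTK-style mel formula and triangular filter shape are assumptions chosen for illustration.

```python
import math


def hz_to_mel(hz):
    """Convert a frequency in Hz to the mel scale (HTK-style formula, assumed)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(num_filters, num_bins, sample_rate):
    """Build triangular filters spaced evenly on the mel scale.

    Returns num_filters rows, each holding num_bins weights that apply to
    the bins of a one-sided power spectrum (0 Hz up to the Nyquist frequency).
    """
    nyquist = sample_rate / 2.0
    max_mel = hz_to_mel(nyquist)
    # num_filters + 2 edge points: each filter spans three consecutive edges.
    edges_hz = [mel_to_hz(max_mel * i / (num_filters + 1))
                for i in range(num_filters + 2)]
    # Center frequency of each spectrum bin, in Hz.
    bin_hz = [nyquist * k / (num_bins - 1) for k in range(num_bins)]

    bank = []
    for m in range(num_filters):
        low, center, high = edges_hz[m], edges_hz[m + 1], edges_hz[m + 2]
        row = []
        for f in bin_hz:
            if low < f <= center:
                row.append((f - low) / (center - low))    # rising slope
            elif center < f < high:
                row.append((high - f) / (high - center))  # falling slope
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def log_mel_energies(power_spectrum, bank, floor=1e-10):
    """Apply the filterbank to one power-spectrum frame and take logs."""
    return [math.log(max(floor, sum(w * p for w, p in zip(row, power_spectrum))))
            for row in bank]
```

A call such as `log_mel_energies(frame, mel_filterbank(8, 129, 16000))` turns one 129-bin power-spectrum frame into 8 log energies. Cepstral features are commonly obtained by applying a discrete cosine transform to such log energies; that step is omitted here to keep the sketch short.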

Main Results:

Shennong offers a wide range of well-established and state-of-the-art algorithms for speech feature extraction.
The toolbox is designed for ease of use by non-technical users and seamless integration with other Python tools.
Applications demonstrate benchmarking of feature extraction, analysis of speaker normalization performance, and comparison of pitch estimation algorithms under noise.
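
The kind of experiment behind a pitch-under-noise comparison can be sketched as follows. This is an illustrative toy, not one of the pitch estimators shipped with Shennong and not the paper's benchmark: it uses a plain autocorrelation peak search on a synthetic sine wave, and the names `synth_tone` and `estimate_pitch`, the 60 to 400 Hz search range, and the Gaussian noise model are all assumptions.

```python
import math
import random


def synth_tone(freq_hz, sample_rate, num_samples, noise_std=0.0, seed=0):
    """Generate a sine wave with optional additive Gaussian noise."""
    rng = random.Random(seed)
    return [math.sin(2.0 * math.pi * freq_hz * n / sample_rate)
            + rng.gauss(0.0, noise_std)
            for n in range(num_samples)]


def estimate_pitch(frame, sample_rate, min_hz=60.0, max_hz=400.0):
    """Estimate the fundamental frequency from the autocorrelation peak.

    Searches the lags that correspond to [min_hz, max_hz] and returns the
    frequency whose lag maximizes the (unnormalized) autocorrelation.
    """
    min_lag = int(sample_rate / max_hz)
    max_lag = int(sample_rate / min_hz)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(frame[n] * frame[n + lag]
                    for n in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


if __name__ == "__main__":
    # Compare the estimate on the same tone at increasing noise levels.
    for noise_std in (0.0, 0.1, 0.5, 1.0):
        frame = synth_tone(200.0, 8000, 800, noise_std=noise_std)
        print(noise_std, estimate_pitch(frame, 8000))
```

Because the estimate is quantized to integer lags, its resolution is limited by the sample rate. A real comparison would use recorded speech, several estimators, and an error metric averaged over many frames.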

Conclusions:

Shennong provides a valuable, open-source resource for researchers and developers in speech processing.
Its comprehensive algorithm set and Python integration facilitate advanced speech analysis and machine learning tasks.
The demonstrated applications highlight Shennong's effectiveness in diverse speech-related research scenarios.