Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Hearing01:31

Hearing

55.4K
When we hear a sound, our nervous system is detecting sound waves—pressure waves of mechanical energy traveling through a medium. The frequency of the wave is perceived as pitch, while the amplitude is perceived as loudness.
55.4K
SBAR II: Application of SBAR01:14

SBAR II: Application of SBAR

5.3K
SBAR is an effective communication tool used by healthcare professionals to communicate patient information accurately. SBAR stands for Situation, Background, Assessment, and Recommendation. For a better understanding, an example is given below.
SBAR Report from a Nurse to a Health Care Provider
S: "Hello, Dr. Smith. This is Jane, RN, from the Med Surg unit. I am calling to tell you about Ms. White in Room 210, who is experiencing increased pain and redness at her incision site. Her recent...
5.3K
Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

643
The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...
643
Language and Cognition01:27

Language and Cognition

553
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
553
Air-entraining Agents01:27

Air-entraining Agents

169
Air-entraining agents improve the durability and workability of concrete in climates with frequent freezing and thawing. These agents prevent cracks by introducing small air bubbles into the mix, creating spaces accommodating water expansion when temperatures drop. The air-entraining agents lower the surface tension of water, forming stable, small air bubbles. This method is more effective than having accidental large voids, as the intentional, smaller, and evenly distributed air voids improve...
169
Force Classification01:22

Force Classification

2.0K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
2.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

iCub3 avatar system: Enabling remote fully immersive embodiment of humanoid robots.

Science robotics·2024
Same author

iCub-HRI: A Software Framework for Complex Human-Robot Interaction Scenarios on the iCub Humanoid Robot.

Frontiers in robotics and AI·2021
Same author

Editorial: Consciousness in Humanoid Robots.

Frontiers in robotics and AI·2021
Same author

iCub: The not-yet-finished story of building a robot child.

Science robotics·2020
Same author

A Novel Sensorised Insole for Sensing Feet Pressure Distributions.

Sensors (Basel, Switzerland)·2020
Same author

Author Correction: Multi-layer adaptation of group coordination in musical ensembles.

Scientific reports·2020
Same journal

Editorial: Synergizing large language models and computational intelligence for advanced robotic systems.

Frontiers in robotics and AI·2026
Same journal

Editorial: Innovations in industry 4.0: advancing mobility and manipulation in robotics.

Frontiers in robotics and AI·2026
Same journal

MPM-based simulation and bounded-error compression of material points for magnetic tactile sensors.

Frontiers in robotics and AI·2026
Same journal

Torque-sensorless control of a high-ratio, backdrivable Wolfrom-gearbox for safe human-centered robotics.

Frontiers in robotics and AI·2026
Same journal

The implications of robot navigation in social space: perceptual effects of socially aware and baseline navigation.

Frontiers in robotics and AI·2026
Same journal

DPTG: diffusion policy with tactile feasibility guidance.

Frontiers in robotics and AI·2026
See all related articles

Related Experiment Video

Updated: Nov 19, 2025

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.8K

Speech Recognition for the iCub Platform.

Bertrand Higy1,2, Alessio Mereta3, Giorgio Metta1

  • 1iCub Facility, Istituto Italiano di Tecnologia, Genoa, Italy.

Frontiers in Robotics and AI
|January 27, 2021
PubMed
Summary
This summary is machine-generated.

This paper introduces open-source software for creating automatic speech recognition (ASR) systems, including novel articulatory and unsupervised methods. These systems are designed for human-robot interaction, specifically with the iCub robot.

Keywords:
automatic speech recognitioncode:C++code:matlabcode:pythontensorflowyarp

More Related Videos

Systematic Hearing Performance Evaluation Process for Adolescents with Cochlear Implantation at Early Ages
06:04

Systematic Hearing Performance Evaluation Process for Adolescents with Cochlear Implantation at Early Ages

Published on: March 24, 2023

624
A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS
12:43

A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS

Published on: February 21, 2011

35.5K

Related Experiment Videos

Last Updated: Nov 19, 2025

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.8K
Systematic Hearing Performance Evaluation Process for Adolescents with Cochlear Implantation at Early Ages
06:04

Systematic Hearing Performance Evaluation Process for Adolescents with Cochlear Implantation at Early Ages

Published on: March 24, 2023

624
A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS
12:43

A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS

Published on: February 21, 2011

35.5K

Area of Science:

  • Robotics
  • Artificial Intelligence
  • Computational Neuroscience

Background:

  • Developing effective automatic speech recognition (ASR) systems for human-robot interaction presents significant challenges.
  • Existing ASR systems often require extensive labeled data and struggle in complex acoustic environments.
  • Bridging the gap between human speech perception and machine recognition is crucial for intuitive robot communication.

Purpose of the Study:

  • To present an open-source software toolkit for building and deploying ASR systems within the YARP platform.
  • To enable non-experts to create custom ASR systems for the iCub robot.
  • To develop advanced deep learning-based ASR models addressing challenges in human-iCub verbal interactions.

Main Methods:

  • The toolkit integrates Python, C++, and shell scripts within the YARP framework.
  • It offers codebases for both general ASR system development and specialized bio-inspired/developmental learning approaches.
  • Two distinct ASR paradigms are implemented: articulatory (motor cortex-inspired) and unsupervised speech recognition.

Main Results:

  • The developed articulatory speech recognition systems demonstrate superior performance compared to strong deep learning baselines.
  • Unsupervised speech recognition systems are provided, mimicking infant language acquisition without requiring labeled data.
  • Pre-trained deep learning models and the VoCub dataset (2.5h spoken commands) are released to facilitate ASR system adaptation.

Conclusions:

  • The open-source toolkit simplifies ASR system creation for non-experts, particularly for the iCub platform.
  • Novel articulatory and unsupervised ASR approaches show promise for robust human-robot communication.
  • The provided resources, including models and datasets, accelerate research and development in robot speech recognition.