Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Perceiving Loudness, Pitch, and Location

Perceiving Loudness, Pitch, and Location

The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...

Difference from Background: Limit of Detection

Difference from Background: Limit of Detection

The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...

IR Frequency Region: Fingerprint Region

IR Frequency Region: Fingerprint Region

IR spectra are divided into two main regions: the diagnostic region and the fingerprint region. The diagnostic region of the spectrum lies above 1500 cm−1. The absorptions resulting from single-bond vibrations of the N–H, C–H, and O–H stretch at higher wavenumbers and appear on the left side of the spectrum. The stretching absorptions of the C≡C and C≡N occur between 2100–2300 cm−1. In contrast, those arising from stretching absorptions of the...

Force Classification

Force Classification

Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...

Extraction: Advanced Methods

Extraction: Advanced Methods

Metal ions can be separated from one another by complexation with organic ligands–the chelating agent– to form uncharged chelates. Here, the chelating agent must contain hydrophobic groups and behave as a weak acid, losing a proton to bind with the metal. Since most organic ligands used in this process are insoluble or undergo oxidation in the aqueous phase, the chelating agent is initially added to the organic phase and extracted into the aqueous phase. The metal-ligand complex is...

Detection of Black Holes

Detection of Black Holes

Although black holes were theoretically postulated in the 1920s, they remained outside the domain of observational astronomy until the 1970s.
Their closest cousins are neutron stars, which are composed almost entirely of neutrons packed against each other, making them extremely dense. A neutron star has the same mass as the Sun but its diameter is only a few kilometers. Therefore, the escape velocity from their surface is close to the speed of light.
Not until the 1960s, when the first neutron...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Information Entropy-Guided Multi-Scale Feature Fusion for Crowd Density Estimation.

Entropy (Basel, Switzerland)·2026

Same author

Forensics Adapter: Unleashing CLIP for Generalizable Face Forgery Detection.

IEEE transactions on pattern analysis and machine intelligence·2026

Same author

Integrating Multi-Source and Multi-Temporal UAV Observations to Improve Wheat Yield Prediction Using Machine Learning.

Plants (Basel, Switzerland)·2026

Same author

Low-angular-dependence dynamic structural colors enabled by Sb<sub>2</sub>S<sub>3</sub> phase change material.

Optics letters·2026

Same author

Laser protection thin film compatible with multiband stealth based on metal-dielectric structure.

Optics letters·2026

Same author

CrossDF: improving cross-domain deepfake detection with deep information decomposition.

Frontiers in big data·2025

Same journal

Determination of the new psychoactive substances MDMB-4en-PINACA, ADB-BUTINACA and some of their metabolites in blood and urine using DLLE-LC-MS/MS: application to real forensic case samples.

Forensic science international·2026

Same journal

The revolver halo as a forensic marker: Raman spectroscopic evidence of primer-driven gunshot residue deposition.

Forensic science international·2026

Same journal

Research on the effects of signature size on experts' opinions.

Forensic science international·2026

Same journal

Experimental and numerical study of non-penetrating FMJ ballistic impacts on coupled soft and bone tissue surrogates.

Forensic science international·2026

Same journal

Limitations of inkjet and spot amino acid targets for fingermark reagent research and monitoring - and some observations.

Forensic science international·2026

Same journal

Background levels of inorganic gunshot residue like particles on the hands of motorcycle mechanics in Pakistan: Implications for forensic interpretation.

Forensic science international·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 8, 2026

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Forensic deepfake audio detection using segmental speech features.

Tianle Yang¹, Chengzhe Sun², Siwei Lyu²

¹University at Buffalo, Department of Linguistics, Buffalo, 14260, NY, United States.

Forensic Science International

|December 12, 2025

Summary

This summary is machine-generated.

This study shows that specific speech sound features can effectively detect audio deepfakes, unlike general audio characteristics. A new speaker-specific method is proposed for more accurate forensic deepfake detection.

Keywords:

Deepfake audio detection Deepfake speech Forensic voice comparison Likelihood ratio

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Related Experiment Videos

Last Updated: Jan 8, 2026

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Area of Science:

Acoustic Phonetics
Digital Forensics
Artificial Intelligence

Background:

Deepfake audio poses a significant challenge to authenticity verification.
Current deepfake detection methods often rely on global audio features.
Replicating fine-grained articulatory speech characteristics is difficult for deepfake generation models.

Purpose of the Study:

To investigate the efficacy of segmental speech sound features for audio deepfake detection.
To compare the performance of segmental versus global features in identifying deepfakes.
To propose and evaluate a novel speaker-specific framework for deepfake detection.

Main Methods:

Analysis of acoustic features of segmental speech sounds.
Utilizing features common in forensic voice comparison (FVC).
Development and testing of a speaker-specific deepfake detection framework.

Main Results:

Certain segmental features, particularly those used in FVC, are effective in detecting audio deepfakes.
Global audio features showed limited value in distinguishing deepfakes.
The proposed speaker-specific framework demonstrated potential advantages over speaker-independent systems.

Conclusions:

Segmental acoustic features offer a promising avenue for audio deepfake detection, distinct from traditional FVC approaches.
A speaker-specific detection framework is advantageous for forensic applications requiring high interpretability and sensitivity.
Future research should focus on refining speaker-specific models for robust deepfake identification.