Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

892
The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...
892
Difference from Background: Limit of Detection01:05

Difference from Background: Limit of Detection

8.0K
The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...
8.0K
IR Frequency Region: Fingerprint Region01:03

IR Frequency Region: Fingerprint Region

1.8K
IR spectra are divided into two main regions: the diagnostic region and the fingerprint region. The diagnostic region of the spectrum lies above 1500 cm−1. The absorptions resulting from single-bond vibrations of the N–H, C–H, and O–H stretch at higher wavenumbers and appear on the left side of the spectrum. The stretching absorptions of the C≡C and C≡N occur between 2100–2300 cm−1. In contrast, those arising from stretching absorptions of the...
1.8K
Force Classification01:22

Force Classification

2.2K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
2.2K
Extraction: Advanced Methods00:56

Extraction: Advanced Methods

1.0K
Metal ions can be separated from one another by complexation with organic ligands–the chelating agent– to form uncharged chelates. Here, the chelating agent must contain hydrophobic groups and behave as a weak acid, losing a proton to bind with the metal. Since most organic ligands used in this process are insoluble or undergo oxidation in the aqueous phase, the chelating agent is initially added to the organic phase and extracted into the aqueous phase. The metal-ligand complex is...
1.0K
Detection of Black Holes01:10

Detection of Black Holes

2.5K
Although black holes were theoretically postulated in the 1920s, they remained outside the domain of observational astronomy until the 1970s.
Their closest cousins are neutron stars, which are composed almost entirely of neutrons packed against each other, making them extremely dense. A neutron star has the same mass as the Sun but its diameter is only a few kilometers. Therefore, the escape velocity from their surface is close to the speed of light.
Not until the 1960s, when the first neutron...
2.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Information Entropy-Guided Multi-Scale Feature Fusion for Crowd Density Estimation.

Entropy (Basel, Switzerland)·2026
Same author

Forensics Adapter: Unleashing CLIP for Generalizable Face Forgery Detection.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Integrating Multi-Source and Multi-Temporal UAV Observations to Improve Wheat Yield Prediction Using Machine Learning.

Plants (Basel, Switzerland)·2026
Same author

Low-angular-dependence dynamic structural colors enabled by Sb<sub>2</sub>S<sub>3</sub> phase change material.

Optics letters·2026
Same author

Laser protection thin film compatible with multiband stealth based on metal-dielectric structure.

Optics letters·2026
Same author

CrossDF: improving cross-domain deepfake detection with deep information decomposition.

Frontiers in big data·2025
Same journal

Determination of the new psychoactive substances MDMB-4en-PINACA, ADB-BUTINACA and some of their metabolites in blood and urine using DLLE-LC-MS/MS: application to real forensic case samples.

Forensic science international·2026
Same journal

The revolver halo as a forensic marker: Raman spectroscopic evidence of primer-driven gunshot residue deposition.

Forensic science international·2026
Same journal

Research on the effects of signature size on experts' opinions.

Forensic science international·2026
Same journal

Experimental and numerical study of non-penetrating FMJ ballistic impacts on coupled soft and bone tissue surrogates.

Forensic science international·2026
Same journal

Limitations of inkjet and spot amino acid targets for fingermark reagent research and monitoring - and some observations.

Forensic science international·2026
Same journal

Background levels of inorganic gunshot residue like particles on the hands of motorcycle mechanics in Pakistan: Implications for forensic interpretation.

Forensic science international·2026
See all related articles

Related Experiment Video

Updated: Jan 8, 2026

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

774

Forensic deepfake audio detection using segmental speech features.

Tianle Yang1, Chengzhe Sun2, Siwei Lyu2

  • 1University at Buffalo, Department of Linguistics, Buffalo, 14260, NY, United States.

Forensic Science International
|December 12, 2025
PubMed
Summary
This summary is machine-generated.

This study shows that specific speech sound features can effectively detect audio deepfakes, unlike general audio characteristics. A new speaker-specific method is proposed for more accurate forensic deepfake detection.

Keywords:
Deepfake audio detectionDeepfake speechForensic voice comparisonLikelihood ratio

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

2.0K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.3K

Related Experiment Videos

Last Updated: Jan 8, 2026

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

774
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

2.0K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.3K

Area of Science:

  • Acoustic Phonetics
  • Digital Forensics
  • Artificial Intelligence

Background:

  • Deepfake audio poses a significant challenge to authenticity verification.
  • Current deepfake detection methods often rely on global audio features.
  • Replicating fine-grained articulatory speech characteristics is difficult for deepfake generation models.

Purpose of the Study:

  • To investigate the efficacy of segmental speech sound features for audio deepfake detection.
  • To compare the performance of segmental versus global features in identifying deepfakes.
  • To propose and evaluate a novel speaker-specific framework for deepfake detection.

Main Methods:

  • Analysis of acoustic features of segmental speech sounds.
  • Utilizing features common in forensic voice comparison (FVC).
  • Development and testing of a speaker-specific deepfake detection framework.

Main Results:

  • Certain segmental features, particularly those used in FVC, are effective in detecting audio deepfakes.
  • Global audio features showed limited value in distinguishing deepfakes.
  • The proposed speaker-specific framework demonstrated potential advantages over speaker-independent systems.

Conclusions:

  • Segmental acoustic features offer a promising avenue for audio deepfake detection, distinct from traditional FVC approaches.
  • A speaker-specific detection framework is advantageous for forensic applications requiring high interpretability and sensitivity.
  • Future research should focus on refining speaker-specific models for robust deepfake identification.