Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Testing a Claim about Mean: Known Population SD01:11

Testing a Claim about Mean: Known Population SD

2.7K
A complete procedure of testing the hypothesis about a population mean is explained here.
Estimating a population mean requires the samples to be distributed normally. The data should be collected from the randomly selected samples having no sampling bias. The sample size needed to be higher than 30, and most importantly, the population standard deviation should be already known.
In most realistic situations, the population standard deviation is often unknown, but in rare circumstances, when it...
2.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Simplifying software compliance: AI technologies in drafting technical documentation for the AI Act.

Empirical software engineering·2025
Same author

Visualising data science workflows to support third-party notebook comprehension: an empirical study.

Empirical software engineering·2023
Same author

Workflow analysis of data science code in public GitHub repositories.

Empirical software engineering·2022
Same author

The evolution of the code during review: an investigation on review changes.

Empirical software engineering·2022
Same author

The effects of change decomposition on code review-a controlled experiment.

PeerJ. Computer science·2021
Same author

Does single blind peer review hinder newcomers?

Scientometrics·2017
Same journal

How students use generative AI for software testing: An observational study.

Empirical software engineering·2026
Same journal

Is common sense all you need? Using expert defined rules to identify vulnerability patches instead of machine learning.

Empirical software engineering·2026
Same journal

Less is more: usefulness of data flow diagrams and large language models for security threat validation.

Empirical software engineering·2026
Same journal

SecMLOps: A comprehensive framework for integrating security throughout the machine learning operations lifecycle.

Empirical software engineering·2026
Same journal

Tools and benchmarks evolve: what is their impact on parameter tuning in SBSE experiments?

Empirical software engineering·2025
Same journal

AI support for data scientists: An empirical study on workflow and alternative code recommendations.

Empirical software engineering·2025
See all related articles

Related Experiment Video

Updated: Jun 18, 2025

Modeling Fetal Alcohol Spectrum Disorders in Zebrafish to Characterize the Impact of an Adverse Embryonic Environment on Adult Social Behavior
04:50

Modeling Fetal Alcohol Spectrum Disorders in Zebrafish to Characterize the Impact of an Adverse Embryonic Environment on Adult Social Behavior

Published on: February 9, 2024

314

On Refining the SZZ Algorithm with Bug Discussion Data.

Pooja Rani1, Fernando Petrulio1, Alberto Bacchelli1

  • 1Department of Informatics, University of Zurich, Zurich, Switzerland.

Empirical Software Engineering
|July 29, 2024
PubMed
Summary
This summary is machine-generated.

Incorporating bug discussion details significantly improves the accuracy of the SZZ algorithm in identifying bug-introducing commits. This enhancement aids in pinpointing software defects more precisely by analyzing related files mentioned in developer conversations.

Keywords:
Bug-introducing commitsEmpirical researchMozillaPull requestSZZ algorithmSoftware qualityTaxonomy

More Related Videos

Efficient PAM-Less Base Editing for Zebrafish Modeling of Human Genetic Disease with zSpRY-ABE8e
07:31

Efficient PAM-Less Base Editing for Zebrafish Modeling of Human Genetic Disease with zSpRY-ABE8e

Published on: February 17, 2023

1.1K
The Three-Chamber Choice Behavioral Task using Zebrafish as a Model System
07:55

The Three-Chamber Choice Behavioral Task using Zebrafish as a Model System

Published on: April 14, 2021

3.8K

Related Experiment Videos

Last Updated: Jun 18, 2025

Modeling Fetal Alcohol Spectrum Disorders in Zebrafish to Characterize the Impact of an Adverse Embryonic Environment on Adult Social Behavior
04:50

Modeling Fetal Alcohol Spectrum Disorders in Zebrafish to Characterize the Impact of an Adverse Embryonic Environment on Adult Social Behavior

Published on: February 9, 2024

314
Efficient PAM-Less Base Editing for Zebrafish Modeling of Human Genetic Disease with zSpRY-ABE8e
07:31

Efficient PAM-Less Base Editing for Zebrafish Modeling of Human Genetic Disease with zSpRY-ABE8e

Published on: February 17, 2023

1.1K
The Three-Chamber Choice Behavioral Task using Zebrafish as a Model System
07:55

The Three-Chamber Choice Behavioral Task using Zebrafish as a Model System

Published on: April 14, 2021

3.8K

Area of Science:

  • Software Engineering
  • Empirical Software Engineering
  • Defect Analysis

Background:

  • Software quality research often relies on historical defect data.
  • The SZZ algorithm is a prevailing technique for identifying bug-introducing commits based on code modifications.
  • Existing SZZ variants struggle with accuracy due to issues like tangled and ghost commits.

Purpose of the Study:

  • To investigate if bug discussion content can improve the accuracy of the SZZ algorithm.
  • To identify related and external files from bug discussions to enhance SZZ efficacy.
  • To address limitations of current SZZ methods in pinpointing defect origins.

Main Methods:

  • Leveraged manually linked bug reports from Mozilla developers.
  • Created the RoTEB dataset of 12,472 bug reports.
  • Manually inspected a sample of bug reports to assess file relevance for SZZ.
  • Augmented the SZZ algorithm with information from bug discussions and evaluated its performance.

Main Results:

  • Defined a taxonomy for developer references to files in bug discussions.
  • Observed that bug discussions frequently mention files beneficial for SZZ.
  • Validated that integrating file references from discussions improves SZZ precision in pinpointing bug-introducing commits.
  • Found no significant impact on SZZ recall.

Conclusions:

  • Bug discussions offer valuable information for enhancing SZZ algorithm precision.
  • The RoTEB dataset provides a resource for future research on defect analysis.
  • Further exploration is needed to address tangled and ghost commits effectively.