Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

13.4K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
13.4K
Genomics02:02

Genomics

36.3K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
36.3K
Behavioral Genetics and Its Designs01:23

Behavioral Genetics and Its Designs

366
Behavior genetics explores how genetic inheritance influences human behavior. It focuses on how genes, passed from parents to offspring, contribute to the development of behavioral traits and tendencies. This branch of genetics seeks to understand the complex interplay between inherited genetic factors and environmental influences in shaping our behaviors.
The primary methodologies used in behavior genetics include family studies, twin studies, and adoption studies, each providing unique...
366
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.7K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.7K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

18.8K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
18.8K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

2.4K
2.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

STELLAR: A flexible ensemble learning framework integrating rare variants to enhance polygenic risk prediction.

medRxiv : the preprint server for health sciences·2026
Same authorSame journal

Statistics and AI - A Fireside Conversation.

Harvard data science review·2026
Same author

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026
Same author

TESTING FOR THE CAUSAL MEDIATION EFFECTS OF MULTIPLE MEDIATORS USING THE KERNEL MACHINE DIFFERENCE METHOD IN GENOME-WIDE EPIGENETIC STUDIES.

The annals of applied statistics·2026
Same author

EBV strain interacts with host HLA to drive nasopharyngeal carcinoma risk.

Nature·2026
Same author

Rare coding and noncoding variants map 1,342 diseases and biomarkers in 490,549 whole genomes.

medRxiv : the preprint server for health sciences·2026
Same journal

Data Science in Public Health: Building Next Generation Capacity.

Harvard data science review·2026
Same journal

Expanding the Data Science Toolkit: Mixed Methods for Improving Causal Inference.

Harvard data science review·2025
Same journal

Assessing the prognostic utility of clinical and radiomic features for COVID-19 patients admitted to ICU: challenges and lessons learned.

Harvard data science review·2024
Same journal

The Role of Academia in Data Science Education.

Harvard data science review·2024
Same journal

Why Do I Get Side Effects? Personalized (N-of-1) Trials for Statin Intolerance and the Nocebo Effect.

Harvard data science review·2024
See all related articles

Related Experiment Video

Updated: Jul 3, 2025

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

3.7K

Learning Lessons on Reproducibility and Replicability in Large Scale Genome-Wide Association Studies.

Xihong Lin1

  • 1Department of Biostatistics and Department of Statistics, Harvard University.

Harvard Data Science Review
|February 16, 2024
PubMed
Summary
This summary is machine-generated.

Enhancing scientific research requires robust reproducibility and replicability, particularly in large-scale genome-wide association studies. This involves transparent data handling, standardized analysis, and collaborative frameworks to ensure reliable scientific findings.

Keywords:
Analysis reproducibilityAnalysis standardizationBatch effectsCollaborative frameworkCommunity culture buildingData repositoriesData reproducibilityData standardization and harmonizationMulti-phase designOpen sciencePartnershipResult replicabilitySelection biasStatistical inferenceStudy designTransparency

More Related Videos

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.2K
Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

39.0K

Related Experiment Videos

Last Updated: Jul 3, 2025

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

3.7K
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.2K
Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

39.0K

Area of Science:

  • Genetics
  • Bioinformatics
  • Computational Biology

Background:

  • Reproducibility and replicability are foundational principles in scientific research.
  • Large-scale genome-wide association studies (GWAS) present unique challenges for ensuring robust results.

Purpose of the Study:

  • To emphasize the critical importance of enhancing data reproducibility, analysis reproducibility, and result replicability in GWAS.
  • To provide actionable recommendations for improving the reliability of GWAS findings.

Main Methods:

  • Review and reflection on current practices in large-scale GWAS.
  • Formulation of recommendations for study design, data management, and analysis protocols.

Main Results:

  • Identification of key areas for improvement: study design (batch effects, selection bias), distinct discovery/replication phases, and adequate sample size.
  • Emphasis on systematic, transparent data pipelines and standardized, field-specific analysis protocols.
  • Recommendations for collaborative frameworks, open-access tools, and data/resource sharing infrastructure.

Conclusions:

  • Implementing proposed strategies will significantly enhance the reproducibility and replicability of GWAS.
  • Fostering a culture of reproducible research through incentives and partnerships among researchers, funders, and journals is crucial.