Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms.

Shuying Sue Li1, Najma Khalid, Christopher Carlson

  • 1Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.

Biostatistics (Oxford, England)
|October 15, 2003
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Profiling Associations Between IGHG-FCGR Ligand-Receptor Interactions and Disease Progression From Stage 1 and 2 to Stage 3 Type 1 Diabetes.

Diabetes·2025
Same author

Correction: Profiling associations of interactive ligand-receptors (HLA class I and KIR gene products) with the progression to type 1 diabetes among seroconverted participants.

Diabetologia·2025
Same author

Neutralizing antibody responses over time in a demographically and clinically diverse cohort of individuals recovered from SARS-CoV-2 acquisition in Africa: A cohort study.

PLOS global public health·2025
Same author

Profiling associations of interactive ligand-receptors (HLA class I and KIR gene products) with the progression to type 1 diabetes among seroconverted participants.

Diabetologia·2025
Same author

Minimal residual disease status predicts outcomes in patients with follicular lymphoma treated with chemoimmunotherapy on the SWOG S0016 trial.

Haematologica·2025
Same author

Distinct immune responses in people living with HIV following SARS-CoV-2 recovery.

Communications medicine·2025
Same journal

A Bayesian functional concurrent zero-inflated Dirichlet-multinomial regression model with application to infant microbiome.

Biostatistics (Oxford, England)·2026
Same journal

Towards optimal environmental policies: policy learning under arbitrary bipartite network interference.

Biostatistics (Oxford, England)·2026
Same journal

Multilevel functional quantile principal component analysis.

Biostatistics (Oxford, England)·2026
Same journal

Adaptive transfer learning for time-to-event modeling with applications in disease risk assessment.

Biostatistics (Oxford, England)·2026
Same journal

High-dimensional test for one-sided hypotheses.

Biostatistics (Oxford, England)·2026
Same journal

NBSR: a Negative Binomial Softmax Regression model for microRNA-seq data analysis.

Biostatistics (Oxford, England)·2026
See all related articles

This study introduces a novel computational method for estimating haplotype frequencies from large single nucleotide polymorphism (SNP) datasets. The new approach efficiently handles numerous SNPs and subjects, overcoming limitations of existing expectation-maximization (EM) algorithms.

Area of Science:

  • Genetics
  • Computational Biology
  • Statistical Genomics

Background:

  • Estimating haplotype frequencies is crucial for mapping complex disease genes using millions of single nucleotide polymorphisms (SNPs).
  • Existing expectation-maximization (EM) algorithms and Bayesian methods face computational challenges with large numbers of SNPs and subjects.
  • Previous Bayesian methods, like progressive ligation (PL), have limitations on the number of subjects that can be analyzed.

Purpose of the Study:

  • To develop a novel computational method for accurate haplotype frequency estimation in large-scale genetic datasets.
  • To overcome the computational limitations of existing methods when dealing with a high density of SNPs and a large number of individuals.
  • To provide an efficient and robust method for haplotype inference that accommodates missing genotype data.

Related Experiment Videos

Main Methods:

  • Utilized the likelihood formulation from Excoffier and Slatkin's EM algorithm.
  • Applied modified estimating equation principles and the progressive ligation (PL) computational algorithm.
  • Employed sandwich-estimates from the estimating equation for efficient standard error estimation, replacing bootstrap methods.
  • Incorporated handling of missing genotype data under the missing at random assumption.

Main Results:

  • The proposed method effectively handles large datasets with numerous SNPs and subjects.
  • Efficient estimation of standard errors was achieved using the sandwich-estimate.
  • The method produces valid parameter and standard error estimates even with missing genotype data.
  • Successfully addressed computational challenges posed by large SNP datasets.

Conclusions:

  • The new method offers a computationally efficient and scalable solution for haplotype frequency estimation.
  • It provides a robust framework for analyzing large genetic datasets, including those with missing data.
  • This advancement facilitates more accurate genetic mapping of complex diseases by improving haplotype inference capabilities.