Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

14.8K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
14.8K
Hardy-Weinberg Principle01:49

Hardy-Weinberg Principle

74.6K
Diploid organisms have two alleles of each gene, one from each parent, in their somatic cells. Therefore, each individual contributes two alleles to the gene pool of the population. The gene pool of a population is the sum of every allele of all genes within that population and has some degree of variation. Genetic variation is typically expressed as a relative frequency, which is the percentage of the total population that has a given allele, genotype or phenotype.
74.6K
Test for Homogeneity01:23

Test for Homogeneity

2.1K
The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...
2.1K
What is Population Genetics?01:25

What is Population Genetics?

60.9K
A population is composed of members of the same species that simultaneously live and interact in the same area. When individuals in a population breed, they pass down their genes to their offspring. Many of these genes are polymorphic, meaning that they occur in multiple variants. Such variations of a gene are referred to as alleles. The collective set of all the alleles within a population is known as the gene pool.
60.9K
Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

4.4K
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
4.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Niche Overlap Is Not Enough: Same Overlap, Contrasting Fluctuations.

Ecology letters·2026
Same authorSame journal

Classical HLA allele and haplotype frequency estimates in US populations.

Human immunology·2026
Same author

The Phenotype List String Grammar for Enhanced Protein and Antigen Reporting in the Immunogenetic Context.

HLA·2026
Same author

Retraction notice to "Allele and Haplotype Frequencies in High-Resolution Typing of Human Leukocyte Antigen -A, -B, -C, -DRB1, -DQB1 and -DPB1 in Thai Voluntary Stem Cell Donors" [Transplantation and Cellular Therapy 31 (2025) 546-548].

Transplantation and cellular therapy·2026
Same author

Autoreactomes in Healthy Individuals Vary According to HLA Class II Genotype.

bioRxiv : the preprint server for biology·2026
Same author

Classical HLA Allele and Haplotype Frequency Estimates in US Populations.

bioRxiv : the preprint server for biology·2026
Same journal

The functional relevance of human leukocyte antigens in the development of cord blood derived advanced cellular therapies: A new direction of public cord blood banks.

Human immunology·2026
Same journal

From educational to graded: redefining proficiency testing in ABO genotyping.

Human immunology·2026
Same journal

Recurrent spontaneous abortion: integrated bioinformatics and histological validation identify FURIN and NEDD4 as ferroptosis-associated endometrial markers.

Human immunology·2026
Same journal

A perspective on ABO incompatible transplantation.

Human immunology·2026
Same journal

High KIR diversity in Uganda and Botswana children living with HIV.

Human immunology·2026
See all related articles

Related Experiment Video

Updated: Oct 26, 2025

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.5K

HLA haplotype frequency estimation for heterogeneous populations using a graph-based imputation algorithm.

Sapir Israeli1, Loren Gragert2, Martin Maiers3

  • 1Department of Computer Science, Bar Ilan University, Ramat Gan, Israel.

Human Immunology
|July 30, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces a new multi-region Expectation-Maximization (EM) algorithm for Human Leukocyte Antigen (HLA) haplotype frequency estimation. This method accurately includes individuals with mixed ethnic backgrounds in population genetics models.

Keywords:
HLAHaplotype frequenciesMulti-region expectation-maximization algorithm

More Related Videos

Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3
11:10

Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3

Published on: December 27, 2010

12.5K
Characterizing Mutational Load and Clonal Composition of Human Blood
07:58

Characterizing Mutational Load and Clonal Composition of Human Blood

Published on: July 11, 2019

7.6K

Related Experiment Videos

Last Updated: Oct 26, 2025

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.5K
Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3
11:10

Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3

Published on: December 27, 2010

12.5K
Characterizing Mutational Load and Clonal Composition of Human Blood
07:58

Characterizing Mutational Load and Clonal Composition of Human Blood

Published on: July 11, 2019

7.6K

Area of Science:

  • Population Genetics
  • Immunogenetics
  • Bioinformatics

Background:

  • Estimating Human Leukocyte Antigen (HLA) haplotype frequencies from unphased genotyping data typically uses Expectation-Maximization (EM) algorithms.
  • Current methods assume Hardy-Weinberg Equilibrium (HWE) and exclude individuals of mixed or unknown ethnic backgrounds.
  • Existing HLA imputation and matching systems struggle with multi-region populations and admixture, limiting their utility for diverse donor registries.

Purpose of the Study:

  • To develop a novel multi-region Expectation-Maximization (EM) algorithm for HLA haplotype frequency estimation.
  • To integrate individuals with ambiguous or mixed ethnic backgrounds into HLA population genetics models.
  • To improve HLA imputation and matching for diverse, multi-region populations.

Main Methods:

  • Expanded the GRaph IMputation and Matching (GRIMM) framework by integrating imputation into an iterative EM algorithm.
  • Developed a multi-region EM implementation that treats geographic region as a Bayesian prior.
  • Enabled integration of HLA data from multiple single-region populations, accommodating admixture.

Main Results:

  • The multi-region EM approach demonstrated significantly higher likelihood values compared to single-region EM implementations.
  • Improved HLA haplotype recovery was observed, as measured by Kullback-Leibler divergence.
  • The method performed effectively on both real-world US donor registry data and simulated multi-region datasets.

Conclusions:

  • The novel multi-region EM algorithm successfully incorporates individuals with mixed or ambiguous ethnic backgrounds.
  • This approach enhances the accuracy of HLA haplotype frequency estimation in diverse populations.
  • The enhanced GRIMM framework offers improved HLA imputation and matching capabilities for multi-ethnic donor registries.