Kernel density estimation of allele frequency including undetected alleles
View abstract on PubMed
Summary
This summary is machine-generated.Kernel density estimation (KDE) was explored for estimating genetic diversity using undetected alleles. However, this method showed worse bias and accuracy for both allele frequency and nucleotide diversity, suggesting it is not beneficial for genetic diversity estimation.
Area Of Science
- Population Genetics
- Bioinformatics
- Statistical Ecology
Background
- Undetected species impact species diversity estimation.
- Undetected alleles have not been utilized for genetic diversity estimation.
- While random sampling provides unbiased estimates, undetected alleles could offer biased yet more precise conservation insights.
Purpose Of The Study
- To develop and evaluate Kernel Density Estimation (KDE) for allele frequency, incorporating undetected alleles.
- To assess the utility of KDE in estimating allele frequency and nucleotide diversity.
Main Methods
- Kernel Density Estimation (KDE) was applied to allele frequency data.
- The method was tested using coalescent simulations and real population genetic data.
Main Results
- Contrary to expectations, KDE resulted in increased bias and reduced accuracy for nucleotide diversity estimation.
- KDE-based allele frequency estimation was generally worse, except in cases with very small sample sizes.
- Potential reasons for poor performance include population finiteness and the curse of dimensionality.
Conclusions
- Kernel Density Estimation (KDE) of allele frequency does not improve genetic diversity estimation.
- The approach of including undetected alleles via KDE is not currently beneficial for population genetic analysis.
Related Concept Videos
Diploid organisms have two alleles of each gene, one from each parent, in their somatic cells. Therefore, each individual contributes two alleles to the gene pool of the population. The gene pool of a population is the sum of every allele of all genes within that population and has some degree of variation. Genetic variation is typically expressed as a relative frequency, which is the percentage of the total population that has a given allele, genotype or phenotype.
In the early 20th century,...
A population is composed of members of the same species that simultaneously live and interact in the same area. When individuals in a population breed, they pass down their genes to their offspring. Many of these genes are polymorphic, meaning that they occur in multiple variants. Such variations of a gene are referred to as alleles. The collective set of all the alleles within a population is known as the gene pool.
While some alleles of a given gene might be observed commonly, other variants...
In a population that is not at Hardy-Weinberg equilibrium, the frequency of alleles changes over time. Therefore, any deviations from the five conditions of Hardy-Weinberg equilibrium can alter the genetic variation of a given population. Conditions that change the genetic variability of a population include mutations, natural selection, non-random mating, gene flow, and genetic drift (small population size).
Mechanisms of Genetic Variation
The original sources of genetic variation are...
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).
Hence, the expected frequency of any number appearing when casting a die...
Natural selection—probably the most well-known evolutionary mechanism—increases the prevalence of traits that enhance survival and reproduction. However, evolution does not merely propagate favorable traits, nor does it always benefit populations.
Life is not fair. A deer grazing contentedly in a field can have her meal cut tragically short by a bolt of lightning. If the doomed doe is one of only three in the population, 1/3 of the population’s gene pool is lost. Random...
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...

