Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Estimating Population Mean with Unknown Standard Deviation01:22

Estimating Population Mean with Unknown Standard Deviation

7.7K
In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...
7.7K
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

133
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
133
Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

4.1K
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
4.1K
Polygenic Traits01:18

Polygenic Traits

65.9K
When more than one gene is responsible for a given phenotype, the trait is considered polygenic. Human height is a polygenic trait. Studies have uncovered hundreds of loci that influence height, and there are believed to be many more. Due to the high number of genes involved, as well as environmental and nutritional factors, height varies significantly within a given population. The distribution of height forms a bell-shaped curve, with relatively few individuals in the population at the...
65.9K
Estimating Population Standard Deviation01:26

Estimating Population Standard Deviation

3.0K
When the population standard deviation is unknown and the sample size is large, the sample standard deviation s is commonly used as a point estimate of σ. However, it can sometimes under or overestimate the population standard deviation. To overcome this drawback, confidence intervals are determined to estimate population parameters and eliminate any calculation bias accurately. However, this only applies to random samples from normally distributed populations. Knowing the sample mean and...
3.0K
Estimating Population Mean with Known Standard Deviation01:16

Estimating Population Mean with Known Standard Deviation

8.4K
To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate +...
8.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Response to correspondence on 'Upadacitinib vs adalimumab in patients with rheumatoid arthritis and a prior inadequate response or intolerance to a tumour necrosis factor inhibitor: 12-week results from the randomised, double-blind, SELECT-SWITCH study' by Mysler et al.

Annals of the rheumatic diseases·2026
Same author

MR2G: A novel framework for causal network inference using GWAS summary data.

PLoS genetics·2026
Same author

Extracting Genetically-Imputed Causal Features From ECG Data.

Statistical analysis and data mining·2026
Same author

Upadacitinib vs adalimumab in patients with rheumatoid arthritis and a prior inadequate response or intolerance to a tumour necrosis factor inhibitor: 12-week results from the randomised, double-blind, SELECT-SWITCH study.

Annals of the rheumatic diseases·2026
Same author

Large-Scale Genotype-Based Trait Imputation With Multi-Ancestry GWAS Data.

Genetic epidemiology·2026
Same author

Co-expression-wide association studies link genetically regulated interactions with complex traits.

Nature communications·2025
Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026
Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026
Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026
Same journal

Beyond Fixed Thresholds: Optimizing Summaries of Wearable Device Data via Piecewise Linearization of Quantile Functions.

Statistics in medicine·2026
Same journal

A Causal Framework for Evaluating the Total Effect of Strategies Aiming to Expand Screening and to Improve Outcomes.

Statistics in medicine·2026
Same journal

Causal Effects on Nonterminal Event Time With Application to Antibiotic Usage and Future Resistance.

Statistics in medicine·2026
See all related articles

Related Experiment Video

Updated: Jul 9, 2025

Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

39.0K

Statistical inference with large-scale trait imputation.

Jingchen Ren1,2, Wei Pan2

  • 1School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA.

Statistics in Medicine
|December 1, 2023
PubMed
Summary
This summary is machine-generated.

A new method improves large-scale trait imputation by accounting for correlations between imputed values, relaxing a previous incorrect assumption. While the original method performed well, the new approach offers potential improvements in genetic analyses.

Keywords:
GWASLS-imputationSNPleast squareslinear models

More Related Videos

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

3.7K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K

Related Experiment Videos

Last Updated: Jul 9, 2025

Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

39.0K
Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

3.7K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K

Area of Science:

  • Genetics
  • Statistical Genetics
  • Bioinformatics

Background:

  • Large-scale trait imputation using Genome-Wide Association Study (GWAS) summary data and genotyped individuals is crucial for downstream genetic analyses.
  • The existing LS-imputation method assumes trait values are independent, which is a simplification that can impact accuracy.
  • Calculating the full covariance matrix for imputed trait values is computationally challenging due to large datasets.

Purpose of the Study:

  • To develop a method that accounts for the covariance matrix of imputed trait values in large-scale genetic analyses.
  • To relax the assumption of independence among imputed trait values, thereby improving the accuracy of downstream analyses.
  • To enhance the utility of GWAS summary data for individual-level genetic studies.

Main Methods:

  • Proposed a "divide and conquer/combine" strategy to estimate and incorporate the covariance matrix of imputed trait values.
  • Implemented batch processing to manage the computational complexity of covariance matrix estimation.
  • Applied the revised imputation method to UK Biobank data for marginal association analysis.

Main Results:

  • The new method showed some improvements in marginal association analysis compared to the original LS-imputation method in specific cases.
  • The original LS-imputation method demonstrated robust performance, attributed to near-constant variances and weak correlations among imputed values in the tested dataset.
  • The findings suggest that while the independence assumption is technically incorrect, its impact may be limited in datasets with specific covariance structures.

Conclusions:

  • The proposed "divide and conquer/combine" strategy offers a way to account for the covariance of imputed trait values, addressing a limitation of previous methods.
  • The practical benefits of the new method may vary depending on the characteristics of the trait and the dataset.
  • Further research is warranted to explore the performance of the improved imputation method across diverse genetic datasets and traits.