Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Randomized Experiments01:13

Randomized Experiments

8.8K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
8.8K
Wald-Wolfowitz Runs Test I01:17

Wald-Wolfowitz Runs Test I

941
The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...
941
Random Sampling Method01:09

Random Sampling Method

14.1K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
14.1K
Random Variables01:09

Random Variables

17.4K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
17.4K
Genetic Screens02:46

Genetic Screens

5.6K
Genetic screens are tools used to identify genes and mutations responsible for phenotypes of interest. Genetic screens help identify individuals or a group of people at risk of developing  genetic diseases and help them with early intervention, targeted therapy, and reproductive options.
Forward genetic screens
Forward or “classical” genetic screens involve creating random mutations in an organism’s DNA using radiation, mutagens, or insertion of additional bases, which...
5.6K
Factorial Design02:01

Factorial Design

13.7K
Factorial Analysis is an experimental design that applies Analysis of Variance (ANOVA) statistical procedures to examine a change in a dependent variable due to more than one independent variable, also known as factors. Changes in worker productivity can be reasoned, for example, to be influenced by salary and other conditions, such as skill level. One way to test this hypothesis is by categorizing salary into three levels (low, moderate, and high) and skills sets into two levels (entry level...
13.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Genome wide association study meta-analysis of neuropathologic lesions of Alzheimer's disease and related dementias in a multi-site autopsy cohort.

PLoS genetics·2026
Same author

Putative Imprinting Control Regions with Aberrant Blood-Based DNA Methylation are Associated with Hepatocellular Carcinoma Risk.

Journal of hepatocellular carcinoma·2026
Same author

Placental toxic metal concentrations and preterm birth: Modification by social stressors.

Reproductive toxicology (Elmsford, N.Y.)·2026
Same author

Environmental exposures and epigenetic alterations in common chronic diseases: insights and challenges.

Environmental epigenetics·2026
Same author

Early life factors, blood pressure percentile trajectories, and elevated blood pressure in children.

Pediatric research·2026
Same author

Rare coding variants from ADSP R5 whole-genome sequencing implicate novel genes in Alzheimer's disease.

Research square·2026
Same journal

Applying Bayesian Multivariable Mendelian Randomisation to Prioritise Candidate Causal Traits From High-Dimensional Data: Illustration From Estimation of the Effect of Maternal Metabolites on Offspring Birthweight.

Genetic epidemiology·2026
Same journal

Individualized Bayesian Inference Identifies Novel Genetic Variants for Parkinson's Disease.

Genetic epidemiology·2026
Same journal

DRIVE v3: Command Line Application for Identity-by-Descent Haplotype Clustering in Large Biobank Scale Data.

Genetic epidemiology·2026
Same journal

Deep Unsupervised Domain Adaptation for Translating Cancer Dependency Maps From Cell Lines to Breast Cancer Tumor Genomics.

Genetic epidemiology·2026
Same journal

Polygenic Risk Scores for Incident Dementia in the Multi-Ethnic Study of Atherosclerosis.

Genetic epidemiology·2026
Same journal

Outcome and Exposure Polygenic Risk Scores Can Help Reduce Information Bias and Selection Bias in Regression Estimates From Biobank Data.

Genetic epidemiology·2026
See all related articles

Related Experiment Video

Updated: Jan 14, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.5K

Exploring Random Forest in Genetic Risk Score Construction.

Vaishnavi Venkat1, Kaylyn Clark2,3, X Jessie Jeng4

  • 1Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA.

Genetic Epidemiology
|October 25, 2025
PubMed
Summary
This summary is machine-generated.

Random forest (RF) models offer a novel approach to genetic risk scores (GRS), effectively capturing complex genetic interactions for traits. New RF-based GRS strategies, like ctRF, outperform traditional methods for complex diseases.

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.7K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K

Related Experiment Videos

Last Updated: Jan 14, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.5K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.7K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K

Area of Science:

  • Genetics
  • Bioinformatics
  • Computational Biology

Background:

  • Genetic risk scores (GRS) traditionally assess genetic liability using additive models.
  • Complex traits often involve nonadditive genetic interactions (epistasis) missed by conventional GRS.
  • Random forest (RF) models can capture nonlinear interactions, offering a potential improvement.

Purpose of the Study:

  • To investigate RF models for constructing GRS capable of capturing complex genetic interactions.
  • To introduce and evaluate novel RF-based GRS strategies (ctRF and wRF).
  • To assess the performance of RF-GRS compared to traditional methods using simulations and real data.

Main Methods:

  • Developed two RF-based GRS strategies: ctRF (optimizing LD clumping and p-value thresholds) and wRF (adjusting SNP inclusion).
  • Employed simulation studies to test GRS performance under various genetic architectures.
  • Applied RF-GRS methods to real-world data for Alzheimer's disease, body mass index, and atopy.

Main Results:

  • The ctRF strategy consistently outperformed other RF-based methods and classical additive models.
  • RF-based GRS demonstrated superior performance for traits with complex genetic architectures.
  • Incorporating base data into RF-GRS construction improved predictive accuracy.

Conclusions:

  • RF-based GRS effectively capture intricate genetic interactions, including nonlinear effects.
  • ctRF presents a robust alternative to traditional GRS for complex traits.
  • RF models offer a powerful, model-free approach for enhanced genetic risk prediction.