Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Weighted Mean00:57

Weighted Mean

5.5K
While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...
5.5K
Survival Tree01:19

Survival Tree

499
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
499
Multiple Regression01:25

Multiple Regression

3.3K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.3K
Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Prediction Intervals01:03

Prediction Intervals

2.5K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.5K
Randomized Experiments01:13

Randomized Experiments

6.3K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
6.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Trans-ancestry genome-wide association meta-analysis of antidepressant response to selective serotonin reuptake inhibitors in clinical studies of depression.

medRxiv : the preprint server for health sciences·2026
Same author

Correction: Thyroid peroxidase antibodies in bipolar disorder: implications for clinical sub-phenotypes and lithium response.

International journal of bipolar disorders·2026
Same author

The Distinct Role of Family History and Polygenic Risk Scores of Psychiatric Disorders on Lithium Response in Bipolar Disorder.

Bipolar disorders·2026
Same author

Accelerated biological aging in bipolar disorder as determined by artificial intelligence-based electrocardiographic assessment.

Journal of affective disorders·2026
Same author

Genotype epigenome phenotype integration reveals peripheral immune contributions to type I bipolar disorder.

Nature communications·2026
Same author

The GA4GH Categorical Variation Representation Specification: A Unified Computational Framework for Reasoning over Genomic Variant Categories.

bioRxiv : the preprint server for biology·2026
Same journal

Extracting Genetically-Imputed Causal Features From ECG Data.

Statistical analysis and data mining·2026
Same journal

Triangulation-Based Spatial Clustering for Adjacent Data With Heterogeneous Density.

Statistical analysis and data mining·2026
Same journal

Bayesian Posterior Interval Calibration to Improve the Interpretability of Observational Studies.

Statistical analysis and data mining·2025
Same journal

A treeless absolutely random forest with closed-form estimators of expected proximities.

Statistical analysis and data mining·2024
Same journal

Data-driven Stochastic Model for Quantifying the Interplay Between Amyloid-beta and Calcium Levels in Alzheimer's Disease.

Statistical analysis and data mining·2024
Same journal

A tree-based gene-environment interaction analysis with rare features.

Statistical analysis and data mining·2023
See all related articles

Related Experiment Videos

A Weighted Random Forests Approach to Improve Predictive Performance.

Stacey J Winham1, Robert R Freimuth1, Joanna M Biernacka2

  • 1Department of Health Sciences Research, Mayo Clinic, 200 First Street Southwest, Rochester, MN 55905 USA.

Statistical Analysis and Data Mining
|February 7, 2014
PubMed
Summary
This summary is machine-generated.

Weighted Random Forests (wRF) offer modest improvements over standard Random Forests (RF) for identifying genetic risk factors in high-dimensional data. However, current wRF may not be effective for complex genetic models common in complex disease research.

Keywords:
Random Forestsgene-gene interactionsgenetic datagenome wide associationhigh-dimensional dataweighting

Related Experiment Videos

Area of Science:

  • Genetics
  • Bioinformatics
  • Computational Biology

Background:

  • Identifying genetic variants for complex diseases is challenging due to high-dimensional data.
  • Gene-gene interactions are often overlooked in genetic analyses.
  • Standard Random Forests (RF) struggle to detect risk factors involved in complex genetic models without strong marginal effects.

Purpose of the Study:

  • To propose Weighted Random Forests (wRF) as an extension of RF.
  • To improve the identification of genetic risk factors in high-dimensional datasets.
  • To address limitations of RF in detecting gene-gene interactions.

Main Methods:

  • Developed Weighted Random Forests (wRF) by incorporating tree-level weights.
  • Weighted trees to emphasize accuracy in prediction and variable importance calculations.
  • Evaluated wRF using simulations and real-world genetic addiction study data.

Main Results:

  • wRF demonstrated modest performance improvements over RF in high-dimensional data.
  • Improvements were most notable in scenarios with larger effect sizes.
  • The method showed potential but with limitations for realistic complex disease genetics.

Conclusions:

  • The current wRF implementation may not significantly enhance the detection of relevant predictors in high-dimensional genetic data.
  • wRF might be applicable in other fields anticipating larger effect sizes.
  • Further research may be needed to adapt wRF for complex genetic etioologies.