Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Predicting Reaction Outcomes02:24

Predicting Reaction Outcomes

10.8K
Kinetics describes the rate and path by which a reaction occurs. In contrast, thermodynamics deals with state functions and describes the properties, behavior, and components of a system. It is not concerned with the path taken by the process and cannot address the rate at which a reaction occurs. Although it does provide information about what can happen during a reaction process, it does not describe the detailed steps of what appears on an atomic or a molecular level. On the other hand,...
10.8K
Random Error01:04

Random Error

9.8K
Random or indeterminate errors originate from various uncontrollable variables, such as variations in environmental conditions, instrument imperfections, or the inherent variability of the phenomena being measured. Usually, these errors cannot be predicted, estimated, or characterized because their direction and magnitude often vary in magnitude and direction even during consecutive measurements. As a result, they are difficult to eliminate. However, the aggregate effect of these errors can be...
9.8K
Random Variables01:09

Random Variables

17.8K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
17.8K
Randomized Experiments01:13

Randomized Experiments

9.0K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
9.0K
Survival Curves01:18

Survival Curves

698
Survival curves are graphical representations that depict the survival experience of a population over time, offering an intuitive way to track the proportion of individuals who remain event-free at each time point. These curves are widely used in fields such as medicine, public health, and reliability engineering to visualize and compare survival probabilities across different groups or conditions.
The Kaplan-Meier estimator is the most common method for constructing survival curves. This...
698
Survival Tree01:19

Survival Tree

418
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
418

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

[Preparation and characterization of anti-human B7-H4 monoclonal antibodies].

Xi bao yu fen zi mian yi xue za zhi = Chinese journal of cellular and molecular immunology·2011
Same author

Cardioprotective effects of salvianolic Acid a on myocardial ischemia-reperfusion injury in vivo and in vitro.

Evidence-based complementary and alternative medicine : eCAM·2011
Same author

Knock-down of ubiquitin-specific protease 22 by micro-RNA interference inhibits colorectal cancer growth.

International journal of colorectal disease·2011
Same author

Abundance and diversity of Sphingomonas in Shenfu petroleum-wastewater irrigation zone, China.

Environmental science and pollution research international·2011
Same author

Up-regulation of cyclin D1 by JNK1/c-Jun is involved in tumorigenesis of human embryo lung fibroblast cells induced by a low concentration of arsenite.

Toxicology letters·2011
Same author

Implantation of a porcine acellular dermal graft in a primate model of rotator cuff repair.

Journal of shoulder and elbow surgery·2011
Same journal

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

fastkqr: A Fast Algorithm for Kernel Quantile Regression.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Joint Registration and Conformal Prediction for Partially Observed Functional Data.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Efficient Decision Trees for Tensor Regressions.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Distributed Nonparametric Regression with Heterogeneity Through Prediction-Based Aggregation.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
See all related articles

Related Experiment Video

Updated: Jan 29, 2026

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.4K

A Modified Random Survival Forests Algorithm for High Dimensional Predictors and Self-Reported Outcomes.

Hui Xu1, Xiangdong Gu1, Mahlet G Tadesse2

  • 1Department of Biostatistics and Epidemiology, University of Massachusetts Amherst, Amherst, MA 01003.

Journal of Computational and Graphical Statistics : a Joint Publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America
|February 16, 2019
PubMed
Summary
This summary is machine-generated.

This study introduces a new algorithm for selecting important variables in large datasets with inaccurate time-to-event outcomes. The method aids in discovering genetic associations, such as with Type II diabetes.

Keywords:
High Dimensional DataInterval CensoringRandom Survival ForestsSelf-reportsVariable Selection

More Related Videos

Collecting and Processing Drone-based Remotely Sensed Data for Use in Forest Recovery Monitoring
08:16

Collecting and Processing Drone-based Remotely Sensed Data for Use in Forest Recovery Monitoring

Published on: October 24, 2025

553
Methods of Soil Resampling to Monitor Changes in the Chemical Concentrations of Forest Soils
09:16

Methods of Soil Resampling to Monitor Changes in the Chemical Concentrations of Forest Soils

Published on: November 25, 2016

17.4K

Related Experiment Videos

Last Updated: Jan 29, 2026

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.4K
Collecting and Processing Drone-based Remotely Sensed Data for Use in Forest Recovery Monitoring
08:16

Collecting and Processing Drone-based Remotely Sensed Data for Use in Forest Recovery Monitoring

Published on: October 24, 2025

553
Methods of Soil Resampling to Monitor Changes in the Chemical Concentrations of Forest Soils
09:16

Methods of Soil Resampling to Monitor Changes in the Chemical Concentrations of Forest Soils

Published on: November 25, 2016

17.4K

Area of Science:

  • Biostatistics
  • Epidemiology
  • Bioinformatics

Background:

  • High-dimensional datasets in epidemiology often feature time-to-event outcomes with measurement error.
  • Accurate variable selection is crucial for identifying risk factors in large-scale studies like the Women's Health Initiative.
  • Imperfect outcome data is also common in electronic medical records.

Purpose of the Study:

  • To develop and evaluate an ensemble tree-based algorithm for variable selection with error-prone time-to-event outcomes.
  • To address challenges in analyzing observational data where outcomes are not precisely recorded.
  • To facilitate the discovery of significant variables in complex datasets.

Main Methods:

  • An ensemble tree-based algorithm was developed for variable selection.
  • The algorithm handles high-dimensional data with time-to-event outcomes observed with error.
  • Performance was assessed using simulation studies with continuous and categorical covariates.

Main Results:

  • The proposed algorithm demonstrates effectiveness in variable selection for datasets with imperfect time-to-event outcomes.
  • Simulation studies confirmed the algorithm's performance across different covariate types.
  • The approach successfully identified single nucleotide polymorphisms associated with incident Type II diabetes in the Women's Health Initiative cohort.

Conclusions:

  • The developed ensemble tree-based algorithm provides a robust method for variable selection in the presence of outcome error.
  • This approach is applicable to various settings, including large epidemiologic studies and electronic medical record data.
  • An R package, 'icRSF', is available for implementing these novel methods.