Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

464
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
464
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.5K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.5K
End Point Prediction: Gran Plot01:07

End Point Prediction: Gran Plot

1.4K
A Gran plot is used to predict the equivalence volume or endpoint of a potentiometric or acid-base titration without reaching the endpoint. Typically, titration data is collected as a function of the titrant's volume up to a point less than the equivalence volume and then transformed into a linear format. The straight line is extended to the x-axis, indicating the necessary titrant volume to achieve the equivalence point.
For potentiometric titration, the Gran plot is created by plotting...
1.4K
Censoring Survival Data01:09

Censoring Survival Data

630
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
630
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

1.8K
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
1.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Antibiotic resistance genes in surface water of eutrophic urban lakes are related to heavy metals, antibiotics, lake morphology and anthropic impact.

Ecotoxicology (London, England)·2017
Same author

HER2 assessment in locally advanced gastric cancer: comparing the results obtained with the use of two primary tumour blocks versus those obtained with the use of all primary tumour blocks.

Histopathology·2017
Same author

Inflammatory microRNA-194 and -515 attenuate the biosynthesis of chondroitin sulfate during human intervertebral disc degeneration.

Oncotarget·2017
Same author

Soil Acidification Aggravates the Occurrence of Bacterial Wilt in South China.

Frontiers in microbiology·2017
Same author

Is the Prophylactic Use of Hepatoprotectants Necessary in Anti-Tuberculosis Treatment?

Chemotherapy·2017
Same author

Light-induced aggregation of microbial exopolymeric substances.

Chemosphere·2017
Same journal

A SEQUENTIAL SIGNIFICANCE TEST FOR TREATMENT BY COVARIATE INTERACTIONS.

Statistica Sinica·2026
Same journal

DEFINING AND ESTIMATING PRINCIPAL STRATUM SPECIFIC NATURAL MEDIATION EFFECTS WITH SEMI-COMPETING RISKS DATA.

Statistica Sinica·2026
Same journal

Longitudinal Modeling of Rank-based Global Outcome.

Statistica Sinica·2026
Same journal

INTEGRATING INCOMPLETE DATA FOR MEDIATION ANALYSIS.

Statistica Sinica·2026
Same journal

COMMUNITY EXTRACTION OF NETWORK DATA UNDER STOCHASTIC BLOCK MODELS.

Statistica Sinica·2026
Same journal

STATISTICAL INFERENCE FOR MEAN FUNCTIONS OF COMPLEX 3D OBJECTS.

Statistica Sinica·2025
See all related articles

Related Experiment Videos

Prediction-based Termination Rule for Greedy Learning with Massive Data.

Chen Xu1, Shaobo Lin2, Jian Fang2

  • 1The Pennsylvania State University.

Statistica Sinica
|May 10, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces a novel termination rule for the orthogonal greedy algorithm (OGA), enhancing its efficiency for massive datasets in statistical learning. The new method ensures accurate and fast predictions, crucial for handling big data challenges.

Keywords:
Forward regressionGreedy algorithmsKernel methodsMassive dataNonparametric regressionSparse modeling

Related Experiment Videos

Area of Science:

  • Machine Learning
  • Statistical Learning
  • Computational Statistics

Background:

  • Massive datasets are prevalent in modern scientific research.
  • Classical regression methods face computational challenges with large sample sizes.
  • Orthogonal Greedy Algorithm (OGA) offers an efficient kernel-based learning alternative.

Purpose of the Study:

  • To propose a new, efficient termination rule for OGA.
  • To improve prediction accuracy and speed in large-scale statistical learning.
  • To address the computational cost of regression with massive data.

Main Methods:

  • Investigated the predictive performance of OGA.
  • Developed a novel termination rule based on predictive performance.
  • Utilized a sample-dependent kernel dictionary.
  • Analyzed convergence rates and consistency.

Main Results:

  • The proposed termination rule suggests an optimal number of essential updates for OGA.
  • Demonstrated strong consistency with an O(log n) convergence rate to the oracle prediction.
  • The method is computationally efficient for massive datasets.
  • Validated through simulations and real-world data.

Conclusions:

  • The new termination rule simplifies OGA implementation and enhances efficiency for big data.
  • The method achieves strong consistency and a favorable convergence rate.
  • This approach offers a practical solution for fast and accurate prediction in large-scale statistical learning.