Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

z Scores and Area Under the Curve

z Scores and Area Under the Curve

z scores are the standardized values obtained after converting a normal distribution into a standard normal distribution. A z score is measured in units of the standard deviation. The z score tells you how many standard deviations the value x is above (to the right of) or below (to the left of) the mean, μ. Values of x that are larger than the mean have positive z scores, and values of x that are smaller than the mean have negative z scores. If x equals the mean, then x has a z score of...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Weighted Mean

Weighted Mean

While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A bounded hazard ratio Cox model for the effect of time to treatment on mortality.

Statistical methods in medical research·2026

Same author

Evaluating implementation of the HEARTS model for management of hypertension and diabetes in Guatemala: protocol for a prospective observational hybrid type 3 study.

Implementation science communications·2026

Same author

Surveillance of Pharmaceutical Risk-Mitigation Behavior: Applying and Comparing Statistical Process Control Methods Using Real World Data.

Learning health systems·2026

Same author

Accuracy of Surveys for Estimating Coverage for Hepatitis A and B Vaccinations in Adults.

Preventing chronic disease·2026

Same author

Gradient boosting-based discrete failure time model for selecting time-varying effects and interactions.

Lifetime data analysis·2026

Same author

Refining normoxemia targets in acute burn care: Reply.

The journal of trauma and acute care surgery·2026

Same journal

Design of Trials with Composite Endpoints with the R Package CompAREdesign.

Statistics in biosciences·2026

Same journal

Pan-Cancer Drug Response Prediction Using Integrative Principal Component Regression.

Statistics in biosciences·2026

Same journal

Variance Estimation for Weighted Average Treatment Effects.

Statistics in biosciences·2026

Same journal

Bayesian Modeling on Microbiome Data Analysis: Application to Subgingival Microbiome Study.

Statistics in biosciences·2026

Same journal

Canopy2: Tumor Phylogeny Inference by Bulk DNA and Single-Cell RNA Sequencing.

Statistics in biosciences·2026

Same journal

Multilevel Multivariate Functional Principal Component Analysis of Evoked and Induced Event-Related Spectral Perturbations.

Statistics in biosciences·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 9, 2026

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Locally Weighted Score Estimation for Quantile Classification in Binary Regression Models.

John D Rice¹, Jeremy M G Taylor¹

¹University of Michigan, Department of Biostatistics, 1415 Washington Heights, Ann Arbor, MI 48104, USA.

Statistics in Biosciences

|December 27, 2016

Summary

This summary is machine-generated.

This study introduces a new method for binary response regression, incorporating application-specific probability thresholds for improved classification accuracy. The locally weighted score approach enhances prediction for high- and low-risk groups, reducing error rates compared to traditional methods.

Keywords:

asymmetric loss binary classification local likelihood logistic regression robust estimation

More Related Videos

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Related Experiment Videos

Last Updated: Mar 9, 2026

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Area of Science:

Statistics
Biostatistics
Machine Learning

Background:

Binary response regression is widely used for classification tasks.
Existing methods often ignore application-specific probability thresholds.
Maximum likelihood estimation is a common but potentially suboptimal approach.

Purpose of the Study:

To develop a novel estimation procedure for linear logistic models that incorporates a priori probability thresholds.
To improve classification accuracy, particularly for high- and low-risk groups.
To reduce prediction error rates in binary response regression.

Main Methods:

A locally weighted score equation approach using a kernel-like weight function centered at the threshold.
Cross-validation of a hybrid loss function combining classification error and divergence.
Exploration of alternative cross-validation functions based on common binary classification metrics.

Main Results:

The proposed method demonstrates reduced error rates compared to maximum likelihood estimation.
Effectiveness is particularly notable under certain forms of model misspecification.
Simulations and a melanoma dataset analysis validate the practical utility of the method.

Conclusions:

Incorporating application-specific thresholds into the estimation procedure enhances classification performance.
The locally weighted approach offers a robust alternative for predictive modeling in binary response regression.
This method provides a valuable tool for risk stratification and classification in various applications.