Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.3K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.3K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

9.7K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
9.7K
Regression Toward the Mean01:52

Regression Toward the Mean

7.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
7.3K
z Scores and Area Under the Curve01:17

z Scores and Area Under the Curve

19.9K
z scores are the standardized values obtained after converting a normal distribution into a standard normal distribution. A z score is measured in units of the standard deviation. The z score tells you how many standard deviations the value x is above (to the right of) or below (to the left of) the mean, μ. Values of x that are larger than the mean have positive z scores, and values of x that are smaller than the mean have negative z scores. If x equals the mean, then x has a z score of...
19.9K
Prediction Intervals01:03

Prediction Intervals

3.5K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
3.5K
Weighted Mean00:57

Weighted Mean

7.2K
While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...
7.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A bounded hazard ratio Cox model for the effect of time to treatment on mortality.

Statistical methods in medical research·2026
Same author

Evaluating implementation of the HEARTS model for management of hypertension and diabetes in Guatemala: protocol for a prospective observational hybrid type 3 study.

Implementation science communications·2026
Same author

Surveillance of Pharmaceutical Risk-Mitigation Behavior: Applying and Comparing Statistical Process Control Methods Using Real World Data.

Learning health systems·2026
Same author

Accuracy of Surveys for Estimating Coverage for Hepatitis A and B Vaccinations in Adults.

Preventing chronic disease·2026
Same author

Gradient boosting-based discrete failure time model for selecting time-varying effects and interactions.

Lifetime data analysis·2026
Same author

Refining normoxemia targets in acute burn care: Reply.

The journal of trauma and acute care surgery·2026
Same journal

Design of Trials with Composite Endpoints with the R Package CompAREdesign.

Statistics in biosciences·2026
Same journal

Pan-Cancer Drug Response Prediction Using Integrative Principal Component Regression.

Statistics in biosciences·2026
Same journal

Variance Estimation for Weighted Average Treatment Effects.

Statistics in biosciences·2026
Same journal

Bayesian Modeling on Microbiome Data Analysis: Application to Subgingival Microbiome Study.

Statistics in biosciences·2026
Same journal

Canopy2: Tumor Phylogeny Inference by Bulk DNA and Single-Cell RNA Sequencing.

Statistics in biosciences·2026
Same journal

Multilevel Multivariate Functional Principal Component Analysis of Evoked and Induced Event-Related Spectral Perturbations.

Statistics in biosciences·2026
See all related articles

Related Experiment Video

Updated: Mar 9, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.7K

Locally Weighted Score Estimation for Quantile Classification in Binary Regression Models.

John D Rice1, Jeremy M G Taylor1

  • 1University of Michigan, Department of Biostatistics, 1415 Washington Heights, Ann Arbor, MI 48104, USA.

Statistics in Biosciences
|December 27, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces a new method for binary response regression, incorporating application-specific probability thresholds for improved classification accuracy. The locally weighted score approach enhances prediction for high- and low-risk groups, reducing error rates compared to traditional methods.

Keywords:
asymmetric lossbinary classificationlocal likelihoodlogistic regressionrobust estimation

More Related Videos

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.4K

Related Experiment Videos

Last Updated: Mar 9, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.7K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.4K

Area of Science:

  • Statistics
  • Biostatistics
  • Machine Learning

Background:

  • Binary response regression is widely used for classification tasks.
  • Existing methods often ignore application-specific probability thresholds.
  • Maximum likelihood estimation is a common but potentially suboptimal approach.

Purpose of the Study:

  • To develop a novel estimation procedure for linear logistic models that incorporates a priori probability thresholds.
  • To improve classification accuracy, particularly for high- and low-risk groups.
  • To reduce prediction error rates in binary response regression.

Main Methods:

  • A locally weighted score equation approach using a kernel-like weight function centered at the threshold.
  • Cross-validation of a hybrid loss function combining classification error and divergence.
  • Exploration of alternative cross-validation functions based on common binary classification metrics.

Main Results:

  • The proposed method demonstrates reduced error rates compared to maximum likelihood estimation.
  • Effectiveness is particularly notable under certain forms of model misspecification.
  • Simulations and a melanoma dataset analysis validate the practical utility of the method.

Conclusions:

  • Incorporating application-specific thresholds into the estimation procedure enhances classification performance.
  • The locally weighted approach offers a robust alternative for predictive modeling in binary response regression.
  • This method provides a valuable tool for risk stratification and classification in various applications.