Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Analysis01:11

Regression Analysis

5.7K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.7K
Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Errors In Hypothesis Tests01:14

Errors In Hypothesis Tests

4.2K
When performing a hypothesis test, there are four possible outcomes depending on the actual truth (or falseness) of the null hypothesis and the decision to reject or not.
4.2K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

2.5K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
2.5K
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

3.3K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
3.3K
Truncation in Survival Analysis01:09

Truncation in Survival Analysis

194
Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are...
194

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A robust likelihood approach to inference for paired multiple binary endpoints data.

Journal of applied statistics·2024
Same author

Testing for association between ordinal traits and genetic variants in pedigree-structured samples by collapsing and kernel methods.

The international journal of biostatistics·2023
Same author

A model selection criterion for clustered survival analysis with informative cluster size.

Pharmaceutical statistics·2022
Same author

Associating Multivariate Traits with Genetic Variants Using Collapsing and Kernel Methods with Pedigree- or Population-Based Studies.

Computational and mathematical methods in medicine·2021
Same author

Combining dependent p-values by gamma distributions.

Statistical applications in genetics and molecular biology·2020
Same author

A rank-based normalization method with the fully adjusted full-stage procedure in genetic association studies.

PloS one·2020
Same journal

Elastic functional Cox regression model with shape predictors.

Journal of applied statistics·2026
Same journal

An improved two-stage binary relevance method for multilabel classification.

Journal of applied statistics·2026
Same journal

Classification of multivariate functional data with an application to ADHD fMRI data.

Journal of applied statistics·2026
Same journal

Assessing the performance of longitudinal T-lymphocytes as biomarkers of immune recovery in HIV-infected children with or without TB co-infection.

Journal of applied statistics·2026
Same journal

Sparse long-only Markowitz portfolio optimization.

Journal of applied statistics·2026
Same journal

Homogeneity of multinomial populations when data are classified into a large number of groups.

Journal of applied statistics·2026
See all related articles

Related Experiment Video

Updated: Jun 25, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Regression Diagnostic under Model Misspecification.

Li-Chu Chien1, Tsung-Shan Tsou2

  • 1Division of Biostatistics and Bioinformatics, National Health Research Institutes, Taiwan.

Journal of Applied Statistics
|May 31, 2024
PubMed
Summary
This summary is machine-generated.

We introduce new methods to detect influential data points in linear regression. These diagnostics identify observations significantly altering the likelihood function, improving regression analysis reliability.

Keywords:
Cook's distanceDFBETASDFFITSInfluential diagnosticrobust likelihoodrobust normal regression

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K
Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.2K

Related Experiment Videos

Last Updated: Jun 25, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K
Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.2K

Area of Science:

  • Statistics
  • Econometrics
  • Data Science

Background:

  • Traditional regression diagnostics assess data point influence by deletion effects.
  • Existing methods focus on parameter estimates or predicted values.
  • Limitations exist in current diagnostics for identifying subtle influential observations.

Purpose of the Study:

  • To propose novel diagnostic measures for influential observations in linear regression.
  • To offer alternative methods beyond deletion diagnostics.
  • To enhance the robustness of regression parameter estimation.

Main Methods:

  • Developed two new diagnostic statistics for influential observations.
  • Focused on the impact of data point inclusion on the likelihood function.
  • Utilized asymptotic properties for broad applicability.

Main Results:

  • The proposed methods identify influential observations based on likelihood function changes.
  • These diagnostics are asymptotically valid for distributions with existing second moments.
  • Offers a new perspective on detecting influential data points in regression.

Conclusions:

  • The novel diagnostics provide a valuable tool for assessing influential observations in linear regression.
  • These methods complement existing diagnostics by focusing on likelihood.
  • Enhances the reliability and interpretability of regression models.