Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Weighted Mean

Weighted Mean

While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Variation

Variation

An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when researchers try to extrapolate results...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Vine copula mixed models for meta-analysis of diagnostic accuracy studies without a gold standard.

Biometrics·2025

Same author

Joint meta-analysis of two diagnostic tests accounting for within and between studies dependence.

Statistical methods in medical research·2024

Same author

Likelihood Inference for Factor Copula Models with Asymmetric Tail Dependence.

Entropy (Basel, Switzerland)·2024

Same author

Factor Tree Copula Models for Item Response Data.

Psychometrika·2023

Same author

Bi-factor and Second-Order Copula Models for Item Response Data.

Psychometrika·2022

Same author

Copula diagnostics for asymmetries and conditional dependence.

Journal of applied statistics·2022

Same journal

A Bayesian functional concurrent zero-inflated Dirichlet-multinomial regression model with application to infant microbiome.

Biostatistics (Oxford, England)·2026

Same journal

Towards optimal environmental policies: policy learning under arbitrary bipartite network interference.

Biostatistics (Oxford, England)·2026

Same journal

Multilevel functional quantile principal component analysis.

Biostatistics (Oxford, England)·2026

Same journal

Adaptive transfer learning for time-to-event modeling with applications in disease risk assessment.

Biostatistics (Oxford, England)·2026

Same journal

High-dimensional test for one-sided hypotheses.

Biostatistics (Oxford, England)·2026

Same journal

NBSR: a Negative Binomial Softmax Regression model for microRNA-seq data analysis.

Biostatistics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 3, 2026

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Weighted scores method for regression models with dependent data.

Aristidis K Nikoloulopoulos¹, Harry Joe, N Rao Chaganty

¹School of Computing Sciences, University of East Anglia, Norwich NR4 7TJ, UK. a.nikoloulopoulos@uea.ac.uk

Biostatistics (Oxford, England)

|March 26, 2011

Summary

This summary is machine-generated.

This study introduces a weighted scores method for analyzing dependent data in regression models, offering a robust and efficient alternative to complex copula methods when dependence is not the primary focus. The new approach provides nearly maximum likelihood efficiency for analyzing health care utilization.

More Related Videos

Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding

Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding

Published on: October 11, 2018

Related Experiment Videos

Last Updated: Jun 3, 2026

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding

Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding

Published on: October 11, 2018

Area of Science:

Biostatistics
Statistical Modeling
Health Services Research

Background:

Existing copula-based models for dependent data (e.g., clustered, longitudinal overdispersed counts) offer straightforward estimation but can be complex when dependence is not the primary interest.
Regression analysis with dependent data often requires specialized models that account for complex correlation structures.

Purpose of the Study:

To propose and evaluate a novel "weighted scores method" for regression analysis of dependent data.
To provide a method that focuses on univariate regression parameters while effectively handling data dependence.
To assess the robustness and efficiency of the proposed method compared to existing approaches.

Main Methods:

The weighted scores method involves weighting score functions of univariate margins.
Weight matrices are derived from fitting a discretized multivariate normal distribution to capture dependence.
The methodology is applied to negative binomial regression models for overdispersed count data.

Main Results:

Asymptotic and small-sample efficiency calculations demonstrate the robustness of the weighted scores method.
The proposed method achieves efficiency comparable to maximum likelihood estimation in fully specified copula models.
The method is effective for analyzing health care utilization data based on family characteristics.

Conclusions:

The weighted scores method offers a practical and efficient alternative for regression with dependent data when the focus is on marginal parameters.
This approach simplifies analysis without sacrificing significant statistical efficiency.
The method is applicable to real-world health services research, as shown in the healthcare utilization example.