Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Variation

Variation

An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...

Correlation and Regression

Correlation and Regression

In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Composite Liquid Marble Templated Millimetric Capsule With Tunable Rigidity, Porosity, and Thermal Reconfigurability Toward 3D Cell Culture.

Advanced materials (Deerfield Beach, Fla.)·2026

Same author

A new method for visualizing cerebrovascular vessels.

Frontiers in molecular neuroscience·2026

Same author

The Comparative Sensitivity to Ketamine-Induced Neuronal Death in Juvenile and Adult Rats.

International journal of toxicology·2026

Same author

In-Situ Grown Mixed-Linker Based Redox-Active Cd (II)-Metal Organic Framework on Nickel Foam: A Self-Supported Trifunctional Electrocatalyst for Energy-Efficient Urea-Assisted Overall Water Splitting.

ChemPlusChem·2026

Same author

Potential link of high fat diet and mRNA expression of Alzheimer's disease-related genes in the enteric mucosa of a rat model of Alzheimer's disease.

Journal of Alzheimer's disease reports·2025

Same author

Developing non-invasive molecular markers for early risk assessment of Alzheimer's disease.

Biomarkers in neuropsychiatry·2025

Same journal

VALIDITY IN DESIGN SCIENCE.

MIS quarterly : management information systems·2026

Same journal

AI-Augmented Content Validation in Behavioral Research: Development and Evaluation of the RATER System.

MIS quarterly : management information systems·2025

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 27, 2026

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Digression and Value Concatenation to Enable Privacy-Preserving Regression.

Xiao-Bai Li¹, Sumit Sarkar²

¹Department of Operations and Information Systems, Manning School of Business, University of Massachusetts Lowell, Lowell, MA 01854 U.S.A. { xiaobai_li@uml.edu }

MIS Quarterly : Management Information Systems

|January 12, 2016

Summary

This summary is machine-generated.

Regression attacks can expose sensitive data using regression trees. This study introduces "digression" to assess risk and prune trees, protecting privacy while preserving data utility.

Keywords:

Privacy anonymization data analytics data mining regression regression trees

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Related Experiment Videos

Last Updated: Mar 27, 2026

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Area of Science:

Computer Science
Data Privacy
Machine Learning

Background:

Regression techniques are susceptible to inferring private individual data.
Existing privacy-preserving methods are inadequate against regression attacks.

Purpose of the Study:

To address the novel problem of regression attacks on sensitive data.
To propose a new privacy-preserving approach against such attacks.

Main Methods:

Introduced a novel measure, 'digression', to assess sensitive data disclosure risk.
Developed a tree-pruning algorithm using digression to limit data disclosure.
Proposed a dynamic value-concatenation method for data anonymization.

Main Results:

The proposed approach effectively protects sensitive data privacy.
The method preserves data utility for research and analysis.
Demonstrated effectiveness on real-world financial, economic, and healthcare data.

Conclusions:

The digression measure and associated algorithms offer a robust defense against regression attacks.
The dynamic value-concatenation method enhances data anonymization efficacy.
The approach is versatile for both numeric and categorical data anonymization.