Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

Friedman Two-way Analysis of Variance by Ranks

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

CYP2B6 polymorphisms and suicidal behaviour in people living with HIV treated with efavirenz-containing combination antiretroviral therapy: a global case-control study.

Frontiers in pharmacology·2026

Same author

Racial, ethnic, and regional disparities in HIV testing during the COVID-19 pandemic in the USA: a nationwide, retrospective, observational study using National Clinical Cohort Collaborative data.

The lancet. HIV·2026

Same author

Proteomic and genetic predictors and risk scores of cardiovascular diseases in persons living with HIV.

Frontiers in cardiovascular medicine·2026

Same author

HIV Status and COVID-19 Treatment Disparities in the US National Clinical Cohort Collaborative.

Open forum infectious diseases·2026

Same author

A Bayesian Integrative Mixed Modeling Framework for Analysis of the Multi-Site Adolescent Brain and Cognitive Development Study.

Data science in science·2026

Same author

Identifying People Living With or Those at Risk for HIV in a Nationally Sampled Electronic Health Record Repository Called the National Clinical Cohort Collaborative: Computational Phenotyping Study.

JMIR medical informatics·2025

Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026

Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026

Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026

Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026

Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026

Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 24, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

Sparse sliced inverse regression for high dimensional data analysis.

Haileab Hilafu¹, Sandra E Safo²

¹Department of Business Analytics and Statistics, University of Tennessee, Knoxville, TN, 37996, USA. hhilafu@utk.edu.

BMC Bioinformatics

|May 7, 2022

Summary

This summary is machine-generated.

This study introduces a new method for dimension reduction in high-dimensional data analysis, enhancing variable selection and model interpretability. The approach improves estimation and prediction accuracy in complex datasets.

Keywords:

Generalized eigenvalue decomposition High-dimensional data Linear discriminant analysis Semiparametric model Sliced inverse regression

More Related Videos

A Data-Driven Approach to Quantifying Immune States in Sepsis

A Data-Driven Approach to Quantifying Immune States in Sepsis

Published on: February 7, 2025

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Related Experiment Videos

Last Updated: Sep 24, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

A Data-Driven Approach to Quantifying Immune States in Sepsis

A Data-Driven Approach to Quantifying Immune States in Sepsis

Published on: February 7, 2025

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Area of Science:

Statistics
Data Science
Bioinformatics

Background:

High-dimensional data analysis requires effective dimension reduction and variable selection.
Semi-parametric multi-index models are suitable for analyzing complex, high-dimensional datasets.
Sliced inverse regression (SIR) provides a model-free approach for estimating indices in these models.

Purpose of the Study:

To develop a method for achieving sparse estimates of eigenvectors in SIR.
To facilitate variable selection and improve model interpretability and parsimony.

Main Methods:

A group-Dantzig selector formulation is proposed to induce row-sparsity.
This formulation is applied to sliced inverse regression dimension reduction vectors.

Main Results:

Extensive simulation studies demonstrate the method's performance.
The proposed method is compared against existing state-of-the-art techniques.

Conclusions:

The method achieves competitive performance in estimation, prediction, and variable selection.
Real-world data applications, including a metabolomics depression study, confirm its practical effectiveness.