Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Analysis01:11

Regression Analysis

6.1K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
6.1K
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

311
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
311
Multiple Regression01:25

Multiple Regression

3.2K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.2K
Regression Toward the Mean01:52

Regression Toward the Mean

6.5K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.5K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.9K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.9K
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

144
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
144

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

CYP2B6 polymorphisms and suicidal behaviour in people living with HIV treated with efavirenz-containing combination antiretroviral therapy: a global case-control study.

Frontiers in pharmacology·2026
Same author

Racial, ethnic, and regional disparities in HIV testing during the COVID-19 pandemic in the USA: a nationwide, retrospective, observational study using National Clinical Cohort Collaborative data.

The lancet. HIV·2026
Same author

Proteomic and genetic predictors and risk scores of cardiovascular diseases in persons living with HIV.

Frontiers in cardiovascular medicine·2026
Same author

HIV Status and COVID-19 Treatment Disparities in the US National Clinical Cohort Collaborative.

Open forum infectious diseases·2026
Same author

A Bayesian Integrative Mixed Modeling Framework for Analysis of the Multi-Site Adolescent Brain and Cognitive Development Study.

Data science in science·2026
Same author

Identifying People Living With or Those at Risk for HIV in a Nationally Sampled Electronic Health Record Repository Called the National Clinical Cohort Collaborative: Computational Phenotyping Study.

JMIR medical informatics·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Sep 24, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.8K

Sparse sliced inverse regression for high dimensional data analysis.

Haileab Hilafu1, Sandra E Safo2

  • 1Department of Business Analytics and Statistics, University of Tennessee, Knoxville, TN, 37996, USA. hhilafu@utk.edu.

BMC Bioinformatics
|May 7, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces a new method for dimension reduction in high-dimensional data analysis, enhancing variable selection and model interpretability. The approach improves estimation and prediction accuracy in complex datasets.

Keywords:
Generalized eigenvalue decompositionHigh-dimensional dataLinear discriminant analysisSemiparametric modelSliced inverse regression

More Related Videos

A Data-Driven Approach to Quantifying Immune States in Sepsis
07:42

A Data-Driven Approach to Quantifying Immune States in Sepsis

Published on: February 7, 2025

322
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.6K

Related Experiment Videos

Last Updated: Sep 24, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.8K
A Data-Driven Approach to Quantifying Immune States in Sepsis
07:42

A Data-Driven Approach to Quantifying Immune States in Sepsis

Published on: February 7, 2025

322
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.6K

Area of Science:

  • Statistics
  • Data Science
  • Bioinformatics

Background:

  • High-dimensional data analysis requires effective dimension reduction and variable selection.
  • Semi-parametric multi-index models are suitable for analyzing complex, high-dimensional datasets.
  • Sliced inverse regression (SIR) provides a model-free approach for estimating indices in these models.

Purpose of the Study:

  • To develop a method for achieving sparse estimates of eigenvectors in SIR.
  • To facilitate variable selection and improve model interpretability and parsimony.

Main Methods:

  • A group-Dantzig selector formulation is proposed to induce row-sparsity.
  • This formulation is applied to sliced inverse regression dimension reduction vectors.

Main Results:

  • Extensive simulation studies demonstrate the method's performance.
  • The proposed method is compared against existing state-of-the-art techniques.

Conclusions:

  • The method achieves competitive performance in estimation, prediction, and variable selection.
  • Real-world data applications, including a metabolomics depression study, confirm its practical effectiveness.