Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Regression01:25

Multiple Regression

4.3K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.3K
Regression Analysis01:11

Regression Analysis

8.8K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.8K
Types of Selection01:46

Types of Selection

46.1K
Natural selection influences the frequencies of particular alleles and phenotypes within populations in several different ways. Primarily, natural selection can be directional, stabilizing, or disruptive. Directional selection favors one extreme trait and shifts the population towards that phenotype while selecting against individuals displaying alternate traits. Stabilizing selection favors an intermediate trait with a narrow range of variation. Deviation from the optimal phenotype towards an...
46.1K
Regression Toward the Mean01:52

Regression Toward the Mean

7.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
7.3K
Choosing Between z and t Distribution01:25

Choosing Between z and t Distribution

3.7K
The z and the Student t distribution estimate the population mean using the sample mean and standard deviation. However, to decide which distribution to use for a calculation, one needs to determine the sample size, the nature of the distribution, and whether the population standard deviation is known. If the population standard deviation is known and the population is normally distributed, or if the sample size is greater than 30, the z distribution is preferred. The Student t distribution is...
3.7K
Calibration Curves: Linear Least Squares01:20

Calibration Curves: Linear Least Squares

4.9K
A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...
4.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multilevel functional quantile principal component analysis.

Biostatistics (Oxford, England)·2026
Same author

Spinal cord stimulation for upper limb motor function in people with chronic post-stroke hemiparesis: a feasibility trial.

Nature medicine·2026
Same author

Correcting Measurement Error and Zero Inflation in Functional Covariates for Scalar-on-Function Quantile Regression.

Statistics in medicine·2026
Same author

Oregon's Hospital Rate Cap: A Questionable Measure.

Health affairs (Project Hope)·2026
Same author

Community severance and mental health-related hospital visits in New York City.

Environmental epidemiology (Philadelphia, Pa.)·2026
Same author

Characteristics of Transient Elastography for Hepatic Steatosis and Fibrosis in Children Undergoing Liver Biopsy.

The Journal of pediatrics·2026
Same journal

On the Connections Among Three Transfer Learning Paradigms.

Stat (International Statistical Institute)·2025
Same journal

Accelerating Resident Research within Quantitative Collaboration Units in Academic Healthcare.

Stat (International Statistical Institute)·2025
Same journal

Multivariate differential association analysis.

Stat (International Statistical Institute)·2024
Same journal

Developing partnerships for academic data science consulting and collaboration units.

Stat (International Statistical Institute)·2024
Same journal

Deep learning models to predict primary open-angle glaucoma.

Stat (International Statistical Institute)·2024
Same journal

What is it that you say you do here? Advocating for the critical role of data scientists in research infrastructure.

Stat (International Statistical Institute)·2024
See all related articles

Related Experiment Video

Updated: Mar 17, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

Variable Selection in Function-on-Scalar Regression.

Yakuan Chen1, Jeff Goldsmith1, Todd Ogden1

  • 1Department of Biostatistics, Mailman School of Public Health, Columbia University.

Stat (International Statistical Institute)
|July 19, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces a new variable selection method for function-on-scalar regression models, accounting for correlated residuals. The proposed approach improves prediction accuracy in complex functional data analysis.

Keywords:
Group MCPKinematic DataPre-whiteningSplines

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.7K

Related Experiment Videos

Last Updated: Mar 17, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.7K

Area of Science:

  • Statistics
  • Functional Data Analysis
  • Machine Learning

Background:

  • Function-on-scalar regression models often involve numerous predictors, posing challenges for variable selection.
  • Existing methods for these models typically do not address the correlation structure within residual curves.

Purpose of the Study:

  • To develop a novel variable selection technique for function-on-scalar models that accounts for residual curve correlations.
  • To enhance the accuracy and interpretability of regression models with functional responses.

Main Methods:

  • Representing coefficient functions using B-spline bases to transform the problem into multivariate regression.
  • Employing group-minimax concave penalty (MCP) for effective variable selection.
  • Adapting generalized least squares for residual covariance by pre-whitening and developing an iterative algorithm for coefficient and covariance updates.

Main Results:

  • The proposed iterative algorithm demonstrates performance comparable to pre-whitening with true covariance.
  • The method significantly outperforms approaches that ignore the covariance structure of residuals.
  • Application to stroke severity and motor control data yielded lower prediction errors compared to existing methods.

Conclusions:

  • The developed method offers a robust approach to variable selection in function-on-scalar models, particularly when residual curves are correlated.
  • This technique enhances predictive performance and provides a more accurate understanding of predictor effects in functional data.
  • The findings have implications for analyzing complex biological and biomechanical data, such as motor control studies.