Exploring multicollinearity using a random matrix theory approach.

Kristen Feher¹, James Whelan, Samuel Müller

¹Australian Research Council Centre of Excellence in Plant Energy Biology and Centre of Excellence in Computational System Biology, University of Western Australia.

Statistical Applications in Genetics and Molecular Biology

May 23, 2012

Summary

This summary is machine-generated.

This study introduces a new multicollinear model for understanding gene expression data. The model helps characterize gene clusters and estimate their dimensions, and it cautions against interpreting pairwise correlations in isolation.

Area of Science:

Bioinformatics
Computational Biology
Statistical Genetics

Background:

Gene expression data clustering often aims for dimension reduction.
How low-dimensional signals behave in high-dimensional data is crucial but poorly understood.

Purpose of the Study:

Introduce a multicollinear model based on random matrix theory.
Characterize gene cluster correlation matrices and estimate cluster dimensions.
Investigate the behavior of correlation matrices with embedded low-dimensional signals.

Main Methods:

Utilized a multicollinear model based on random matrix theory (spiked covariance model).
Projected a one-dimensional signal into multiple dimensions.
Empirically examined the eigenspectrum of the correlation matrix via simulation with added noise.
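
The three steps above can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the authors' simulation: the sample size, number of genes, loading range, noise level, and the function names `simulate_spiked_data` and `correlation_eigenspectrum` are all assumptions chosen for the example.

```python
import numpy as np

def simulate_spiked_data(n_samples=200, n_genes=50, noise_sd=1.0, seed=0):
    """Project a one-dimensional signal into n_genes dimensions and add noise.

    All parameter values are hypothetical, chosen only for illustration.
    """
    rng = np.random.default_rng(seed)
    signal = rng.standard_normal(n_samples)            # one-dimensional latent signal
    loadings = rng.uniform(0.5, 1.5, size=n_genes)     # assumed per-gene loadings
    noise = noise_sd * rng.standard_normal((n_samples, n_genes))
    # Rank-one signal (outer product) plus independent noise on every gene.
    return np.outer(signal, loadings) + noise

def correlation_eigenspectrum(data):
    """Return the eigenvalues of the gene-gene correlation matrix, largest first."""
    corr = np.corrcoef(data, rowvar=False)             # columns are genes
    return np.linalg.eigvalsh(corr)[::-1]              # eigvalsh returns ascending order

data = simulate_spiked_data()
eigvals = correlation_eigenspectrum(data)
print("largest eigenvalues:", np.round(eigvals[:3], 2))
```

With a single embedded signal, one eigenvalue should separate clearly from the rest, while the remaining eigenvalues form a bulk produced by the noise.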

Main Results:

The model characterizes gene cluster correlation matrices effectively.
Simulation results inform a dimension estimation procedure for gene clusters.
Demonstrated that low pairwise gene correlations can arise from high dimensionality and noise.
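
The last result can be illustrated with a small simulation. The sketch below is an assumption-laden example rather than the paper's procedure: it uses equal loadings, a hypothetical noise level of 3, and compares the leading eigenvalue with the Marchenko-Pastur upper edge for pure-noise correlation matrices, a standard random matrix theory reference point that is swapped in here for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
n_samples, n_genes, noise_sd = 200, 100, 3.0   # hypothetical settings

# One shared signal observed in every gene, buried in strong independent noise.
signal = rng.standard_normal(n_samples)
data = signal[:, None] + noise_sd * rng.standard_normal((n_samples, n_genes))

corr = np.corrcoef(data, rowvar=False)

# Average off-diagonal (pairwise) correlation between genes.
mean_pairwise = corr[np.triu_indices(n_genes, k=1)].mean()

# Largest eigenvalue of the correlation matrix (eigvalsh sorts ascending).
top_eig = np.linalg.eigvalsh(corr)[-1]

# Marchenko-Pastur upper edge for a pure-noise correlation matrix.
mp_upper_edge = (1 + np.sqrt(n_genes / n_samples)) ** 2

print(f"mean pairwise correlation: {mean_pairwise:.3f}")
print(f"largest eigenvalue: {top_eig:.2f} (noise edge: {mp_upper_edge:.2f})")
```

Each pairwise correlation is weak on its own, yet the leading eigenvalue sits well above what noise alone would be expected to produce, which is the sense in which the shared signal only becomes visible collectively.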

Conclusions:

The eigenspectrum provides collective information about all variables that pairwise correlations alone cannot provide.
The proposed model offers insights into gene expression data structure and noise effects.
Highlights the importance of considering the overall structure rather than isolated correlations.
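
A short worked equation may help show why the eigenspectrum carries collective information. It assumes, purely for illustration and not as the paper's model, that all $p$ genes in a cluster share the same pairwise correlation $\rho$:

```latex
R = (1-\rho)\, I_p + \rho\, \mathbf{1}\mathbf{1}^{\top},
\qquad
\lambda_1 = 1 + (p-1)\rho,
\qquad
\lambda_2 = \dots = \lambda_p = 1 - \rho .
```

Under this assumption, a weak correlation of $\rho = 0.1$ among $p = 100$ genes gives $\lambda_1 = 1 + 99 \times 0.1 = 10.9$, while the other 99 eigenvalues equal $0.9$. The leading eigenvalue grows with the number of genes, so a structure that looks negligible pair by pair can still dominate the spectrum.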