Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

DNA Microarrays

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...

Calculating and Interpreting the Linear Correlation Coefficient

Calculating and Interpreting the Linear Correlation Coefficient

The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable, x, and the dependent variable, y. Hence, it is also known as the Pearson product-moment correlation coefficient. It can be calculated using the following equation:

Coefficient of Correlation

Coefficient of Correlation

The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable x and the dependent variable y.
If you suspect a linear relationship between x and y, then r can measure how strong the linear relationship is.
What the VALUE of r tells us:
The value of r is always between –1 and +1: –1 ≤ r ≤ 1.
The size of the correlation r indicates the strength of the linear...

Microsoft Excel: Pearson's Correlation

Microsoft Excel: Pearson's Correlation

Microsoft Excel is a powerful tool for statistical analysis, including calculating Pearson's correlation coefficient, which measures the strength and direction of a linear relationship between two continuous variables. Pearson's correlation coefficient, often denoted as "r," ranges from -1 to 1. A value close to 1 indicates a strong positive correlation, meaning as one variable increases, the other does too. A value close to -1 indicates a strong negative correlation, implying that as one...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Efficacy of aromatherapy for motion sickness: A self-controlled pre-post trial.

Explore (New York, N.Y.)·2026

Same author

Endovascular treatment of intracranial vertebral artery dissecting aneurysm in a patient with aplastic anemia: A case report and literature review.

SAGE open medical case reports·2026

Same author

A Dual-Activated Catalytic Hairpin Assembly Signal Amplification Nanoplatform for High-Sensitivity Detection of Dual-Target MicroRNAs.

ACS omega·2026

Same author

Assessing the factors associated with nurses' perceptions of decent work: a multicenter cross-sectional study.

Frontiers in psychology·2026

Same author

Association of NDRG4 gene methylation in peripheral blood leukocytes with gastric cancer risk, chemotherapy efficacy and prognosis.

Frontiers in oncology·2026

Same author

Association Between Cerebellar Metabolic Markers and Activities of Daily Living in Patients With Spinocerebellar Ataxia Type 3.

Molecular genetics & genomic medicine·2026

Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026

Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026

Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026

Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026

Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026

Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 10, 2026

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

Comprehensive analysis of correlation coefficients estimated from pooling heterogeneous microarray data.

Márcia M Almeida-de-Macedo¹, Nick Ransom, Yaping Feng

¹Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50011, USA. marcia.almeida_de_macedo@syngenta.com.

BMC Bioinformatics

|July 5, 2013

Summary

This summary is machine-generated.

Synthesizing microarray data by pooling (melting pot) can introduce bias due to mean differences and heteroskedasticity. Combining statistical results (mosaic) is better for analyzing gene expression correlations across studies.

More Related Videos

Using Microarrays to Interrogate Microenvironmental Impact on Cellular Phenotypes in Cancer

Using Microarrays to Interrogate Microenvironmental Impact on Cellular Phenotypes in Cancer

Published on: May 21, 2019

A High-throughput Cell Microarray Platform for Correlative Analysis of Cell Differentiation and Traction Forces

A High-throughput Cell Microarray Platform for Correlative Analysis of Cell Differentiation and Traction Forces

Published on: March 1, 2017

Related Experiment Videos

Last Updated: May 10, 2026

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

Using Microarrays to Interrogate Microenvironmental Impact on Cellular Phenotypes in Cancer

Using Microarrays to Interrogate Microenvironmental Impact on Cellular Phenotypes in Cancer

Published on: May 21, 2019

A High-throughput Cell Microarray Platform for Correlative Analysis of Cell Differentiation and Traction Forces

A High-throughput Cell Microarray Platform for Correlative Analysis of Cell Differentiation and Traction Forces

Published on: March 1, 2017

Area of Science:

Bioinformatics
Statistical Genetics
Computational Biology

Background:

Microarray data synthesis uses 'mosaic' (combining results) or 'melting pot' (pooling data) approaches.
Data heterogeneity in microarray studies (lab differences, experimental conditions) can yield ambiguous results with the 'melting pot' method.

Purpose of the Study:

To investigate the impact of mean differences and heteroskedasticity on gene-to-gene Pearson correlation coefficients in pooled microarray data.
To compare the biases of pooled correlation coefficients with those from an effect-size model and classical meta-analysis.

Main Methods:

Applied statistical theory to analyze 19 groups of microarray data.
Quantified biases of pooled coefficients and compared them to an effect-size model.
Used simulation studies to corroborate findings on mean differences and heteroskedasticity.

Main Results:

Mean differences across microarray groups significantly influenced the magnitude and sign of pooled correlation coefficients, increasing bias.
Heteroskedasticity led to less efficient correlation estimations compared to classical meta-analysis.
Pooled coefficients showed the largest bias when approaching ±1.

Conclusions:

Combining statistical results (mosaic approach) is the preferred method for synthesizing gene expression correlations across multiple microarray studies.
The 'melting pot' approach is susceptible to biases introduced by data heterogeneity.