Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

DNA Microarrays02:34

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
Calculating and Interpreting the Linear Correlation Coefficient01:11

Calculating and Interpreting the Linear Correlation Coefficient

The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable, x, and the dependent variable, y. Hence, it is also known as the Pearson product-moment correlation coefficient. It can be calculated using the following equation:
Coefficient of Correlation01:12

Coefficient of Correlation

The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable x and the dependent variable y.
If you suspect a linear relationship between x and y, then r can measure how strong the linear relationship is.
What the VALUE of r tells us:
The value of r is always between –1 and +1: –1 ≤ r ≤ 1.
The size of the correlation r indicates the strength of the linear...
Microsoft Excel: Pearson's Correlation01:18

Microsoft Excel: Pearson's Correlation

Microsoft Excel is a powerful tool for statistical analysis, including calculating Pearson's correlation coefficient, which measures the strength and direction of a linear relationship between two continuous variables. Pearson's correlation coefficient, often denoted as "r," ranges from -1 to 1. A value close to 1 indicates a strong positive correlation, meaning as one variable increases, the other does too. A value close to -1 indicates a strong negative correlation, implying that as one...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Efficacy of aromatherapy for motion sickness: A self-controlled pre-post trial.

Explore (New York, N.Y.)·2026
Same author

Endovascular treatment of intracranial vertebral artery dissecting aneurysm in a patient with aplastic anemia: A case report and literature review.

SAGE open medical case reports·2026
Same author

A Dual-Activated Catalytic Hairpin Assembly Signal Amplification Nanoplatform for High-Sensitivity Detection of Dual-Target MicroRNAs.

ACS omega·2026
Same author

Assessing the factors associated with nurses' perceptions of decent work: a multicenter cross-sectional study.

Frontiers in psychology·2026
Same author

Association of NDRG4 gene methylation in peripheral blood leukocytes with gastric cancer risk, chemotherapy efficacy and prognosis.

Frontiers in oncology·2026
Same author

Association Between Cerebellar Metabolic Markers and Activities of Daily Living in Patients With Spinocerebellar Ataxia Type 3.

Molecular genetics & genomic medicine·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: May 10, 2026

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

Comprehensive analysis of correlation coefficients estimated from pooling heterogeneous microarray data.

Márcia M Almeida-de-Macedo1, Nick Ransom, Yaping Feng

  • 1Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50011, USA. marcia.almeida_de_macedo@syngenta.com.

BMC Bioinformatics
|July 5, 2013
PubMed
Summary
This summary is machine-generated.

Synthesizing microarray data by pooling (melting pot) can introduce bias due to mean differences and heteroskedasticity. Combining statistical results (mosaic) is better for analyzing gene expression correlations across studies.

More Related Videos

Using Microarrays to Interrogate Microenvironmental Impact on Cellular Phenotypes in Cancer
08:20

Using Microarrays to Interrogate Microenvironmental Impact on Cellular Phenotypes in Cancer

Published on: May 21, 2019

A High-throughput Cell Microarray Platform for Correlative Analysis of Cell Differentiation and Traction Forces
12:04

A High-throughput Cell Microarray Platform for Correlative Analysis of Cell Differentiation and Traction Forces

Published on: March 1, 2017

Related Experiment Videos

Last Updated: May 10, 2026

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

Using Microarrays to Interrogate Microenvironmental Impact on Cellular Phenotypes in Cancer
08:20

Using Microarrays to Interrogate Microenvironmental Impact on Cellular Phenotypes in Cancer

Published on: May 21, 2019

A High-throughput Cell Microarray Platform for Correlative Analysis of Cell Differentiation and Traction Forces
12:04

A High-throughput Cell Microarray Platform for Correlative Analysis of Cell Differentiation and Traction Forces

Published on: March 1, 2017

Area of Science:

  • Bioinformatics
  • Statistical Genetics
  • Computational Biology

Background:

  • Microarray data synthesis uses 'mosaic' (combining results) or 'melting pot' (pooling data) approaches.
  • Data heterogeneity in microarray studies (lab differences, experimental conditions) can yield ambiguous results with the 'melting pot' method.

Purpose of the Study:

  • To investigate the impact of mean differences and heteroskedasticity on gene-to-gene Pearson correlation coefficients in pooled microarray data.
  • To compare the biases of pooled correlation coefficients with those from an effect-size model and classical meta-analysis.

Main Methods:

  • Applied statistical theory to analyze 19 groups of microarray data.
  • Quantified biases of pooled coefficients and compared them to an effect-size model.
  • Used simulation studies to corroborate findings on mean differences and heteroskedasticity.

Main Results:

  • Mean differences across microarray groups significantly influenced the magnitude and sign of pooled correlation coefficients, increasing bias.
  • Heteroskedasticity led to less efficient correlation estimations compared to classical meta-analysis.
  • Pooled coefficients showed the largest bias when approaching ±1.

Conclusions:

  • Combining statistical results (mosaic approach) is the preferred method for synthesizing gene expression correlations across multiple microarray studies.
  • The 'melting pot' approach is susceptible to biases introduced by data heterogeneity.