Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Selected Data About Geographic Locations01:25

Selected Data About Geographic Locations

278
Geographic Information Systems (GIS) rely on two core types of data: spatial data and attribute data.Spatial DataSpatial data defines the physical location of features within a coordinate system, typically expressed in terms of latitude and longitude. It provides precise positioning for elements like roads, rivers, or buildings.Attribute DataAttribute data complements spatial data by adding descriptive information about these features. For example, a road's spatial data includes its start and...
278
Correlation of Experimental Data01:23

Correlation of Experimental Data

491
Dimensional analysis simplifies complex physical problems and guides experimental investigations, but it does not provide complete solutions. It identifies the dimensionless groups that influence a phenomenon, but experimental data is needed to establish the specific relationships and validate theoretical predictions.
For example, a spherical particle moving through a viscous fluid experiences drag. Dimensional analysis shows that the drag force depends on the particle's diameter, velocity,...
491
Correlations02:20

Correlations

36.4K
Correlation means that there is a relationship between two or more variables (such as ice cream consumption and crime), but this relationship does not necessarily imply cause and effect. When two variables are correlated, it simply means that as one variable changes, so does the other. We can measure correlation by calculating a statistic known as a correlation coefficient. A correlation coefficient is a number from -1 to +1 that indicates the strength and direction of the relationship between...
36.4K
Correlation and Causation01:27

Correlation and Causation

42.8K
Statistical tests can calculate whether there is a relationship, or correlation, between independent and dependent variables. An indirect relationship of the variables signifies a correlation, while a direct relationship shows causation. If it is determined that no connection exists between the variables, then the correlation is a coincidence.
Correlation versus Causation
If the dependent variable increases or decreases when the independent variable increases, there is a positive or negative...
42.8K
Antibiotic Selection00:57

Antibiotic Selection

60.0K
Overview
60.0K
Correlation01:09

Correlation

15.2K
In statistics, two variables are said to be correlated if the values of one variable are associated with the other variable. Depending on the relationship between two variables, correlation can be of three types– positive correlation, negative correlation, and zero correlation.
Two variables, for example, a and b, are said to be positively correlated if both variables move in the same direction. In other words, a positive correlation exists between two variables, a and b, if:
15.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cross-Domain Communication Method Based on Load Balancing for SDNs.

Sensors (Basel, Switzerland)·2025
Same author

An Accuracy-Aware Energy-Efficient Multipath Routing Algorithm for WSNs.

Sensors (Basel, Switzerland)·2024
Same author

Bayesian data assimilation for estimating instantaneous reproduction numbers during epidemics: Applications to COVID-19.

PLoS computational biology·2022
Same author

COVID-19 Rehabilitation With Herbal Medicine and Cardiorespiratory Exercise: Protocol for a Clinical Study.

JMIR research protocols·2021
Same author

Identification of the miRNA-mRNA regulatory network in multiple sclerosis.

Neurological research·2016
Same author

Fluorescent Nanocomposite for Visualizing Cross-Talk between MicroRNA-21 and Hydrogen Peroxide in Ischemia-Reperfusion Injury in Live Cells and In Vivo.

Analytical chemistry·2016
Same journal

Analysis of strength degradation of coal and rock masses and stability of mined areas under long term immersion environment.

PloS one·2026
Same journal

Biogenic Silver-Selenium nanocomposite with anticancer activity and potent efficacy against vancomycin-resistant Staphylococcus aureus.

PloS one·2026
Same journal

Preparation and physicochemical characterization of a biodegradable chitosan/carboxymethyl cellulose hydrogel synthesized in NaOH/urea medium.

PloS one·2026
Same journal

Action-guilt, survivor-guilt, and depression in combat-related PTSD.

PloS one·2026
Same journal

Explainable machine learning for predicting activities of daily living at discharge in stroke patients: A retrospective study using SHAP interpretability.

PloS one·2026
Same journal

Deep learning based two-way feature depiction model for brain tumor detection.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Feb 8, 2026

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data
04:57

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

17.5K

Sparse Bayesian classification and feature selection for biological expression data with high correlations.

Xian Yang1, Wei Pan2, Yike Guo1

  • 1Data Science Institute, Imperial College London, London, SW7 2AZ, United Kingdom.

Plos One
|December 28, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a unified framework for robust biological data classification and biomarker discovery. The novel approach efficiently handles large datasets and correlated features, improving disease subtype prediction and pathway analysis.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.7K

Related Experiment Videos

Last Updated: Feb 8, 2026

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data
04:57

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

17.5K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.7K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Biological expression data analysis is crucial for disease subtype classification and biomarker identification.
  • Existing methods face challenges with high dimensionality, scalability, and feature correlation in large biological datasets.
  • A unified approach is needed to address these limitations comprehensively.

Purpose of the Study:

  • To propose a unified framework for robust classification and feature selection from biological expression data.
  • To address challenges of small biomarker numbers, data scalability, and feature correlation.
  • To enable efficient biomarker discovery for improved disease subtyping and pathway analysis.

Main Methods:

  • Formulated classification and feature selection as a non-convex optimization problem.
  • Relaxed the problem and solved iteratively using convex optimization procedures.
  • Enabled distributed computation for efficient implementation on advanced infrastructures.

Main Results:

  • Demonstrated the framework's competence on simulated datasets under various conditions.
  • Successfully analyzed a real-world gene expression dataset for embryonal tumors.
  • Selected features facilitated downstream functional and pathway analysis, revealing biological insights.

Conclusions:

  • The proposed unified framework effectively addresses key challenges in biological data classification and feature selection.
  • The method offers a scalable and efficient solution for biomarker discovery.
  • Selected biomarkers provide valuable insights into biological functions and pathways relevant to disease.