Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Bar Graph01:07

Multiple Bar Graph

As the name suggests, a multiple bar graph is the same as a bar graph but has multiple bars to depict relationships between different data values. One can include as many parameters as possible. However, each parameter must have the same unit of measurement.
Each bar or column in the multiple bar graph represents a data value. These graphs are used primarily in interrelating two or more sets of data. The categories of different kinds of data are listed along the horizontal or x-axis, whereas...
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
Data: Types and Distribution01:19

Data: Types and Distribution

In biostatistics, data are the observations collected for analysis. There are two main types: parametric and non-parametric. Parametric data, which include continuous (e.g., weight) and discrete numerical data (e.g., number of tablets), assume a particular distribution pattern, often the normal distribution. Non-parametric data do not adhere to a specific distribution and typically comprise nominal (e.g., gender) and ordinal categorical data (e.g., pain scale ratings).
Distributions in...
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
Multiple Regression01:25

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
Multiple Allele Traits01:49

Multiple Allele Traits

The Concept of Multiple Allelism

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Oncology Research Information Exchange Network (ORIEN) - Building a real-world collaborative, patient-driven infrastructure for discovery research and precision oncology.

Research square·2026
Same author

A case of <i>Nocardia asiatica</i> pneumonia: Relapse caused by trimethoprim-sulfamethoxazole treatment nonadherence.

IDCases·2026
Same author

The Offender Personality Disorder Pathway for Men: Staff Perceptions About Possible Impact on Re-offending in High-Risk Individuals with Personality Disorder.

International journal of offender therapy and comparative criminology·2026
Same author

<i>CanDrivR-CS</i>: a cancer-specific machine learning framework for distinguishing recurrent and rare variants.

Bioinformatics advances·2026
Same author

Living Cells Employ Ubiquitin-Proteasomal System and Nucleotide Excision Repair Pathways to Remove Reactive Oxygen Species-Induced DNA-Protein Crosslinks (ROS-DPCs).

bioRxiv : the preprint server for biology·2026
Same author

Plasma cell identity escape drives resistance to anti-BCMA T-cell-redirecting therapy in multiple myeloma.

bioRxiv : the preprint server for biology·2025
Same journal

Annealed variational mixtures for disease subtyping and biomarker discovery.

Statistical applications in genetics and molecular biology·2026
Same journal

Performance of the permutation test approach with base calling errors for detecting changes in variant allele frequencies in ctDNA for a single patient.

Statistical applications in genetics and molecular biology·2026
Same journal

BLOG: Bayesian longitudinal omics with group constraints.

Statistical applications in genetics and molecular biology·2026
Same journal

AI-driven risk prediction and categorization in cystic fibrosis leveraging AttentiveLSTM and Fox Wolf Optimizer.

Statistical applications in genetics and molecular biology·2026
Same journal

Perfect collinearity not created equal: measuring and visualizing the severity of multi-collinearity of modern omics data.

Statistical applications in genetics and molecular biology·2026
Same journal

Corrigendum to: Choice of baseline hazards in joint modeling of longitudinal and time-to-event cancer survival data.

Statistical applications in genetics and molecular biology·2025
See all related articles

Related Experiment Video

Updated: Jun 22, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Bayesian unsupervised learning with multiple data types.

Phaedra Agius1, Yiming Ying, Colin Campbell

  • 1MSKCC, USA. phaedragius@gmail.com

Statistical Applications in Genetics and Molecular Biology
|July 4, 2009
PubMed
Summary
This summary is machine-generated.

We developed Bayesian models for unsupervised learning using gene expression and microRNA data. Our approach identifies key features and subtypes in breast cancer, revealing a genetic signature originating from microRNA expression.

Related Experiment Videos

Last Updated: Jun 22, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

  • Computational biology
  • Bioinformatics
  • Machine learning

Background:

  • Unsupervised learning is crucial for identifying patterns in complex biological data.
  • Integrating multi-modal data, like gene expression and microRNA, offers deeper insights into disease mechanisms.
  • Existing methods may not fully capture dependencies between different data types.

Purpose of the Study:

  • To propose novel Bayesian generative models for unsupervised learning with two dependent data types.
  • To develop algorithms that identify relevant features and determine the optimal number of clusters.
  • To apply these models to a breast cancer dataset for subtype discovery and biomarker identification.

Main Methods:

  • Bayesian generative models with shared latent variables across datasets.
  • Correspondence modeling to capture dependencies between gene expression and microRNA data.
  • Evaluation on synthetic data and a real-world breast cancer dataset (gene expression and microRNA).

Main Results:

  • The models successfully identified relevant features and appropriate cluster numbers in both synthetic and real data.
  • Application to breast cancer data revealed a genetic signature for the basal-like subtype.
  • This signature was found to arise from and be derivative of microRNA expression clustering.

Conclusions:

  • Bayesian generative models provide a powerful framework for integrated, unsupervised analysis of multi-modal biological data.
  • The developed methods can uncover disease-related signatures and their underlying molecular drivers.
  • MicroRNA expression data may play a primary role in defining certain breast cancer subtypes.