Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
Correlation of Experimental Data01:23

Correlation of Experimental Data

Dimensional analysis simplifies complex physical problems and guides experimental investigations, but it does not provide complete solutions. It identifies the dimensionless groups that influence a phenomenon, but experimental data is needed to establish the specific relationships and validate theoretical predictions.
For example, a spherical particle moving through a viscous fluid experiences drag. Dimensional analysis shows that the drag force depends on the particle's diameter, velocity, and...
Multiple Regression01:25

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
Applications of Integration to Probability Density Functions01:27

Applications of Integration to Probability Density Functions

Continuous probability distributions are used to model random variables that can take on any real value within a specified range. These variables do not take on isolated or countable values but rather exist on a continuum. For example, the height of an individual can be measured with increasing precision—such as 163.5 or 165.25 centimeters—demonstrating that height is a continuous random variable.The behavior of such variables is described using a probability density function (PDF), which...
Correlations02:20

Correlations

Correlation means that there is a relationship between two or more variables (such as ice cream consumption and crime), but this relationship does not necessarily imply cause and effect. When two variables are correlated, it simply means that as one variable changes, so does the other. We can measure correlation by calculating a statistic known as a correlation coefficient. A correlation coefficient is a number from -1 to +1 that indicates the strength and direction of the relationship between...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Real-World Validation of PinPoint Blood Tests in the NHS: Multivariable Machine Learning to Predict Cancer Risk in Primary Care Urgent Referrals.

Mayo Clinic proceedings. Digital health·2026
Same author

Statistical Discrimination of Urinary Steroid Biomarkers in the Athlete Biological Passport: A Novel Approach to an Abnormal Steroid Profile Score (ASPS).

Drug testing and analysis·2026
Same author

High Quality, Granular, Timely, Trustworthy and Efficient Vertebrate Species Distribution Data Across a 30,000 km<sup>2</sup> Protected Area Complex.

Ecology letters·2025
Same author

Clustering long-term health conditions among 67728 people with multimorbidity using electronic health records in Scotland.

PloS one·2023
Same author

Sociodemographic characteristics and longitudinal progression of multimorbidity: A multistate modelling analysis of a large primary care records dataset in England.

PLoS medicine·2023
Same author

Competitive performance as a discriminator of doping status in elite athletes.

Drug testing and analysis·2023
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026
Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: May 17, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Bayesian correlated clustering to integrate multiple datasets.

Paul Kirk1, Jim E Griffin, Richard S Savage

  • 1Systems Biology Centre, University of Warwick, Coventry, CV4 7AL, UK.

Bioinformatics (Oxford, England)
|October 11, 2012
PubMed
Summary
This summary is machine-generated.

We developed Multiple Dataset Integration (MDI), a Bayesian method to combine diverse biological datasets. MDI successfully integrates multiple data types, revealing underlying similarities and identifying co-regulated genes.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Related Experiment Videos

Last Updated: May 17, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

  • Systems Biology
  • Genomic Medicine
  • Bioinformatics

Background:

  • Integrating diverse biological datasets is crucial but challenging.
  • High-throughput technologies yield complementary data types.
  • Unsupervised integrative modeling is needed for comprehensive analysis.

Purpose of the Study:

  • To present a novel Bayesian method, Multiple Dataset Integration (MDI), for unsupervised integrative modeling.
  • To demonstrate MDI's capability in handling multiple datasets and data types simultaneously.
  • To advance systems biology and genomic medicine through enhanced data integration.

Main Methods:

  • Developed MDI, a Bayesian approach for unsupervised integrative modeling.
  • Modeled individual datasets using Dirichlet-multinomial allocation (DMA) mixture models.
  • Captured inter-dataset dependencies using agreement parameters and Gaussian processes for time series data.

Main Results:

  • MDI successfully integrated multiple datasets, capturing underlying structural similarities.
  • Performance was comparable to state-of-the-art methods in two-dataset integration.
  • Integrated gene expression, ChIP-chip, and protein-protein interaction data to identify cell cycle-regulated protein complexes.

Conclusions:

  • MDI offers a powerful, competitive approach for unsupervised data integration.
  • The method provides insights difficult to obtain with non-integrative or other unsupervised techniques.
  • MDI advances the ability to model complex biological systems by integrating diverse data sources.