Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Software for Data Analysis and Clinical Trials01:12

Statistical Software for Data Analysis and Clinical Trials

1.4K
Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...
1.4K
Biostatistics: Overview01:20

Biostatistics: Overview

705
Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...
705
Genomics02:02

Genomics

39.6K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
39.6K
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

885
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
885
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.6K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.6K
Overview of Minitab01:11

Overview of Minitab

557
Minitab is a statistical software package designed for data analysis. With its origins in the 1970s and development at Pennsylvania State University, Minitab has grown significantly in its capabilities and applications. It plays a crucial role in quality management projects, especially in Six Sigma initiatives, by offering tools for process improvement and statistical analysis. Minitab's significance lies in its user-friendly interface, making complex statistical analysis accessible to...
557

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Use of clustering techniques for clinical and epidemiological research: practical tips using an example from rheumatology.

Journal of clinical epidemiology·2026
Same author

Development of multiple microbiome biomarkers using penalized regression methods.

Journal of applied microbiology·2025
Same author

A T-cell-based metric of immune age predicts outcomes in older patients with myeloma receiving daratumumab-based therapy.

Blood·2025
Same author

A hierarchical negative-binomial model for analysis of correlated sequencing data: practical implementations.

Bioinformatics advances·2025
Same author

Long-Term Experiences of Health Care Providers Using Iris Scanning as an Identification Tool in a Vaccine Trial in the Democratic Republic of the Congo: Qualitative Study.

JMIR formative research·2025
Same author

Computational Tools for Hydrogen-Deuterium Exchange Mass Spectrometry Data Analysis.

Chemical reviews·2024
Same journal

Region-aware bridge modeling enables interpretable mesoscale representation of spatial transcriptomic tissue sections.

Bioinformatics advances·2026
Same journal

Microbiome differential abundance methodologies to detect relevant taxa associated with chemotherapy toxicity rate in colorectal cancer.

Bioinformatics advances·2026
Same journal

maldipickr dereplicates microbial MALDI-TOF spectra to facilitate multiplexed isolation.

Bioinformatics advances·2026
Same journal

RAM-MSA: an anytime memory-bounded method for exact multiple sequence alignment using path finding.

Bioinformatics advances·2026
Same journal

Interpretable machine learning for low-sample multi-omics: a case study of ferret vaccine response.

Bioinformatics advances·2026
Same journal

DeepTaxa: a hybrid CNN-BERT framework for 16S rRNA taxonomic classification.

Bioinformatics advances·2026
See all related articles

Related Experiment Video

Updated: Jan 10, 2026

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

4.0K

SUMO: an R package for simulating multi-omics data for methods development and testing.

Bernard Isekah Osang'ir1,2, Surya Gupta1, Ziv Shkedy2

  • 1Microbiology Unit, Nuclear Medical Applications, Belgian Nuclear Research Centre, SCK CEN, Boeretang 190, Mol, 2400, Belgium.

Bioinformatics Advances
|November 24, 2025
PubMed
Summary
This summary is machine-generated.

SUMO is an R package that generates simulated multi-omics datasets. This tool allows researchers to create complex datasets for testing computational methods in multi-omics research.

More Related Videos

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

2.0K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.2K

Related Experiment Videos

Last Updated: Jan 10, 2026

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

4.0K
Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

2.0K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.2K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Data Science

Background:

  • Multi-omics research integrates diverse biological data types, driving demand for advanced computational tools.
  • Evaluating new multi-omics computational methods is challenging due to the lack of standardized datasets with defined signal structures.

Purpose of the Study:

  • To introduce SUMO (SimUlating Multi-Omics), an open-source R package designed for generating synthetic multi-omics datasets.
  • To provide researchers with a flexible tool for creating datasets with controllable latent structures, noise levels, and complexity.

Main Methods:

  • SUMO utilizes factor analysis-based approaches to simulate multi-omics data.
  • The package allows users to specify distinct or shared non-overlapping latent factors, offering precise control over signal characteristics.

Main Results:

  • SUMO enables the generation of high-quality, factor analysis-based multi-omics datasets.
  • The package facilitates reproducible testing and validation of computational methods by providing customizable synthetic data.

Conclusions:

  • SUMO addresses the critical need for benchmark datasets in multi-omics research.
  • This tool supports methodological innovation by enabling rigorous evaluation of new computational approaches in the field.