Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Synthetic Biology02:55

Synthetic Biology

5.3K
Synthetic biology is an interdisciplinary science that involves using principles from disciplines such as engineering, molecular biology, cell biology, and systems biology. It involves remodeling existing organisms from nature or constructing completely new synthetic organisms for applications such as protein or enzyme production, bioremediation, value-added macromolecule production, and the addition of desirable traits to crops, to name a few.
Golden rice
Golden rice is a genetically modified...
5.3K
Genomics02:02

Genomics

39.1K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
39.1K
Data Collection by Observations01:08

Data Collection by Observations

14.2K
Data collection refers to a systematic way of obtaining, observing, measuring, and analyzing accurate information. Observational studies are one of the most widely used methods of data collection. It involves collecting data by observing the behavior and physical characteristics of a sample without making any modifications to the sample.
An astronomer viewing the motion and brightness of stars in the sky and recording the data is an example of observational data collection. A botanist recording...
14.2K
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

15.0K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
15.0K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

20.1K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
20.1K
RNA-seq03:21

RNA-seq

11.3K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
11.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Sparse dimensionality reduction for analyzing single-cell-resolved interactions.

Bioinformatics advances·2026
Same author

Impact of catheter-directed thrombolysis on the socio-economic burden of pulmonary embolism in Germany: a cost-effectiveness analysis.

European heart journal. Quality of care & clinical outcomes·2026
Same author

Identifying Biomedical Entities for Datasets in Scientific Articles: 4-Step Cache-Augmented Generation Approach Using GPT-4o and PubTator 3.0.

JMIR formative research·2025
Same author

Sparse dimensionality reduction for analyzing single-cell-resolved interactions.

Bioinformatics (Oxford, England)·2025
Same author

Exploring Feature Preferences for a Treatment-Accompanying App in Patients Undergoing Radiation Therapy: Cross-Sectional Study.

JMIR cancer·2025
Same author

Evaluating discrepancies in dimensionality reduction for time-series single-cell RNA-sequencing data.

Briefings in bioinformatics·2025
Same journal

STED: flexible cross-modal topic modeling infers cell-type-specific regulatory landscapes from bulk epigenomics.

Briefings in bioinformatics·2026
Same journal

A knowledge-guided deep learning framework for quantitative nucleic acid testing.

Briefings in bioinformatics·2026
Same journal

Optimal transport for label transfer in single-cell multi-omics integration.

Briefings in bioinformatics·2026
Same journal

Continuous multi-omics pathway enrichment analysis resolves hidden functional heterogeneity.

Briefings in bioinformatics·2026
Same journal

Evaluating completeness, coherence, and consistency of genome-scale function annotations.

Briefings in bioinformatics·2026
Same journal

Transformers for single-cell RNA sequencing: a survey.

Briefings in bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Dec 7, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.1K

Synthetic observations from deep generative models and binary omics data with limited sample size.

Jens Nußberger1, Frederic Boesel1, Stefan Lenz1

  • 1Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Germany.

Briefings in Bioinformatics
|October 1, 2020
PubMed
Summary
This summary is machine-generated.

Deep generative models like deep Boltzmann machines (DBMs) and variational autoencoders (VAEs) effectively represent binary omics data, even with limited samples. Generative adversarial networks (GANs) require larger datasets for stable results.

Keywords:
SNP databenchmarkingdata privacygenerative modelssynthetic data

More Related Videos

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.5K
Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

4.6K

Related Experiment Videos

Last Updated: Dec 7, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.1K
Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.5K
Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

4.6K

Area of Science:

  • Computational biology
  • Genomics
  • Machine learning

Background:

  • Deep generative models can model joint data distributions for tasks like noise removal and privacy preservation.
  • Their performance with limited sample sizes, particularly for binary omics data like single nucleotide polymorphisms (SNPs), remains under-explored.

Purpose of the Study:

  • To evaluate the effectiveness of variational autoencoders (VAEs), deep Boltzmann machines (DBMs), and generative adversarial networks (GANs) for modeling binary omics data with limited sample sizes.
  • To assess the ability of these models to recover pairwise relationships, specifically odds ratios (ORs), in SNP data.

Main Methods:

  • Investigated three deep generative models: VAEs, DBMs, and GANs.
  • Utilized both simulated and real single nucleotide polymorphism (SNP) data.
  • Assessed model performance based on the recovery of pairwise odds ratios (ORs).
  • Included conditional modeling, such as gene expression conditional on SNPs.

Main Results:

  • DBMs demonstrated ability to recover structure for up to 300 variables but tended to overestimate ORs without careful tuning.
  • VAEs accurately captured the direction and relative strength of pairwise relations but underestimated ORs.
  • GANs yielded stable results only with larger sample sizes and strong pairwise associations.

Conclusions:

  • DBMs and VAEs are suitable for binary omics data analysis, even with small sample sizes, unlike GANs.
  • These findings support the use of DBMs and VAEs for generating synthetic omics data in various applications.