Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

What is Variation?01:14

What is Variation?

17.6K
Apart from the measures of central tendency, distribution, outliers, and the changing characteristics of data with time, an important characteristic of any data set is its variation or spread. In some data sets, the data values are concentrated closely near the mean; in others, the data values are more widely spread out from the mean.
The range, standard deviation, standard error, and variance are the different measures of variation.
Range: The range is the difference between its maximum and...
17.6K
Conservative Site-specific Recombination and Phase Variation02:53

Conservative Site-specific Recombination and Phase Variation

6.7K
Because the DNA segments are cut and reorganized in a direction-specific manner, site-specific recombination has emerged as an efficient genetic engineering technique. Flippase and Cyclization recombinases or Flp and Cre, respectively, are two members of the tyrosine recombinase family derived from bacteriophages, that are used to mediate site-specific DNA insertions, deletions, and targeted expression of proteins in mammalian cell lines.
The recognition sites for Cre recombinase called LoxP...
6.7K
Variation01:19

Variation

7.7K
An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...
7.7K
Genomics02:02

Genomics

39.8K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
39.8K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

9.0K
While every living organism has a genome of some kind (be it RNA, or DNA), there is considerable variation in the sizes of these blueprints. One major factor that impacts genome size is whether the organism is prokaryotic or eukaryotic. In prokaryotes, the genome contains little to no non-coding sequence, such that genes are tightly clustered in groups or operons sequentially along the chromosome. Conversely, the genes in eukaryotes are punctuated by long stretches of non-coding sequence.
9.0K
Variation of Atmospheric Pressure01:18

Variation of Atmospheric Pressure

4.1K
Change in atmospheric pressure with height is particularly interesting. The decrease in atmospheric pressure with increasing altitude is due to the decreasing gravitational force per unit area as we move away from the surface of the earth.
Assuming the air temperature is constant at a given altitude and that the ideal gas law of thermodynamics describes the atmosphere to a good approximation, one can find the variation of atmospheric pressure with height.
Let p(y) be the atmospheric pressure at...
4.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Non-Parametric Ancestry Adjustment for Polygenic Scores.

medRxiv : the preprint server for health sciences·2026
Same author

The landscape of genomic and socioeconomic variables in colorectal cancer patients based on genetic ancestry.

Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology·2026
Same author

Graph transformer for ancient ancestry inference.

bioRxiv : the preprint server for biology·2026
Same author

The diploid reference genome of a human embryonic stem cell line.

bioRxiv : the preprint server for biology·2026
Same author

Clinical genetic variation across Hispanic populations in the Mexican Biobank.

Nature medicine·2026
Same author

Analysis of a deeply-phenotyped familial hypercholesterolemia cohort from Mexico shows a role for both rare and common alleles across known dyslipidemia genes and reveals structural variation in a novel locus.

Human genomics·2025
Same journal

A unified analysis of cell type- and trajectory-associated pathways in single-cell data using Phoenix.

Genome research·2026
Same journal

Resf1 is required for proper placental development and configuration of trophoblast cell-specific heterochromatin.

Genome research·2026
Same journal

Telomere-driven replicative crisis is driven by large-scale changes in genomic architecture.

Genome research·2026
Same journal

Spatially informed reference-free cell-type deconvolution for spatial transcriptomics with SpatialCD.

Genome research·2026
Same journal

Spatially resolved profiling of steroid nuclear receptors reveals a role for the disordered N-terminal domains in genome targeting and AP-1 interaction.

Genome research·2026
Same journal

Flexible and scalable inference of spatially varying correlation in spatial transcriptomics with spCorr.

Genome research·2026
See all related articles

Related Experiment Video

Updated: Jan 22, 2026

Ultra-long Read Sequencing for Whole Genomic DNA Analysis
10:34

Ultra-long Read Sequencing for Whole Genomic DNA Analysis

Published on: March 15, 2019

23.9K

Autoencoders for genomic variation analysis.

Margarita Geleta1,2,3, Daniel Mas Montserrat4, Xavier Giro-I-Nieto2

  • 1Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California 94305, USA; geleta@berkeley.edu.

Genome Research
|January 20, 2026
PubMed
Summary
This summary is machine-generated.

Deep learning using variational autoencoders (VAEs) effectively processes population genomics data. These models capture fine-scale genetic structure for interpretation, compression, and simulation in diverse human and canid populations.

More Related Videos

An Array-based Comparative Genomic Hybridization Platform for Efficient Detection of Copy Number Variations in Fast Neutron-induced Medicago truncatula Mutants
09:32

An Array-based Comparative Genomic Hybridization Platform for Efficient Detection of Copy Number Variations in Fast Neutron-induced Medicago truncatula Mutants

Published on: November 8, 2017

8.2K
Genome-wide Analysis using ChIP to Identify Isoform-specific Gene Targets
11:19

Genome-wide Analysis using ChIP to Identify Isoform-specific Gene Targets

Published on: July 7, 2010

15.0K

Related Experiment Videos

Last Updated: Jan 22, 2026

Ultra-long Read Sequencing for Whole Genomic DNA Analysis
10:34

Ultra-long Read Sequencing for Whole Genomic DNA Analysis

Published on: March 15, 2019

23.9K
An Array-based Comparative Genomic Hybridization Platform for Efficient Detection of Copy Number Variations in Fast Neutron-induced Medicago truncatula Mutants
09:32

An Array-based Comparative Genomic Hybridization Platform for Efficient Detection of Copy Number Variations in Fast Neutron-induced Medicago truncatula Mutants

Published on: November 8, 2017

8.2K
Genome-wide Analysis using ChIP to Identify Isoform-specific Gene Targets
11:19

Genome-wide Analysis using ChIP to Identify Isoform-specific Gene Targets

Published on: July 7, 2010

15.0K

Area of Science:

  • Genomics
  • Bioinformatics
  • Machine Learning

Background:

  • Biobanks generate vast high-resolution genomic data from diverse populations.
  • Existing algorithmic tools struggle to accurately represent complex genetic compositions in admixed populations.

Purpose of the Study:

  • To explore variational autoencoders (VAEs) for processing population genomic data.
  • To evaluate VAEs for genomic data interpretation, compression, classification, and simulation.
  • To assess VAE performance with and without ancestry conditioning.

Main Methods:

  • Applied unsupervised deep learning (VAEs) to whole genome datasets (human and canid).
  • Utilized single nucleotide polymorphisms (SNPs) for analysis.
  • Evaluated dimensionality reduction, data compression, and classification tasks.

Main Results:

  • VAEs effectively capture granular population structure and infer latent genetic factors.
  • Learned latent spaces represent distinct genetic clusters, enabling dimensionality reduction and simulation.
  • Demonstrated VAEs for lossless compression of genotype sequences, with varying compression ratios across populations.
  • Showcased differentiated classification accuracies and analyzed SNP data entropy's relation to compression and migration.

Conclusions:

  • VAEs offer a powerful unsupervised approach for analyzing population genomics.
  • The method facilitates fine-scale genetic structure detection, data compression, and simulation.
  • VAE-derived latent representations provide insights into population genetics and historical migrations.