
Structural variation in 1,019 diverse humans based on long-read sequencing.

Siegfried Schloissnig1, Samarendra Pani2,3, Jana Ebler2,3

  • 1Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Vienna, Austria.

Nature | July 23, 2025

Summary
This summary is machine-generated.

Long-read sequencing of 1,019 humans revealed over 100,000 genomic structural variants (SVs) and 300,000 tandem repeats. This advances understanding of genetic diversity and disease by characterizing SVs across diverse populations.

Area of Science:

  • Human evolutionary genetics and population genomics.
  • Bioinformatics focusing on genomic structural variants and graph-based genome analysis.
  • Molecular biology of retrotransposition and mobile genetic elements.

Background:

The human genome contains extensive structural differences that influence phenotypic diversity and clinical outcomes. Prior research has shown that these large-scale alterations, conventionally defined as spanning 50 base pairs or more, account for a significant portion of the genetic variation between individuals. Short-read technologies frequently fail to resolve the complex or repetitive regions where these variants cluster. Existing databases therefore lacked the resolution to capture the full spectrum of insertions, inversions, and tandem repeats across global populations, and many pathogenic variants are thought to remain hidden in these poorly resolved regions. Understanding the evolutionary history of these rearrangements requires a more granular view than was previously available. This gap motivated the use of long-read sequencing to catalog these elusive genomic features.

Based on this study's findings, long interspersed nuclear element-1 (L1) and SINE-VNTR-Alu (SVA) retrotransposition activities mediate the transduction of unique sequence stretches. These transductions occur at the 5' or 3' end of the mobile element, depending on the source element class and the locus involved.

The researchers uncovered over 100,000 sequence-resolved biallelic structural variants and genotyped 300,000 multiallelic variable number of tandem repeats. These findings were derived from an intermediate-coverage resource encompassing 1,019 diverse humans from 26 distinct populations within the 1000 Genomes Project.

The study integrated graph genome-based analyses with linear methods to resolve complex structural variants that short-read surveys often miss. This dual approach enabled the characterization of over 100,000 biallelic variants and 300,000 multiallelic variable number of tandem repeats across diverse human populations.

The findings are confined to an intermediate-coverage resource based on 1,019 individuals from 26 populations within the 1000 Genomes Project. While it advances structural variant characterization, the authors suggest that further investigation is required to prioritize variants in specific patient genomes.

The study's authors propose that this open-access resource underscores the value of long-read sequencing in advancing structural variant characterization. They conclude that the dataset can guide variant prioritization in patient genomes, potentially improving diagnostic accuracy for genetic diseases linked to complex rearrangements.

Purpose Of The Study:

This investigation sought to establish a comprehensive, sequence-resolved resource of structural diversity across a globally representative cohort. Researchers aimed to leverage advanced sequencing technologies to overcome the limitations of previous population-scale surveys. The project focused on identifying biallelic and multiallelic variations within twenty-six distinct human groups. Scientists intended to clarify the mechanisms driving the formation of deletions, duplications, and mobile element insertions. The team prioritized the creation of a reference that accounts for the unique genetic backgrounds of diverse ethnic lineages. By mapping these variants, the study intended to bridge the gap between raw sequence data and functional biological insights. The team worked to provide an open-access framework for prioritizing variants in clinical diagnostics.

Main Methods:

The study used long-read sequencing to generate intermediate-coverage data for 1,019 participants representing twenty-six global populations. Bioinformaticians integrated linear reference alignments with graph genome-based analyses to detect complex rearrangements across the human genome. The pipeline genotyped 300,000 multiallelic variable number of tandem repeats and characterized retrotransposon-mediated events. Breakpoints of deletions and insertions were analyzed to identify underlying mutational signatures and homology-mediated processes. All samples derive from the 1000 Genomes Project, which anchors the findings in a well-characterized and diverse cohort. The analyses distinguished between biallelic and multiallelic states in highly repetitive or complex loci, and the frequency of structural variants was assessed across the populations sampled in the cohort.
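The distinction between biallelic and multiallelic sites mentioned above can be illustrated with a minimal Python sketch. It is not the study's pipeline: the `VariantSite` record, the helper names, and the 50 bp cutoff are assumptions made for illustration, and a site is treated as biallelic simply when it carries exactly one alternate allele.

```python
from dataclasses import dataclass
from typing import Tuple

# Assumed size cutoff for calling a variant "structural" (hypothetical constant).
SV_MIN_LENGTH = 50


@dataclass(frozen=True)
class VariantSite:
    """Hypothetical record for one variant site: a reference allele plus alternates."""
    site_id: str
    ref: str
    alts: Tuple[str, ...]

    def __post_init__(self) -> None:
        if not self.alts:
            raise ValueError("a variant site needs at least one alternate allele")


def allelic_state(site: VariantSite) -> str:
    """Biallelic = reference plus exactly one alternate; otherwise multiallelic."""
    return "biallelic" if len(site.alts) == 1 else "multiallelic"


def max_length_change(site: VariantSite) -> int:
    """Largest absolute length difference between any alternate and the reference."""
    return max(abs(len(alt) - len(site.ref)) for alt in site.alts)


def is_structural(site: VariantSite, min_length: int = SV_MIN_LENGTH) -> bool:
    """Size-based check only; balanced events such as inversions change no length
    and would need a separate test that this sketch does not attempt."""
    return max_length_change(site) >= min_length


# Usage: a 60 bp insertion and a short tandem-repeat site with two alternates.
insertion = VariantSite("example_ins", "A", ("A" + "T" * 60,))
repeat_site = VariantSite("example_repeat", "ACAC", ("ACACAC", "AC"))

for site in (insertion, repeat_site):
    print(site.site_id, allelic_state(site), is_structural(site))
```

Real tandem-repeat genotyping works on much longer alleles and on read evidence instead of literal allele strings; the sketch only shows how the two allelic states differ in a data model.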

Main Results:

The analysis uncovered more than 100,000 sequence-resolved biallelic structural variants across the diverse cohort of 1,019 individuals. Genotyping covered 300,000 multiallelic variable number of tandem repeats across the twenty-six human populations. Long interspersed nuclear element-1 (L1) and SINE-VNTR-Alu (SVA) retrotransposition activities were found to mediate the transduction of unique sequence stretches. These transductions occurred at the 5' or 3' end of the element, depending on the source mobile element class and locus. Breakpoint analyses indicated that homology-mediated processes contribute substantially to structural variant formation and to recurrent deletion events. The data showed that insertions and inversions are distributed unevenly across chromosomal regions and population groups. The study resolved complex loci that were largely inaccessible to the short-read sequencing platforms used in earlier surveys.

Conclusions:

The resulting dataset provides a detailed view of structural variation in the human genome across global populations. These findings show that long-read technologies capture variation missed by earlier short-read surveys. The cataloged variants offer a foundation for understanding how structural changes influence disease susceptibility and genetic diversity. The resource can help prioritize candidate variants in patient genomes for diagnostic purposes, although the authors note that further investigation is needed for specific patient genomes. Future genomic studies will likely rely on such high-resolution maps to interpret the functional impact of non-coding variation. Because the resource is open access, researchers worldwide can integrate these findings into their own analysis pipelines. This work marks a step toward a more inclusive and accurate representation of the global human pangenome.