Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

8.9K
While every living organism has a genome of some kind (be it RNA, or DNA), there is considerable variation in the sizes of these blueprints. One major factor that impacts genome size is whether the organism is prokaryotic or eukaryotic. In prokaryotes, the genome contains little to no non-coding sequence, such that genes are tightly clustered in groups or operons sequentially along the chromosome. Conversely, the genes in eukaryotes are punctuated by long stretches of non-coding sequence.
8.9K
Interval Level of Measurement00:55

Interval Level of Measurement

17.8K
For effective statistical analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
Data measured using the interval scale are similar to ordinal level data because they have a definite arrangement. However, in the interval level of measurement, the differences between data values are meaningful even though the data does not have a starting point.
Temperature is measured using the interval scale. It is measurable data, and the difference between...
17.8K
Gene Duplication and Divergence02:37

Gene Duplication and Divergence

7.7K
The seminal work of Ohno in 1970 popularized the idea of gene duplication and divergence. DNA sequence comparison studies reveal that a large portion of the genes in bacteria, archaebacteria, and eukaryotes was  generated by gene duplication and divergence, indicating its critical role in evolution.
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are...
7.7K
Genetic Variation01:25

Genetic Variation

1.1K
Genetic variation is the diversity in DNA sequences found among individuals of the same species. This diversity is crucial for a species' survival because it helps organisms adapt to environmental changes. Genetic variation begins with fertilization, where an egg and sperm cell merge. Each of these cells carries 23 chromosomes, up to 46 in the fertilized egg. Chromosomes are long DNA strands that contain genes, the basic units of heredity.
Genes exist in different versions called alleles,...
1.1K
Maxam-Gilbert Sequencing01:05

Maxam-Gilbert Sequencing

12.5K
In the same year as the discovery of the Sanger sequencing method, another group of scientists, Allan Maxam and Walter Gilbert, demonstrated their chemical-cleavage method for DNA sequencing. The Maxam-Gilbert method relies on using different chemicals that can cleave the DNA sequence at specific sites, the separation of resulting DNA fragments of variable size using electrophoresis, and deciphering the DNA sequence from the resulting gel bands.
Challenges of the Maxam-Gilbert Method
The...
12.5K
Genetic Screens02:46

Genetic Screens

5.5K
Genetic screens are tools used to identify genes and mutations responsible for phenotypes of interest. Genetic screens help identify individuals or a group of people at risk of developing  genetic diseases and help them with early intervention, targeted therapy, and reproductive options.
Forward genetic screens
Forward or “classical” genetic screens involve creating random mutations in an organism’s DNA using radiation, mutagens, or insertion of additional bases, which...
5.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Automated biomedical hypothesis generation with time-aware hypergraph contrastive learning.

Knowledge and information systems·2026
Same author

Carbon peaking pathways for topographic-constrained megacities: multi-scenario simulations and regional comparisons based on Chongqing.

Scientific reports·2026
Same author

Biallelic DAW1 variants reveal tissue-specific role in heterotaxy without primary ciliary dyskinesia.

Research square·2026
Same author

Patient-informed CRISPR screen identifies FLNB as a congenital heart disease and ciliopathy gene.

HGG advances·2026
Same author

Compound heterozygous DAW1 variants reveal tissue-specific roles in left-right patterning and congenital heart disease without primary ciliary dyskinesia.

medRxiv : the preprint server for health sciences·2026
Same author

Clinical characteristics, management, and hematopoietic cell transplantation of patients with TLR8 gain-of-function.

Blood advances·2025
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Jan 4, 2026

Identification of Functionally-Relevant Lentivirus Integration Sites in an Insertional Mutagenesis Cell Library
07:28

Identification of Functionally-Relevant Lentivirus Integration Sites in an Insertional Mutagenesis Cell Library

Published on: January 10, 2025

642

Augmented Interval List: a novel data structure for efficient genomic interval search.

Jianglin Feng1, Aakrosh Ratan1,2,3, Nathan C Sheffield1,2,3,4

  • 1Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA.

Bioinformatics (Oxford, England)
|June 1, 2019
PubMed
Summary
This summary is machine-generated.

We developed the Augmented Interval List (AIList), a new data structure for efficiently searching genomic interval data. AIList offers faster query times and reduced memory usage compared to existing methods, improving scalability for large genomic datasets.

More Related Videos

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.7K
Competitive Genomic Screens of Barcoded Yeast Libraries
11:59

Competitive Genomic Screens of Barcoded Yeast Libraries

Published on: August 11, 2011

18.7K

Related Experiment Videos

Last Updated: Jan 4, 2026

Identification of Functionally-Relevant Lentivirus Integration Sites in an Insertional Mutagenesis Cell Library
07:28

Identification of Functionally-Relevant Lentivirus Integration Sites in an Insertional Mutagenesis Cell Library

Published on: January 10, 2025

642
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.7K
Competitive Genomic Screens of Barcoded Yeast Libraries
11:59

Competitive Genomic Screens of Barcoded Yeast Libraries

Published on: August 11, 2011

18.7K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Genomic data is commonly stored as intervals, necessitating efficient interval comparison for analysis.
  • The increasing volume of genomic data requires scalable search methods.

Purpose of the Study:

  • To introduce a novel data structure, the Augmented Interval List (AIList), for efficient interval intersection queries.
  • To improve the performance and scalability of fundamental genomic analysis operations.

Main Methods:

  • The AIList is constructed by sorting intervals by start coordinate, decomposing into flattened sublists, and augmenting with running maximum end coordinates.
  • Querying involves enumerating intersections between a query interval and the interval set R.

Main Results:

  • AIList query time complexity is O(log2N+n+m).
  • AIList demonstrated 5-18 times faster performance than standard methods like BEDTools on real genomic datasets.
  • AIList achieved 4-60% lower memory usage compared to other methods for large datasets.

Conclusions:

  • The AIList data structure provides a significant improvement for fundamental interval search operations in genomics.
  • AIList offers a highly scalable solution for analyzing large-scale genomic interval data.