Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

16.7K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
16.7K
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

12.6K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
12.6K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

2.5K
2.5K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

7.5K
While every living organism has a genome of some kind (be it RNA, or DNA), there is considerable variation in the sizes of these blueprints. One major factor that impacts genome size is whether the organism is prokaryotic or eukaryotic. In prokaryotes, the genome contains little to no non-coding sequence, such that genes are tightly clustered in groups or operons sequentially along the chromosome. Conversely, the genes in eukaryotes are punctuated by long stretches of non-coding sequence.
7.5K
Genomics02:02

Genomics

35.5K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
35.5K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Stabilizing and directional selection shape the genetic variation of gene expression in the transcriptome of Chlamydomonas.

Molecular biology and evolution·2026
Same author

Genetic prediction with ARG-powered linear algebra.

Genetics·2026
Same author

Tracing the evolutionary histories of ultra-rare variants using variational dating of large ancestral recombination graphs.

bioRxiv : the preprint server for biology·2026
Same author

A Pandemic-Scale Ancestral Recombination Graph for SARS-CoV-2.

bioRxiv : the preprint server for biology·2025
Same author

On ARGs, pedigrees, and genetic relatedness matrices.

Genetics·2025
Same author

Accessible, Realistic Genome Simulation with Selection Using stdpopsim.

Molecular biology and evolution·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: May 5, 2026

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

9.8K

Processing genome scale tabular data with wormtable.

Jerome Kelleher1, Rob W Ness, Daniel L Halligan

  • 1University of Edinburgh, King's Buildings, West Mains Road, Edinburgh, EH9 3JT, UK. jerome.kelleher@ed.ac.uk.

BMC Bioinformatics
|December 7, 2013
PubMed
Summary
This summary is machine-generated.

Researchers can now analyze large biological datasets faster and more easily with wormtable, a new Python library. This tool offers efficient data access and indexing for tabular files, simplifying complex analyses.

More Related Videos

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.1K
Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens
09:14

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens

Published on: June 28, 2018

6.7K

Related Experiment Videos

Last Updated: May 5, 2026

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

9.8K
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.1K
Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens
09:14

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens

Published on: June 28, 2018

6.7K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Data Science

Background:

  • Modern biological research generates massive datasets, posing significant analytical challenges.
  • Tabular data in plain text files requires slow, error-prone line-by-line parsing.
  • Lack of simple indexing hinders rapid retrieval of specific data rows.

Purpose of the Study:

  • Introduce a novel data format and software library, wormtable, for efficient tabular data access in Python.
  • Address the challenges of large-scale biological data analysis.
  • Simplify and accelerate data processing for researchers.

Main Methods:

  • Developed wormtable, a Python library with a compact binary format for tabular data.
  • Implemented random access to rows and sophisticated column indexing.
  • Provided conversion utilities for common formats like VCF and GTF.

Main Results:

  • Wormtable enables data processing orders of magnitude faster than text parsing.
  • Indexing facilities allow efficient data subset access and column summarization.
  • Simplified analysis code by eliminating the need for third-party parsers.

Conclusions:

  • Wormtable significantly reduces code complexity and increases performance for biological data analysis.
  • The library offers greater user freedom in exploring large datasets.
  • Streamlined data handling and analysis workflows for researchers.