Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Distance Problem01:29

Distance Problem

75
When an object's velocity changes over time, the total distance traveled can be determined by summing small displacement intervals over short increments. This approach approximates the true distance through numerical summation and the use of integral calculus. An estimate of the total displacement can be obtained by measuring velocity at regular intervals and multiplying each value by the corresponding time step.If a runner accelerates over the first three seconds of a race, speed measurements...
75
The Distance Formula01:20

The Distance Formula

658
In geometry, measuring the direct distance between two points on a plane is essential in various practical and theoretical applications. Whether in navigation, engineering, or computer graphics, determining the shortest path between two locations involves using the distance formula. This formula is derived from the Pythagorean Theorem, which relates the lengths of the sides of a right triangle. On a coordinate plane, the horizontal and vertical distances between two points serve as the legs of...
658
Distance Corrections01:15

Distance Corrections

301
To achieve precise distance measurements, especially in surveying and construction, certain corrections must be applied to account for potential sources of error like the standardization errors, temperature variations, and slope adjustments.Standardization error emerges when measurement equipment undergoes changes, such as wear, repairs, or weather impacts. To address this, surveyors compare the equipment’s readings to a standard. This process identifies any deviation that might lead to...
301
Short-distance Transport of Resources02:12

Short-distance Transport of Resources

17.8K
Short-distance transport refers to transport that occurs over a distance of just 2-3 cells, crossing the plasma membrane in the process. Small uncharged molecules, such as oxygen, carbon dioxide, and water, can diffuse across the plasma membrane on their own. In contrast, ions and larger molecules require the assistance of transport proteins due to their charge or size. Transport across membranes also occurs within individual cells, playing a variety of essential roles for the plant as a whole.
17.8K
Distance Measurements by Taping01:18

Distance Measurements by Taping

479
Tapes are essential in surveying for accurate, durable, and short-distance measurements. Made from lightweight, nylon-coated steel, they offer flexibility and strength for rugged outdoor use. The nylon coating protects against rust and wear, extending the tape's life. Standard lengths, around 30 meters, are marked in meters and millimeters for precision.Surveyors select tapes based on site conditions and accuracy needs. Lightweight, nylon-coated tapes are commonly used for ease of handling and...
479
Electronic Distance Measuring Instruments01:30

Electronic Distance Measuring Instruments

541
Electronic Distance Measuring Instruments (EDMs) are essential tools in modern surveying, offering precise distance measurements by emitting electromagnetic signals and calculating the time required for these signals to travel to a target and return. Two primary types of signals are used in EDMs — light waves and microwaves — each suited to specific environmental and distance requirements. Light-wave-based EDMs utilize either infrared or laser light, providing high accuracy over...
541

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Nevermore: Target-Conditioned Protein-Ligand Representation Learning for Multi-Objective Lead Optimization with Database-Grounded Retrieval.

Biology·2026
Same author

Regulating genome language models: navigating policy challenges at the intersection of AI and genetics.

Human genetics·2025
Same author

A Systematic Review of Secondary Traumatic Stress in School Personnel: A Synthesis of Quantitative Findings.

The Journal of school health·2025
Same author

Correction: Development and extensive sequencing of a broadly-consented Genome in a Bottle matched tumor-normal pair.

Scientific data·2025
Same author

Development and extensive sequencing of a broadly-consented Genome in a Bottle matched tumor-normal pair.

Scientific data·2025
Same author

iSeqSearch: incremental protein search for iBlast/iMMSeqs2/iDiamond.

PeerJ·2025
Same journal

Bacterial communities associated with the blue dragon nudibranch <i>Glaucilla marginata</i> from Phuket, Thailand.

PeerJ·2026
Same journal

Comparison of <i>Palythoa tuberculosa</i>-dominated and hard coral-dominated benthic communities in Okinawa, Japan.

PeerJ·2026
Same journal

UBE2C contributes to malignant phenotypes in clear cell renal cell carcinoma <i>via</i> cell cycle and apoptosis regulation.

PeerJ·2026
Same journal

A case-based course teaching integrative taxonomy delimits three new ant species in the <i>Pheidole sexspinosa</i> complex (Hymenoptera, Formicidae) from Vanuatu.

PeerJ·2026
Same journal

Predicting non-surgical treatment failure in patients with spontaneous pneumothorax-the Base-C score: a retrospective study.

PeerJ·2026
Same journal

The enigmatic historical record of the leaf beetle <i>Acentroptera norrisii</i> in Spain: failed biological introduction or curatorial artefact?

PeerJ·2026
See all related articles
  1. Home
  2. Normalized Compression Distance For Dna Classification.
  1. Home
  2. Normalized Compression Distance For Dna Classification.

Related Experiment Video

Calibrated Forceps Model of Spinal Cord Compression Injury
09:41

Calibrated Forceps Model of Spinal Cord Compression Injury

Published on: April 24, 2015

19.4K

Normalized compression distance for DNA classification.

Gavin Hearne1, Mohammadsaleh S Refahi1, Haozhe Neil Duan1

  • 1Department of Electrical Engineering, Drexel University, Philadelphia, PA, United States of America.

Peerj
|February 11, 2026

View abstract on PubMed

Summary
This summary is machine-generated.

Normalized compression distance (NCD) offers a low-resource method for genomic sequence analysis. This approach shows promise in gene labeling and taxonomic classification, especially in low-data scenarios.

Keywords:
Alignment-free methodsBioinformaticsCompression distanceGenomic classificationGenomic sequence analysisGzip compressionMetagenomics

More Related Videos

Detection of Retrotransposition Activity of Hot LINE-1s by Long-Distance Inverse PCR
10:54

Detection of Retrotransposition Activity of Hot LINE-1s by Long-Distance Inverse PCR

Published on: July 27, 2019

9.1K
A Microfluidic Platform for Stimulating Chondrocytes with Dynamic Compression
07:23

A Microfluidic Platform for Stimulating Chondrocytes with Dynamic Compression

Published on: September 13, 2019

7.1K

Related Experiment Videos

Calibrated Forceps Model of Spinal Cord Compression Injury
09:41

Calibrated Forceps Model of Spinal Cord Compression Injury

Published on: April 24, 2015

19.4K
Detection of Retrotransposition Activity of Hot LINE-1s by Long-Distance Inverse PCR
10:54

Detection of Retrotransposition Activity of Hot LINE-1s by Long-Distance Inverse PCR

Published on: July 27, 2019

9.1K
A Microfluidic Platform for Stimulating Chondrocytes with Dynamic Compression
07:23

A Microfluidic Platform for Stimulating Chondrocytes with Dynamic Compression

Published on: September 13, 2019

7.1K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Genomic sequence analysis is crucial for understanding biological systems, including the human microbiome.
  • Normalized Compression Distance (NCD) is a low-resource technique that approximates Kolmogorov complexity, showing potential beyond text classification.
  • Existing tools like Many-against-Many sequence searching (MMseqs) and Kraken2 are effective but can be resource-intensive.

Purpose of the Study:

  • To explore the application of a gzip-based NCD for gene labeling of open reading frames (ORFs) and taxonomic classification of short genomic reads.
  • To evaluate NCD's performance as an alternative to deep neural networks (DNNs) and traditional sequence analysis tools in genomics.
  • To assess NCD's effectiveness in both in-distribution and out-of-distribution classification tasks.

Main Methods:

  • Implementation of a gzip-based Normalized Compression Distance (NCD) algorithm.
  • Application of NCD for gene labeling of open reading frames (ORFs).
  • Utilizing NCD for taxonomic classification of short sequencing reads.

Main Results:

  • Achieved 0.89 accuracy and 0.88 macro-F1 for human gene classification, outperforming other NCD methods.
  • Demonstrated superior classification accuracy compared to alignment and exact-match tools in out-of-distribution prokaryotic gene labeling.
  • Outperformed sequence-embedding methods in in-distribution prokaryotic gene labeling tasks.

Conclusions:

  • Compression-based approaches, specifically NCD, offer an effective alternative for genomic sequence classification.
  • NCD shows particular strength in low-data environments and out-of-distribution scenarios.
  • Computational complexity (O(MN)) is a limitation for extremely large datasets, but NCD remains a viable low-resource strategy.