Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

The Nucleosome

The Nucleosome

Human DNA is almost two meters long. However, it is compressed inside a tiny nucleus measuring only a few microns in diameter. To make this degree of compaction possible, DNA is organized into several sequential levels so that it can fit into such a tiny space. The most compact form of DNA is a chromosome that can be seen under a microscope in a dividing cell.
In a chromosome, DNA is wound twice around a protein complex called a histone octamer core, which consists of 8 histone proteins. This...

The Nucleosome

The Nucleosome

DNA in a human cell is almost 2m long and it is packed inside a tiny nucleus that is only a few microns in diameter. The level of compaction of DNA inside the nucleus is astonishing. It is organized into several sequentially higher levels of compaction to fit into such a tiny space. The most compact form of DNA is a chromosome that can be seen under a microscope in a dividing cell.
DNA is wound twice around a protein complex called histone core, that consist of 8 histone proteins. This complex...

Sanger Sequencing

Sanger Sequencing

DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...

DNA Isolation

DNA Isolation

DNA isolation protocols can be fast and straightforward or complex and time-consuming depending on the type and quality of DNA required for further processing. For example, plasmid DNA extraction is a bit more complicated than genomic DNA extraction because of the need for an appropriate lysis method to separate plasmid DNA from gDNA during isolation. However, for specific applications, such as long-range DNA sequencing that require a good yield of high- quality DNA samples, we need to follow...

DNA Isolation

DNA Isolation

DNA from cells is required for many biotechnology and research applications, such as molecular cloning. To remove and purify DNA from cells, researchers use various methods of DNA extraction. While the specifics of different protocols may vary, some general concepts underlie the process of DNA extraction.

Multi-species Conserved Sequences

Multi-species Conserved Sequences

Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Impact of Hand Fractures on Return to Skate, Performance, Time on Ice, and Physicality in the National Hockey League.

Cureus·2026

Same author

Advances in Congestion Assessment in Decompensated Heart Failure.

Cardiac failure review·2026

Same author

Sural Nerve Schwannoma in the Setting of Chronic Lateral Ankle Instability: A Case Report.

Foot & ankle specialist·2025

Same author

Chemprop v2: An Efficient, Modular Machine Learning Package for Chemical Property Prediction.

Journal of chemical information and modeling·2025

Same author

Finding low-complexity DNA sequences with longdust.

ArXiv·2025

Same author

A novel treatment score (QUAD score) to promote treatment optimization in heart failure with a reduced ejection fraction.

ESC heart failure·2025

Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026

Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026

Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026

Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026

Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026

Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 15, 2026

Ultra-long Read Sequencing for Whole Genomic DNA Analysis

Ultra-long Read Sequencing for Whole Genomic DNA Analysis

Published on: March 15, 2019

Finding low-complexity DNA sequences with longdust.

Heng Li^1,2,3, Brian Li⁴

¹Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02215, United States.

Bioinformatics (Oxford, England)

|March 14, 2026

Summary

This summary is machine-generated.

Longdust efficiently identifies low-complexity DNA sequences, such as satellite and tandem repeats. This new algorithm improves variant calling accuracy by statistically modeling sequence complexity.

More Related Videos

Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing

Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing

Published on: March 31, 2017

Application of DNA Fingerprinting using the D1S80 Locus in Lab Classes

Application of DNA Fingerprinting using the D1S80 Locus in Lab Classes

Published on: July 17, 2021

Related Experiment Videos

Last Updated: Mar 15, 2026

Ultra-long Read Sequencing for Whole Genomic DNA Analysis

Ultra-long Read Sequencing for Whole Genomic DNA Analysis

Published on: March 15, 2019

Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing

Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing

Published on: March 31, 2017

Application of DNA Fingerprinting using the D1S80 Locus in Lab Classes

Application of DNA Fingerprinting using the D1S80 Locus in Lab Classes

Published on: July 17, 2021

Area of Science:

Genomics
Bioinformatics

Background:

Low-complexity (LC) DNA sequences are repetitive and can cause errors in genetic analysis.
Existing algorithms for identifying LC sequences are often imprecise or inefficient.

Purpose of the Study:

To introduce Longdust, a novel algorithm for efficient identification of long low-complexity DNA sequences.
To improve the accuracy of variant calling by addressing artifacts caused by LC sequences.

Main Methods:

Developed Longdust, an algorithm defining string complexity via statistical modeling of k-mer count distribution.
Utilized parameters including k-mer length, context window size, and complexity threshold.
Implemented and tested on real genomic data.

Main Results:

Longdust efficiently identifies long LC sequences, including centromeric satellite and tandem repeats.
The algorithm demonstrates high performance and consistency with established methods.
Provides a statistically rigorous definition of sequence complexity.

Conclusions:

Longdust offers an efficient and accurate method for detecting low-complexity DNA sequences.
This tool can mitigate variant calling artifacts, improving genomic analysis reliability.
The algorithm's statistical approach provides a robust measure of sequence complexity.