Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

The Nucleosome01:19

The Nucleosome

4.6K
Human DNA is almost two meters long. However, it is compressed inside a tiny nucleus measuring only a few microns in diameter. To make this degree of compaction possible, DNA is organized into several sequential levels so that it can fit into such a tiny space. The most compact form of DNA is a chromosome that can be seen under a microscope in a dividing cell.
In a chromosome, DNA is wound twice around a protein complex called a histone octamer core, which consists of 8 histone proteins. This...
4.6K
The Nucleosome02:33

The Nucleosome

19.6K
DNA in a human cell is almost 2m long and it is packed inside a tiny nucleus that is only a few microns in diameter. The level of compaction of DNA inside the nucleus is astonishing. It is organized into several sequentially higher levels of compaction to fit into such a tiny space. The most compact form of DNA is a chromosome that can be seen under a microscope in a dividing cell.
DNA is wound twice around a protein complex called histone core, that consist of 8 histone proteins. This complex...
19.6K
Sanger Sequencing01:57

Sanger Sequencing

777.3K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
777.3K
DNA Isolation01:24

DNA Isolation

45.8K
DNA isolation protocols can be fast and straightforward or complex and time-consuming depending on the type and quality of DNA required for further processing. For example, plasmid DNA extraction is a bit more complicated than genomic DNA extraction because of the need for an appropriate lysis method to separate plasmid DNA from gDNA during isolation. However, for specific applications, such as long-range DNA sequencing that require a good yield of high- quality DNA samples, we need to follow...
45.8K
DNA Isolation01:34

DNA Isolation

201.9K
DNA from cells is required for many biotechnology and research applications, such as molecular cloning. To remove and purify DNA from cells, researchers use various methods of DNA extraction. While the specifics of different protocols may vary, some general concepts underlie the process of DNA extraction.
201.9K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.9K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Impact of Hand Fractures on Return to Skate, Performance, Time on Ice, and Physicality in the National Hockey League.

Cureus·2026
Same author

Advances in Congestion Assessment in Decompensated Heart Failure.

Cardiac failure review·2026
Same author

Sural Nerve Schwannoma in the Setting of Chronic Lateral Ankle Instability: A Case Report.

Foot & ankle specialist·2025
Same author

Chemprop v2: An Efficient, Modular Machine Learning Package for Chemical Property Prediction.

Journal of chemical information and modeling·2025
Same author

Finding low-complexity DNA sequences with longdust.

ArXiv·2025
Same author

A novel treatment score (QUAD score) to promote treatment optimization in heart failure with a reduced ejection fraction.

ESC heart failure·2025
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Mar 15, 2026

Ultra-long Read Sequencing for Whole Genomic DNA Analysis
10:34

Ultra-long Read Sequencing for Whole Genomic DNA Analysis

Published on: March 15, 2019

24.2K

Finding low-complexity DNA sequences with longdust.

Heng Li1,2,3, Brian Li4

  • 1Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02215, United States.

Bioinformatics (Oxford, England)
|March 14, 2026
PubMed
Summary
This summary is machine-generated.

Longdust efficiently identifies low-complexity DNA sequences, such as satellite and tandem repeats. This new algorithm improves variant calling accuracy by statistically modeling sequence complexity.

More Related Videos

Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing
06:56

Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing

Published on: March 31, 2017

12.3K
Application of DNA Fingerprinting using the D1S80 Locus in Lab Classes
08:35

Application of DNA Fingerprinting using the D1S80 Locus in Lab Classes

Published on: July 17, 2021

23.4K

Related Experiment Videos

Last Updated: Mar 15, 2026

Ultra-long Read Sequencing for Whole Genomic DNA Analysis
10:34

Ultra-long Read Sequencing for Whole Genomic DNA Analysis

Published on: March 15, 2019

24.2K
Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing
06:56

Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing

Published on: March 31, 2017

12.3K
Application of DNA Fingerprinting using the D1S80 Locus in Lab Classes
08:35

Application of DNA Fingerprinting using the D1S80 Locus in Lab Classes

Published on: July 17, 2021

23.4K

Area of Science:

  • Genomics
  • Bioinformatics

Background:

  • Low-complexity (LC) DNA sequences are repetitive and can cause errors in genetic analysis.
  • Existing algorithms for identifying LC sequences are often imprecise or inefficient.

Purpose of the Study:

  • To introduce Longdust, a novel algorithm for efficient identification of long low-complexity DNA sequences.
  • To improve the accuracy of variant calling by addressing artifacts caused by LC sequences.

Main Methods:

  • Developed Longdust, an algorithm defining string complexity via statistical modeling of k-mer count distribution.
  • Utilized parameters including k-mer length, context window size, and complexity threshold.
  • Implemented and tested on real genomic data.

Main Results:

  • Longdust efficiently identifies long LC sequences, including centromeric satellite and tandem repeats.
  • The algorithm demonstrates high performance and consistency with established methods.
  • Provides a statistically rigorous definition of sequence complexity.

Conclusions:

  • Longdust offers an efficient and accurate method for detecting low-complexity DNA sequences.
  • This tool can mitigate variant calling artifacts, improving genomic analysis reliability.
  • The algorithm's statistical approach provides a robust measure of sequence complexity.