Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Videos

An efficient algorithm for finding short approximate non-tandem repeats.

E F Adebiyi¹, T Jiang, M Kaufmann

¹Wilhelm-Schickard-Institut für Informatik, Universität Tübingen, Sand 13, Tübingen, 72076, Germany. adebiyi@informatik.uni-tuebingen.de

Bioinformatics (Oxford, England)

|July 27, 2001

Summary

This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

[Risk factors for an extended hospital stay after elective minimally invasive colonic surgery in Germany-Data analysis of the StuDoQ register].

Chirurgie (Heidelberg, Germany)·2026

Same author

Immune checkpoint inhibitors therapy for solid organ malignancies after allogeneic hematopoietic stem cell transplantation: a retrospective study from the EBMT Transplant Complications Working Party.

Bone marrow transplantation·2024

Same author

Investigating non-inferiority of internet-delivered versus face-to-face cognitive behavioural therapy for insomnia (CBT-I): a randomised controlled trial (iSleep well).

Trials·2024

Same author

Infantile hypercalcemia type 1 (HCINF1): a rare disease resulting in nephrolithiasis and nephrocalcinosis caused by mutations in the vitamin D catabolic enzyme, CYP24A1.

Journal of endocrinological investigation·2024

Same author

Long-term trajectories of densely reported depressive symptoms during an extended period of the COVID-19 pandemic in Switzerland: Social worries matter.

Comprehensive psychiatry·2024

Same author

Correction to: Two-stage laparoscopic transversus abdominis plane block as an equivalent alternative to thoracic epidural anaesthesia in bowel resection-an explorative cohort study.

International journal of colorectal disease·2024

Same journal

Biomedical Concept Recognition with Error-aware Negative-enhanced Ranking Framework.

Bioinformatics (Oxford, England)·2026

Same journal

TEDLH: Domain HMMs for sensitive detection of remote homologues.

Bioinformatics (Oxford, England)·2026

Same journal

PLNFGL: Joint Estimation of Multi-Condition Gene Networks from Single-cell RNA-seq Data.

Bioinformatics (Oxford, England)·2026

Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026

Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026

Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026

See all related articles

This study introduces an efficient algorithm for finding approximate repeats in long sequences, crucial for biological data analysis. The method significantly speeds up the identification of repeating patterns with variations like insertions, deletions, and mismatches.

Area of Science:

Bioinformatics
Computational Biology
Stringology

Background:

Identifying repeating patterns in biological sequences is fundamental for understanding genome structure and function.
Existing methods often struggle with approximate repeats, which include variations like insertions, deletions, and mismatches.

Purpose of the Study:

To develop an efficient algorithm for extracting approximate non-tandem repeats from long biological sequences.
To theoretically characterize the 'seeds' or maximal exact repeats required for approximate repeat detection.

Main Methods:

The algorithm leverages a theoretical characterization of maximal exact repeats (seeds).
It employs a sub-quadratic approach to identify short approximate repeats within a given threshold of differences.

Related Experiment Videos

The analysis focuses on sequences of length P with at most D differences.

Main Results:

A sublinear bound on the expected number of required seeds was proven.
An efficient sub-quadratic algorithm was presented for finding approximate repeats of length O(log N).
The algorithm's running time is O(DN(3pow(epsilon)-1)log N), where epsilon = D/P.

Conclusions:

The developed algorithm provides a significant advancement in efficiently detecting approximate repeats in large biological datasets.
This method is particularly relevant for analyzing DNA and protein sequences where variations are common.