Fast sequence clustering using a suffix array algorithm.

Ketil Malde¹, Eivind Coward, Inge Jonassen

¹Department of Informatics, University of Bergen, HIB, N5020 Norway. ketil@ii.uib.no

Bioinformatics (Oxford, England)

July 2, 2003

Summary

This summary is machine-generated.

A new algorithm efficiently clusters expressed sequence tags (ESTs) using suffix arrays, offering a faster alternative to current methods. This approach achieves sub-quadratic time complexity for large biological datasets.

Area of Science:

Bioinformatics
Computational Biology
Genomics

Background:

Efficient clustering of expressed sequence tag (EST) data is crucial for managing large biological sequence datasets.
Current clustering methods often rely on all-against-all comparisons, leading to quadratic time complexity that hinders scalability.
The rapid growth of EST data necessitates novel, more efficient computational approaches.
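To make the scaling problem concrete, the short sketch below counts how many sequence pairs an all-against-all comparison has to visit. The function names are illustrative and not taken from the paper; the only fact used is that n sequences form n(n-1)/2 unordered pairs.

```python
from itertools import combinations


def count_pairs_by_enumeration(n_sequences: int) -> int:
    """Count unordered pairs by actually enumerating them (slow, for checking)."""
    return sum(1 for _ in combinations(range(n_sequences), 2))


def count_pairs_by_formula(n_sequences: int) -> int:
    """Closed form for the number of unordered pairs: n(n-1)/2."""
    return n_sequences * (n_sequences - 1) // 2


if __name__ == "__main__":
    # The pair count grows roughly 100-fold for every 10-fold increase in n.
    for n in (10, 100, 1_000, 10_000):
        print(n, count_pairs_by_formula(n))
```

A tenfold increase in the number of ESTs therefore means roughly a hundredfold increase in pairwise comparisons, which is the growth that a sub-quadratic method aims to avoid.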

Purpose of the Study:

To introduce a novel, fast algorithm for clustering expressed sequence tag (EST) data.
To address the limitations of existing quadratic time complexity methods in handling large EST datasets.
To provide a scalable solution for EST data analysis.

Main Methods:

Development of a new EST clustering algorithm utilizing suffix arrays.
Achievement of sub-quadratic time complexity through the suffix array-based approach.
Implementation of a prototype for the developed algorithm.
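The general idea of using a suffix array for clustering can be sketched as follows. This is a simplified illustration, not the authors' algorithm: the summary does not describe their matching criterion, so the sketch assumes that two sequences belong to the same cluster when they share an exact substring of at least `min_match` characters. The function names, the `$` separator, and the `min_match` parameter are all assumptions made for the example, and the suffix array is built by naive sorting, which is only suitable for toy inputs.

```python
def build_suffix_array(text: str) -> list[int]:
    """Return suffix start positions in lexicographic order (naive, toy-sized input only)."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def cluster_by_shared_substring(seqs: list[str], min_match: int) -> list[list[int]]:
    """Group sequences that share an exact substring of length >= min_match.

    Assumes the separator '$' never occurs inside a sequence.
    """
    text = "$".join(seqs)

    # For every position in `text`, record which sequence it belongs to and
    # how many characters of that sequence remain from this position onward.
    owner: list[int] = []
    remaining: list[int] = []
    for idx, seq in enumerate(seqs):
        if idx > 0:  # position of the separator before this sequence
            owner.append(-1)
            remaining.append(0)
        for offset in range(len(seq)):
            owner.append(idx)
            remaining.append(len(seq) - offset)

    # Keep only suffixes whose first min_match characters lie inside one sequence.
    suffix_array = build_suffix_array(text)
    usable = [p for p in suffix_array if remaining[p] >= min_match]

    # Union-find over sequence indices.
    parent = list(range(len(seqs)))

    def find(x: int) -> int:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Suffixes sharing the same min_match-character prefix are adjacent in
    # sorted order, so comparing neighbours finds every shared substring
    # without comparing all pairs of sequences directly.
    for p, q in zip(usable, usable[1:]):
        if text[p:p + min_match] == text[q:q + min_match]:
            parent[find(owner[p])] = find(owner[q])

    clusters: dict[int, list[int]] = {}
    for idx in range(len(seqs)):
        clusters.setdefault(find(idx), []).append(idx)
    return sorted(clusters.values())


if __name__ == "__main__":
    toy_ests = ["ACGTACGTGG", "TTACGTGGCA", "CCCCCCAAAA", "GGCCCCCCTT", "TGTGTGTGTG"]
    print(cluster_by_shared_substring(toy_ests, min_match=6))
```

The sorted order of suffixes does the work that pairwise comparison would otherwise do: matching substrings end up next to each other, so a single pass over neighbouring entries is enough to link the sequences that contain them. A practical implementation would replace the naive sort with an efficient suffix array construction and would likely tolerate sequencing errors, neither of which this sketch attempts.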

Main Results:

The prototype implementation demonstrated promising results on a benchmark dataset.
Clusterings generated by the new algorithm were validated against existing methods.
The suffix array approach reduced the time complexity from quadratic to sub-quadratic.

Conclusions:

The presented algorithm offers an efficient and scalable solution for EST clustering.
This method is well-suited for analyzing the ever-increasing volume of EST data.
The source code is publicly available, promoting further research and application.