Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Trial and Error and Algorithm01:12

Trial and Error and Algorithm

403
A problem-solving strategy is a plan of action used to find a solution. Different strategies have distinct action plans. Trial and error involves trying different solutions until one works. For instance, to fix a broken printer, you might check ink levels, ensure the paper tray isn't jammed, and verify the printer's connection to your laptop. This method can be time-consuming but is commonly used. Thomas Edison, for example, used trial and error to find a suitable filament for the light...
403
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

8.0K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
8.0K
Cluster Sampling Method01:20

Cluster Sampling Method

14.2K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
14.2K
Vesicular Tubular Clusters01:45

Vesicular Tubular Clusters

3.1K
After budding out from the ER membrane, some COPII vesicles lose their coat and fuse with one another to form larger vesicles and interconnected tubules called vesicular tubular clusters or VTCs. These clusters constitute a compartment at the ER-Golgi interface known as ERGIC (Endoplasmic Reticulum Golgi Intermediate Compartment). The ERGIC is a mobile membrane-bound cargo transport system that sorts proteins secreted from ER and delivers them to the Golgi.
With the help of motor proteins such...
3.1K
Fast Fourier Transform01:10

Fast Fourier Transform

927
The Fast Fourier Transform (FFT) is a computational algorithm designed to compute the Discrete Fourier Transform (DFT) efficiently. By breaking down the calculations into smaller, manageable sections, the FFT significantly reduces the computational complexity involved. Direct computation of an N-point DFT requires N2 complex multiplications, whereas the FFT algorithm needs only (N/2)log⁡2N multiplications, offering a much faster performance.
The computational efficiency of the FFT becomes...
927
Cis-regulatory Sequences02:02

Cis-regulatory Sequences

11.6K
Cis-regulatory sequences are short fragments of non-coding DNA that are present on the same chromosomes as the genes that they regulate. These fragments serve as binding sites for transcriptional regulators, proteins that are responsible for controlling gene transcription and differential gene expression across cell types in eukaryotes. Cis-regulatory sequences can be close to the gene of interest or thousands of bases away in the DNA sequence; however, those sequences that are further away are...
11.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Genome-wide association meta-analyses over one million individuals identify 54 loci associated with urinary incontinence and its subtypes.

medRxiv : the preprint server for health sciences·2026
Same author

The HUNT study identifies host genetic factors reproducibly associated with human gut microbiota composition.

Nature genetics·2026
Same author

Genome-wide association analyses highlight the role of the intestinal molecular environment in human gut microbiota variation.

Nature genetics·2026
Same author

Transcriptome analysis reveals effects of ethynylestradiol and bisphenol A on multiple endocrine and metabolic pathways in the pituitary and liver of female Atlantic cod (<i>Gadus morhua</i>).

Frontiers in endocrinology·2025
Same author

High SHBG and Low Bioavailable Testosterone are Strongly Causally Associated with Increased Forearm Fracture Risk in Women: An MR Study Leveraging Novel Female-Specific Data.

Calcified tissue international·2024
Same author

Publisher Correction: A plasma protein-based risk score to predict hip fractures.

Nature aging·2024
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Jan 23, 2026

Author Spotlight: Advancements in Molecular Biomarker Testing for Non-Squamous Non-Small Cell Lung Cancer
07:59

Author Spotlight: Advancements in Molecular Biomarker Testing for Non-Squamous Non-Small Cell Lung Cancer

Published on: September 8, 2023

1.6K

Fast sequence clustering using a suffix array algorithm.

Ketil Malde1, Eivind Coward, Inge Jonassen

  • 1Department of Informatics, University of Bergen, HIB, N5020 Norway. ketil@ii.uib.no

Bioinformatics (Oxford, England)
|July 2, 2003
PubMed
Summary
This summary is machine-generated.

A new algorithm efficiently clusters expressed sequence tags (ESTs) using suffix arrays, offering a faster alternative to current methods. This approach achieves sub-quadratic time complexity for large biological datasets.

More Related Videos

An Array-based Comparative Genomic Hybridization Platform for Efficient Detection of Copy Number Variations in Fast Neutron-induced Medicago truncatula Mutants
09:32

An Array-based Comparative Genomic Hybridization Platform for Efficient Detection of Copy Number Variations in Fast Neutron-induced Medicago truncatula Mutants

Published on: November 8, 2017

8.2K
Ligand Nano-cluster Arrays in a Supported Lipid Bilayer
10:34

Ligand Nano-cluster Arrays in a Supported Lipid Bilayer

Published on: April 23, 2017

7.3K

Related Experiment Videos

Last Updated: Jan 23, 2026

Author Spotlight: Advancements in Molecular Biomarker Testing for Non-Squamous Non-Small Cell Lung Cancer
07:59

Author Spotlight: Advancements in Molecular Biomarker Testing for Non-Squamous Non-Small Cell Lung Cancer

Published on: September 8, 2023

1.6K
An Array-based Comparative Genomic Hybridization Platform for Efficient Detection of Copy Number Variations in Fast Neutron-induced Medicago truncatula Mutants
09:32

An Array-based Comparative Genomic Hybridization Platform for Efficient Detection of Copy Number Variations in Fast Neutron-induced Medicago truncatula Mutants

Published on: November 8, 2017

8.2K
Ligand Nano-cluster Arrays in a Supported Lipid Bilayer
10:34

Ligand Nano-cluster Arrays in a Supported Lipid Bilayer

Published on: April 23, 2017

7.3K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Efficient clustering of expressed sequence tag (EST) data is crucial for managing large biological sequence datasets.
  • Current clustering methods often rely on all-against-all comparisons, leading to quadratic time complexity that hinders scalability.
  • The rapid growth of EST data necessitates novel, more efficient computational approaches.

Purpose of the Study:

  • To introduce a novel, fast algorithm for clustering expressed sequence tag (EST) data.
  • To address the limitations of existing quadratic time complexity methods in handling large EST datasets.
  • To provide a scalable solution for EST data analysis.

Main Methods:

  • Development of a new EST clustering algorithm utilizing suffix arrays.
  • Achieving sub-quadratic time complexity through the suffix array-based approach.
  • Implementation of a prototype for the developed algorithm.

Main Results:

  • The prototype implementation demonstrated promising results on a benchmark dataset.
  • Clusterings generated by the new algorithm were validated against existing methods.
  • The suffix array approach successfully reduced computational time complexity.

Conclusions:

  • The presented algorithm offers an efficient and scalable solution for EST clustering.
  • This method is well-suited for analyzing the ever-increasing volume of EST data.
  • The source code is publicly available, promoting further research and application.