Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Clustering DNA sequences by feature vectors.

Libin Liu1, Yee-kin Ho, Stephen Yau

  • 1Department of Mathematics, Statistics and Computer Science, University of Illinois at Chicago, M/C 249 Chicago, IL 60607-7045, USA.

Molecular Phylogenetics and Evolution
|July 18, 2006
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Finite element-guided optimization of personalized vacuum bell devices for pectus excavatum: an exploratory retrospective clinical study.

Pediatric surgery international·2026
Same author

Delta radiomics for predicting early radiation-induced lung injury after thoracic radiotherapy: a retrospective paired-CT study.

Strahlentherapie und Onkologie : Organ der Deutschen Rontgengesellschaft ... [et al]·2026
Same author

Non-caveolar caveolin-1 in retinal Müller glia promotes innate immune responses.

The Journal of biological chemistry·2026
Same author

Development of a machine learning-based prediction model for hypothyroidism-associated delirium in elderly hypothyroid patients in the intensive care unit.

BMC geriatrics·2026
Same author

Non-high-density lipoprotein cholesterol to high-density lipoprotein cholesterol ratio is associated with cardiovascular outcomes after percutaneous coronary intervention in patients with type 2 diabetes.

Frontiers in endocrinology·2026
Same author

Interfacial flow-kinetics coupling enables nanofiltration with decoupled antibiotic rejection and salt permeation in high-salinity wastewater.

Water research·2026
Same journal

Phylogenomic assessment of microhylid frogs reveals widespread taxonomic confusion in the Asterophryinae and establishes the timing of diversification in Australia.

Molecular phylogenetics and evolution·2026
Same journal

Phylogenomics, introgression, and demographic history of extant flamingos (Aves: Phoenicopteridae).

Molecular phylogenetics and evolution·2026
Same journal

The biogeographic and evolutionary history of Leiosauridae lizards reveal the influence of complex environmental and geological past dynamics in the Neotropical biomes.

Molecular phylogenetics and evolution·2026
Same journal

Rapid ecological speciation in gall inducers.

Molecular phylogenetics and evolution·2026
Same journal

Integrative systematics reveal the deep evolutionary origin and familial placements of the wirrahs (Teleostei: Acanthistiidae: Acanthistius) and the toadstool grouper (Teleostei: Trachypomatidae fam. nov.: Trachypoma).

Molecular phylogenetics and evolution·2026
Same journal

Geographic distance and climate-mediated landscape resistance explain the genetic differentiation of Rhopalus sapporensis across China.

Molecular phylogenetics and evolution·2026
See all related articles

This study introduces a novel genomic space for comparing millions of DNA sequences by representing them as 12D vectors. This method effectively clusters homologous sequences, aiding in gene discovery and property prediction.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Global DNA sequence comparison is computationally intensive.
  • Existing methods may not efficiently handle millions of genes simultaneously.
  • Understanding gene relationships is crucial for biological research.

Purpose of the Study:

  • To develop a new genomic space for simultaneous global DNA sequence comparison.
  • To represent DNA sequences in a high-dimensional vector space for clustering homologous sequences.
  • To facilitate the discovery and property prediction of new genes.

Main Methods:

  • Representing DNA sequences as 12-dimensional vectors based on nucleotide content and distribution.
  • Calculating vector distances to cluster homologous sequences.

Related Experiment Videos

  • Applying the method to gene families like myoglobin, beta-globin, histone-4, lysozyme, and rhodopsin.
  • Comparing results with the BLAST method.
  • Main Results:

    • Homologous DNA sequences cluster together in the 12D genomic space.
    • Vector distances effectively differentiate members within and between gene families.
    • The method distinguishes random sequences with similar base composition.
    • Sequence comparison results align with the established BLAST method.
    • New genes can be located in this space to predict their properties based on proximity to known genes.

    Conclusions:

    • The proposed 12D genomic space enables efficient global comparison of millions of DNA sequences.
    • This approach facilitates the identification and functional prediction of novel genes.
    • The method offers a consistent and potentially more efficient alternative to existing sequence comparison tools.