Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...
Maxam-Gilbert Sequencing01:05

Maxam-Gilbert Sequencing

In the same year as the discovery of the Sanger sequencing method, another group of scientists, Allan Maxam and Walter Gilbert, demonstrated their chemical-cleavage method for DNA sequencing. The Maxam-Gilbert method relies on using different chemicals that can cleave the DNA sequence at specific sites, the separation of resulting DNA fragments of variable size using electrophoresis, and deciphering the DNA sequence from the resulting gel bands.
Challenges of the Maxam-Gilbert Method
The...
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved DNA...
Sequences01:29

Sequences

Sequences are fundamental mathematical objects consisting of ordered lists of numbers that follow a specific rule or pattern. Sequences are critical in various mathematical concepts, including calculus, series, and number theory. They can model real-world phenomena such as population growth, financial investments, and physical processes like the diminishing height of a bouncing ball.Each number in a sequence is referred to as a term. Typically, the terms are denoted as a1, a2, a3,…, where the...
Next-generation Sequencing03:00

Next-generation Sequencing

The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features.
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Radiation Hybrid Map of Barley Chromosome 3H.

The plant genome·2020
Same author

Reliable Radiation Hybrid Maps: An Efficient Scalable Clustering-Based Approach.

IEEE/ACM transactions on computational biology and bioinformatics·2015
Same author

A radiation hybrid map of chromosome 1D reveals synteny conservation at a wheat speciation locus.

Functional & integrative genomics·2013
Same author

Wheat Zapper: a flexible online tool for colinearity studies in grass genomes.

Functional & integrative genomics·2013
Same author

DNA repair and crossing over favor similar chromosome regions as discovered in radiation hybrid of Triticum.

BMC genomics·2012
Same author

A resource for the in silico identification of fungal polyketide synthases from predicted fungal proteomes.

Molecular plant pathology·2011
Same journal

DiffGRN: differential gene regulatory network analysis.

International journal of data mining and bioinformatics·2019
Same journal

Integration of multi-omics data for integrative gene regulatory network inference.

International journal of data mining and bioinformatics·2018
Same journal

The development of non-coding RNA ontology.

International journal of data mining and bioinformatics·2016
Same journal

Learning multiple distributed prototypes of semantic categories for named entity recognition.

International journal of data mining and bioinformatics·2015
Same journal

Weighted fusion regularisation and predicting microbial interactions with vector autoregressive model.

International journal of data mining and bioinformatics·2015
Same journal

Application of consensus string matching in the diagnosis of allelic heterogeneity involving transposition mutation.

International journal of data mining and bioinformatics·2015
See all related articles

Related Experiment Video

Updated: Jun 21, 2026

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

Clustering sequences by overlap.

Dietmar H Dorr1, Anne M Denton

  • 1Department of Computer Science, North Dakota State University, Fargo, ND 58105, USA. dietmar.dorr@ndsu.edu

International Journal of Data Mining and Bioinformatics
|July 24, 2009
PubMed
Summary
This summary is machine-generated.

This study introduces a novel clustering algorithm that integrates motif finding for enhanced sequence analysis. It discovers remote homologies and improves annotation consistency compared to existing methods.

More Related Videos

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles
10:23

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles

Published on: July 11, 2025

Single Cell Multiplex Reverse Transcription Polymerase Chain Reaction After Patch-clamp
10:44

Single Cell Multiplex Reverse Transcription Polymerase Chain Reaction After Patch-clamp

Published on: June 20, 2018

Related Experiment Videos

Last Updated: Jun 21, 2026

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles
10:23

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles

Published on: July 11, 2025

Single Cell Multiplex Reverse Transcription Polymerase Chain Reaction After Patch-clamp
10:44

Single Cell Multiplex Reverse Transcription Polymerase Chain Reaction After Patch-clamp

Published on: June 20, 2018

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Sequence Analysis

Background:

  • Traditional clustering and motif-finding algorithms have limitations in identifying distant relationships.
  • Discovering remote homologies is crucial for understanding protein function and evolution.

Purpose of the Study:

  • To develop a novel clustering algorithm combining strengths of clustering and motif finding.
  • To enable the discovery of remote homologies missed by existing methods.
  • To improve the consistency of biological domain annotations.

Main Methods:

  • A new clustering algorithm identifying clusters based on unambiguously defined sequence sections.
  • Utilizing transitive similarity matches within clusters.
  • Constructing Directed Acyclic Graph (DAG) structures to link clusters of varying lengths.

Main Results:

  • The algorithm successfully identifies clusters and enables the discovery of remote homologies.
  • Clustering results show improved consistency compared to annotations based on InterPro domains.
  • Directed Acyclic Graph (DAG) structures effectively link short and long sequence clusters.

Conclusions:

  • The proposed algorithm offers a more sensitive and consistent approach to sequence clustering and homology detection.
  • This method advances bioinformatics by improving the identification of evolutionary relationships and functional annotations.