Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

RNA-seq

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases.
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...

Evolutionary Relationships through Genome Comparisons

Evolutionary Relationships through Genome Comparisons

Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Genome Annotation and Assembly

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.

Cis-regulatory Sequences

Cis-regulatory Sequences

Cis-regulatory sequences are short fragments of non-coding DNA that are present on the same chromosomes as the genes that they regulate. These fragments serve as binding sites for transcriptional regulators, proteins that are responsible for controlling gene transcription and differential gene expression across cell types in eukaryotes. Cis-regulatory sequences can be close to the gene of interest or thousands of bases away in the DNA sequence; however, those sequences that are further away are...

DNA Microarrays

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Phylogenetic inference reveals clonal heterogeneity in circulating tumor cell clusters.

Nature genetics·2025

Same author

Single-cell copy number calling and event history reconstruction.

Bioinformatics (Oxford, England)·2025

Same author

Mutation order in acute myeloid leukemia identifies uncommon patterns of evolution and illuminates phenotypic heterogeneity.

Leukemia·2024

Same author

Mutation order in acute myeloid leukemia identifies uncommon patterns of evolution and illuminates phenotypic heterogeneity.

Research square·2023

Same author

COMPASS: joint copy number and mutation phylogeny reconstruction from amplicon single-cell sequencing data.

Nature communications·2023

Same author

To be or not to be stressed: Designing autonomy to reduce stress at work.

Work (Reading, Mass.)·2023

Same journal

GMSA: A Graph Matching and Point Cloud Registration-Based Method for Spatial Transcriptomics Data Alignment.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

Investigations on Multiple Protein Scaffold Filling.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

Cell Type Prediction for Single-Cell RNA Sequencing Utilizing Unsupervised Domain Adaptation and Semi-Supervised Learning.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

PPIGAN: Prediction of Protein-Protein Interactions Using Generative Adversarial Networks.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

Deep Structure-Enhanced Cell Clustering Model for Single-Cell RNA Sequencing Data.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

Asymmetric Drug-Drug Interaction Prediction Based on Generative Adversarial Networks and Knowledge Graph.

Journal of computational biology : a journal of computational molecular cell biology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 29, 2026

A Computational Pipeline for Intergenic/Intragenic Enhancer RNA Quantification in Mouse Embryonic Stem Cells

A Computational Pipeline for Intergenic/Intragenic Enhancer RNA Quantification in Mouse Embryonic Stem Cells

Published on: October 28, 2025

Efficient computation of approximate gene clusters based on reference occurrences.

Katharina Jahn¹

¹AG Genominformatik, Technische Fakultät, Universität Bielefeld, Bielefeld, Germany. kjahn@cebitec.uni-bielefeld.de

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology

|September 9, 2011

Summary

This summary is machine-generated.

This study introduces an efficient set distance method for identifying approximate gene clusters, crucial for comparative genomics. The new approach offers comparable results to existing methods but with significantly improved computational efficiency.

More Related Videos

Amplification, Next-generation Sequencing, and Genomic DNA Mapping of Retroviral Integration Sites

Amplification, Next-generation Sequencing, and Genomic DNA Mapping of Retroviral Integration Sites

Published on: March 22, 2016

Related Experiment Videos

Last Updated: May 29, 2026

A Computational Pipeline for Intergenic/Intragenic Enhancer RNA Quantification in Mouse Embryonic Stem Cells

A Computational Pipeline for Intergenic/Intragenic Enhancer RNA Quantification in Mouse Embryonic Stem Cells

Published on: October 28, 2025

Amplification, Next-generation Sequencing, and Genomic DNA Mapping of Retroviral Integration Sites

Amplification, Next-generation Sequencing, and Genomic DNA Mapping of Retroviral Integration Sites

Published on: March 22, 2016

Area of Science:

Genomics
Bioinformatics
Computational Biology

Background:

Comparative genomics utilizes gene cluster conservation for whole genome analysis.
Functionally related genes often remain co-located across species, forming approximate gene clusters.
Identifying these imperfectly conserved clusters is computationally challenging.

Purpose of the Study:

To present an efficient set distance-based algorithm for detecting approximate gene clusters.
To demonstrate the algorithm's performance and scalability in comparative genomics.

Main Methods:

Developed a set distance-based approach using reference occurrences for approximate gene cluster computation.
Evaluated the algorithm's efficiency and accuracy against non-reference based and max-gap based methods.

Main Results:

The proposed method achieves results comparable to non-reference based approaches.
Its polynomial runtime enables approximate gene cluster detection in previously infeasible parameter ranges.
Demonstrated superior performance and predictive power compared to a state-of-the-art max-gap approach.

Conclusions:

The set distance-based algorithm provides an efficient and effective tool for identifying approximate gene clusters.
This advancement facilitates more comprehensive comparative genomic analyses.
The method expands the feasibility of detecting gene clusters with complex conservation patterns.