Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

GeneAlign: a coding exon prediction tool based on phylogenetical comparisons.

Shu Ju Hsieh1, Chun Yuan Lin, Ning Han Liu

  • 1Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan, 300, ROC.

Nucleic Acids Research
|July 18, 2006
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Clover: a clustering-oriented de novo assembler for Illumina sequences.

BMC bioinformatics·2020
Same author

Hadoop-MCC: Efficient Multiple Compound Comparison Algorithm Using Hadoop.

Combinatorial chemistry & high throughput screening·2018
Same author

MGUPGMA: A Fast UPGMA Algorithm With Multiple Graphics Processing Units Using NCCL.

Evolutionary bioinformatics online·2017
Same author

Bacterial diversity among four healthcare-associated institutes in Taiwan.

Scientific reports·2017
Same author

Prevalence and comparative analysis of the type IV secretion system in Aggregatibacter actinomycetemcomitan.

Journal of microbiology, immunology, and infection = Wei mian yu gan ran za zhi·2017
Same author

Subgingival microbiota in individuals with severe chronic periodontitis.

Journal of microbiology, immunology, and infection = Wei mian yu gan ran za zhi·2016
Same journal

Correction to 'scSuperAnnotator: A platform for benchmarking comparison and visualizing automated cellular annotation methods for scRNA-seq data'.

Nucleic acids research·2026
Same journal

Correction to 'Differentiable partition function calculation for RNA'.

Nucleic acids research·2026
Same journal

Deployment of non-canonical splicing in tunicate genomes is mediated by divergent U2AF function and changing m6A modification in U1 and U6 snRNA.

Nucleic acids research·2026
Same journal

Bacillus subtilis DnaB forms multiple protein-protein interactions essential for DNA replication initiation.

Nucleic acids research·2026
Same journal

Multiple forms of protein-protein and DNA binding are exhibited by BrxC from the BREX phage restriction system.

Nucleic acids research·2026
Same journal

Biosynthesis of glycosylated 5-hydroxycytosine in the DNA of diverse viruses.

Nucleic acids research·2026
See all related articles

GeneAlign accurately predicts protein coding genes in new genomes by comparing DNA sequences to annotated genes in related species. This tool achieves high accuracy at both gene and exon levels, aiding genomic research.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Accurate identification of protein-coding genes is crucial for understanding newly sequenced genomes.
  • Experimental verification of gene annotations enables comparative genomics approaches for gene prediction.

Purpose of the Study:

  • To develop and evaluate GeneAlign, a novel tool for predicting protein-coding genes in newly sequenced genomes.
  • To leverage sequence homology and conserved gene structures for enhanced prediction accuracy.

Main Methods:

  • GeneAlign utilizes CORAL, a heuristic linear time alignment tool, to compare genomic regions with annotated coding exons.
  • It identifies candidate regions based on conserved signals (initiation and stop codons) and measures sequence homology.
  • The tool employs conservation of gene structures and sequence similarities between protein-coding regions.

Related Experiment Videos

Main Results:

  • GeneAlign demonstrated an average sensitivity and specificity of 81% at the gene level on the Projector dataset (human-mouse homologous pairs).
  • Prediction accuracy exceeded 96% at the exon level.
  • Rates of missing or incorrect exons were below 1%.

Conclusions:

  • GeneAlign provides a highly accurate and efficient method for protein-coding gene prediction in comparative genomics.
  • The tool's performance suggests its utility for annotating newly sequenced genomes.
  • GeneAlign is freely available for research use.