Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

7.2K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
7.2K
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

831
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
831

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Body Size Evolution in Burying Beetles (Staphylinidae: Silphinae: <i>Nicrophorus</i>).

Ecology and evolution·2026
Same author

Soaring Systematics: an evaluation of biogeography and flight behavior in dragonflies and damselflies (Insecta: Odonata) using phylogenomics.

Systematic biology·2026
Same author

Elevational and oceanic barriers shape the distribution, dispersal, and diversity of Aotearoa's Kapokapowai (Uropetala) dragonflies.

Evolution; international journal of organic evolution·2025
Same author

Antennal sensilla diversity in diurnal and nocturnal fireflies (Coleoptera, Lampyridae).

PloS one·2025
Same author

Validation of earlier proposed taxa Tibionemini new tribe and Hiekeolycus winkleri new species.

Zootaxa·2025
Same author

Polarized vision in the eyes of the most effective predators: dragonflies and damselflies (Odonata).

Die Naturwissenschaften·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Mar 25, 2026

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.6K

Detecting false positive sequence homology: a machine learning approach.

M Stanley Fujimoto1, Anton Suvorov2, Nicholas O Jensen3

  • 1Computer Science Department, Brigham Young University, Provo, Utah, 84602, USA.

BMC Bioinformatics
|February 26, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces a machine learning method to improve the accuracy of identifying homologous genes. The approach effectively filters out false positives from heuristic algorithms, enhancing evolutionary and functional genomics research.

More Related Videos

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.5K
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.6K

Related Experiment Videos

Last Updated: Mar 25, 2026

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.6K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.5K
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.6K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Evolutionary Biology

Background:

  • Accurate detection of homologous gene relationships is crucial for evolutionary studies and gene annotation.
  • Existing heuristic tools often yield high false positive rates, especially with incomplete sequencing data.
  • Lack of post-processing tools exacerbates homology detection inaccuracies.

Purpose of the Study:

  • To develop a machine learning method for enhancing the accuracy of homologous gene detection.
  • To provide a post-processing solution for existing clustering algorithms.
  • To reduce false positive rates in homology identification, particularly from low-coverage sequencing data.

Main Methods:

  • Extraction of biologically informative features from multiple sequence alignments of putative homologous genes.
  • Training a machine learning model on known homology clusters (OrthoDB) and non-homologous sequences.
  • Utilizing guided experimentation to verify false positive outcomes.

Main Results:

  • The machine learning method successfully identifies false positives generated by heuristic algorithms.
  • Significant percentages of putative homologies from InParanoid (~42%) and HaMStR (~25%) were classified as false positives.
  • The method demonstrated effectiveness on proteomes from low-coverage RNA-seq data.

Conclusions:

  • The developed process serves as a novel post-processing method for homology clustering algorithms.
  • It efficiently removes low-quality clusters of putative homologous genes.
  • This approach enhances the overall quality and reliability of homology detection outputs.