BarcodeBERT: transformers for biodiversity analyses.

Pablo Millan Arias¹, Niousha Sadjadi¹, Monireh Safari¹

  • ¹ David R. Cheriton School of Computer Science, University of Waterloo, 200 University Avenue W, Waterloo, ON N2L 3G1, Canada.

Bioinformatics Advances | March 25, 2026
Summary
This summary is machine-generated.

BarcodeBERT is a transformer model pretrained on DNA barcodes for biodiversity analysis. It outperforms fine-tuned general DNA foundation models at identifying genera and species, and it reaches species-level classification accuracy comparable to BLAST while running 55x faster.

Area of Science:

  • Biodiversity research
  • Genomic analysis
  • Machine learning

Background:

  • DNA barcodes are short, standardized genomic regions used to identify species and compare them at the sequence level.
  • Existing machine learning methods for DNA barcode analysis often use generic supervised algorithms.

Purpose of the Study:

  • To introduce BarcodeBERT, a specialized model for biodiversity analysis using DNA barcodes.
  • To evaluate BarcodeBERT's performance against other machine learning approaches and sequence-search tools.

Main Methods:

  • Developed BarcodeBERT using self-supervised pretraining on 1.5 million invertebrate DNA barcodes.
  • Evaluated performance on taxonomic identification tasks, comparing against classical neural networks and general DNA foundation models.
  • Analyzed masking and tokenization strategies for DNA language models.
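
To make "tokenization" and "masking" concrete, here is a minimal Python sketch of one common setup for DNA language models: splitting a sequence into k-mer tokens and hiding a fraction of them for a masked-token pretraining objective. It is an illustration only, not the authors' implementation. The function names, the choice of k = 4, the 50% mask rate, and the `[MASK]` symbol are all assumptions made for this example; the summary does not state which settings the study used.

```python
import random


def kmer_tokenize(seq, k=4, stride=None):
    """Split a DNA sequence into k-mer tokens (hypothetical helper).

    stride=k (the default) yields non-overlapping k-mers;
    stride=1 yields fully overlapping k-mers.
    Trailing bases that do not fill a whole k-mer are dropped.
    """
    stride = k if stride is None else stride
    seq = seq.upper()
    return [seq[i:i + k] for i in range(0, len(seq) - k + 1, stride)]


def mask_tokens(tokens, mask_rate=0.5, mask_token="[MASK]", seed=0):
    """Replace a random subset of tokens with mask_token.

    Returns (masked_tokens, labels). labels holds the original token at
    each masked position and None elsewhere, which is the usual target
    layout for masked-token prediction. mask_rate is an assumed value.
    """
    if not tokens:
        return [], []
    rng = random.Random(seed)
    n_mask = min(len(tokens), max(1, round(mask_rate * len(tokens))))
    positions = set(rng.sample(range(len(tokens)), n_mask))
    masked = [mask_token if i in positions else t for i, t in enumerate(tokens)]
    labels = [t if i in positions else None for i, t in enumerate(tokens)]
    return masked, labels


if __name__ == "__main__":
    barcode = "ACGTACGTACGTTTGA"  # toy sequence, not real barcode data
    tokens = kmer_tokenize(barcode, k=4)
    masked, labels = mask_tokens(tokens, mask_rate=0.5, seed=1)
    print(tokens)
    print(masked)
    print(labels)
```

Changing `stride` switches between non-overlapping and overlapping k-mers, and changing `mask_rate` controls how much of each sequence the model must reconstruct. These are the kinds of choices a study of masking and tokenization strategies would compare.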

Main Results:

  • BarcodeBERT, trained on domain-specific data, outperforms fine-tuned foundation models in identifying lower taxa (genera, species).
  • Achieved species-level classification accuracy comparable to BLAST while running 55x faster.
  • Provided insights into optimal training strategies for customized DNA language models.

Conclusions:

  • BarcodeBERT advances DNA barcode analysis for biodiversity characterization, combining species-level accuracy comparable to BLAST with 55x faster classification.
  • Self-supervised learning on domain-specific data is effective for building specialized biological language models.
  • The study offers practical guidance for developing custom DNA language models tailored to specific datasets and biological domains.