Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

20.5K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
20.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Apollo 3: Multi-Species Genome Curation.

bioRxiv : the preprint server for biology·2026
Same author

The Potential of Digital Twins for Pediatric Rare Diseases.

CPT: pharmacometrics & systems pharmacology·2026
Same author

From FAIR to CURE: guidelines for computational models of biological systems.

NPJ systems biology and applications·2026
Same author

The 2025 Westlake Autumn Symposium for Al Proteomics and Virtual Cell.

Genomics, proteomics & bioinformatics·2026
Same author

React-to-Me: A Conversational Interface for Interactive Exploration of the Reactome Pathway Knowledgebase.

Research square·2026
Same author

Proteins in the Genome Browser: Integration of Phylogenies, Alignments, and Structures With Nucleotide-level Evidence in JBrowse 2.

Journal of molecular biology·2026
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jan 13, 2026

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
08:09

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics

Published on: June 17, 2012

20.5K

Application of Large Language Models for Annotating Genes into Reactome Pathways.

Guanming Wu1, Lisa Matthews2, Nathan Boyer3

  • 1Division of Oncological Sciences, Knight Cancer Institute, Oregon Health and Science University, Portland, OR 97239, United States.

Biorxiv : the Preprint Server for Biology
|January 7, 2026
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) assist in curating Reactome, a biological pathway knowledgebase, by predicting gene functions and extracting literature evidence. This AI-driven approach enhances efficiency and supports manual curation efforts.

More Related Videos

A Web Tool for Generating High Quality Machine-readable Biological Pathways
08:01

A Web Tool for Generating High Quality Machine-readable Biological Pathways

Published on: February 8, 2017

18.4K
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

2.1K

Related Experiment Videos

Last Updated: Jan 13, 2026

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
08:09

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics

Published on: June 17, 2012

20.5K
A Web Tool for Generating High Quality Machine-readable Biological Pathways
08:01

A Web Tool for Generating High Quality Machine-readable Biological Pathways

Published on: February 8, 2017

18.4K
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

2.1K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Artificial Intelligence in Life Sciences

Background:

  • Reactome is a comprehensive, manually curated biological pathway knowledgebase.
  • Manual curation is labor-intensive and struggles to keep pace with biomedical literature growth.
  • Large Language Models (LLMs) and Artificial Intelligence (AI) offer potential solutions for bioinformatics resource development.

Purpose of the Study:

  • To explore the adoption of LLM/AI technologies for Reactome manual curation.
  • To develop and validate an LLM workflow to assist curators in gene annotation and pathway refinement.
  • To assess the utility of AI-generated summaries and extracted relationships for improving Reactome content.

Main Methods:

  • Developed an LLM workflow to predict gene-pathway associations and identify supporting literature.
  • Generated text summaries of potential molecular mechanisms and extracted functional relationships from full-text papers.
  • Validated workflow outputs using semantic similarity comparisons with existing Reactome annotations and manual curator evaluation.
  • Enhanced the workflow with protein-protein interaction data for reaction-based annotation.

Main Results:

  • The LLM workflow demonstrated significant enrichment of high-similarity matches between generated summaries and Reactome annotations.
  • Manual evaluation indicated that over half of the workflow's outputs were useful for supporting curation tasks.
  • An enhanced workflow incorporating protein-protein interaction data improved reaction-based annotation.

Conclusions:

  • Initial adoption of LLM/AI technologies shows encouraging results for Reactome curation.
  • The developed workflow provides a practical framework for integrating AI-assisted methods into the Reactome curation pipeline.
  • The strategies employed may be broadly applicable to other community knowledgebases.