

BioWorkflow: Retrieving comprehensive bioinformatics workflows from publications.

Yidan Wang1,2, Jiayin Wang1,2,3

  • 1School of Computer Science and Technology, Faculty of Electronics and Information Engineering, Xi'an Jiaotong University, No. 28 Xianning West Road, Beilin District, Xi'an, Shaanxi 710049, China.

Briefings in Bioinformatics | November 8, 2025
Summary
This summary is machine-generated.

BioWorkflow, a new framework built on large language models (LLMs), automates the reconstruction of bioinformatics workflows from scientific papers. It improves reproducibility and accuracy and substantially reduces manual curation time.

Keywords:
bioinformatics; large language models; multimodal; retrieval-augmented generation; workflow extraction


Area of Science:

  • Bioinformatics
  • Computational Biology
  • Scientific Literature Mining

Background:

  • Reconstructing bioinformatics workflows from publications is crucial for scientific analysis but is hindered by dispersed information across various document formats.
  • Manual reconstruction is time-consuming and prone to errors, while existing systems lack the full-text, multimodal capabilities for complete workflow extraction.

Purpose of the Study:

  • To introduce BioWorkflow, a novel large language model (LLM)-based framework designed for automated, end-to-end reconstruction of bioinformatics workflows from scientific literature.
  • To improve the accuracy, completeness, and reproducibility of bioinformatics pipeline extraction and reuse.

Main Methods:

  • BioWorkflow employs a retrieval-augmented framework that parses PDFs to create a unified index of text, tables, and figures.
  • It uses hierarchical query decomposition with dynamic reformulation and iterative, context-aware retrieval to assemble directed workflows.
  • The framework links predicted elements to cited evidence and performs automated consistency checks to ensure traceability and minimize hallucinations.
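As a rough illustration of the ideas listed above, the following Python sketch builds a tiny unified index over text, table, and figure snippets, retrieves evidence for each sub-query, and assembles a directed workflow in which every step points back to the snippet that supports it. Everything in it is a hypothetical stand-in: the `Evidence` and `Step` types, the `retrieve`, `build_workflow`, and `unsupported_steps` functions, the keyword-overlap scoring, and the sample snippets are assumptions made for illustration. The actual framework relies on LLMs and PDF parsing, which this summary does not describe in detail.

```python
from dataclasses import dataclass
from typing import List, Optional, Set, Tuple


@dataclass(frozen=True)
class Evidence:
    """One indexed snippet; modality is 'text', 'table', or 'figure'."""
    evidence_id: str
    modality: str
    content: str


@dataclass
class Step:
    """A workflow step linked to the evidence that supports it (or None)."""
    name: str
    evidence_id: Optional[str]


def tokenize(text: str) -> Set[str]:
    """Lowercase words with surrounding punctuation stripped."""
    tokens = (tok.strip(".,;:()").lower() for tok in text.split())
    return {tok for tok in tokens if tok}


def retrieve(index: List[Evidence], query: str, used: Set[str]) -> Optional[Evidence]:
    """Return the unused snippet with the largest keyword overlap, if any.

    Skipping already-used snippets is a crude stand-in for
    context-aware retrieval (an assumption of this sketch).
    """
    terms = tokenize(query)
    best, best_score = None, 0
    for item in index:
        if item.evidence_id in used:
            continue
        score = len(terms & tokenize(item.content))
        if score > best_score:
            best, best_score = item, score
    return best


def build_workflow(
    index: List[Evidence], sub_queries: List[Tuple[str, str]]
) -> Tuple[List[Step], List[Tuple[str, str]]]:
    """Resolve each (step name, query) pair in order and chain the steps."""
    steps: List[Step] = []
    used: Set[str] = set()
    for name, query in sub_queries:
        hit = retrieve(index, query, used)
        if hit is not None:
            used.add(hit.evidence_id)
        steps.append(Step(name, hit.evidence_id if hit else None))
    # Directed edges between consecutive steps.
    edges = [(a.name, b.name) for a, b in zip(steps, steps[1:])]
    return steps, edges


def unsupported_steps(steps: List[Step]) -> List[str]:
    """Consistency check: names of steps that have no linked evidence."""
    return [s.name for s in steps if s.evidence_id is None]


# Hypothetical snippets standing in for parsed PDF content.
index = [
    Evidence("text-1", "text", "Reads were trimmed with an adapter trimming tool."),
    Evidence("table-1", "table", "Alignment parameters: aligner, reference genome, threads"),
    Evidence("fig-1", "figure", "Differential expression volcano plot"),
]
sub_queries = [
    ("trimming", "adapter trimming of reads"),
    ("alignment", "alignment reference genome"),
    ("differential expression", "differential expression analysis"),
    ("variant calling", "variant calling"),
]

steps, edges = build_workflow(index, sub_queries)
for step in steps:
    print(step.name, "->", step.evidence_id)
print("Unsupported:", unsupported_steps(steps))
```

In this toy run the "variant calling" step has no matching snippet, so the check flags it instead of leaving an unsupported step in the workflow. That is the kind of traceability the evidence links are meant to provide.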

Main Results:

  • BioWorkflow recovers approximately 80% of workflow steps, compared with roughly 20% for existing tools.
  • It demonstrates a >20% improvement in reproducibility, completeness, and accuracy compared to strong LLM baselines.
  • Curation time is reduced to 3-5 minutes per paper, enabling faster reuse of published pipelines.
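The summary does not say how step recovery was scored. One plausible reading is step-level recall: the fraction of reference workflow steps that also appear in the extracted workflow. The sketch below computes that under stated assumptions: the function name `step_recall`, the matching rule (exact match after lowercasing and whitespace normalization), and the example step names are all hypothetical and are not taken from the paper.

```python
from typing import Iterable


def normalize(step: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not count."""
    return " ".join(step.lower().split())


def step_recall(predicted: Iterable[str], reference: Iterable[str]) -> float:
    """Fraction of reference steps that appear among the predicted steps."""
    ref = {normalize(s) for s in reference}
    if not ref:
        raise ValueError("reference workflow must contain at least one step")
    pred = {normalize(s) for s in predicted}
    return len(ref & pred) / len(ref)


# Hypothetical curated and extracted workflows for a single paper.
reference = ["Quality control", "Trimming", "Alignment", "Quantification", "Differential expression"]
predicted = ["quality control", "trimming", "alignment", "quantification", "Visualization"]

print(step_recall(predicted, reference))
```

Four of the five reference steps are matched in this example, so the recall is 0.8. The extra predicted step does not affect recall, so a precision-style measure would be needed to penalize it.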

Conclusions:

  • BioWorkflow effectively automates the extraction of detailed bioinformatics workflows from scientific publications.
  • The framework enhances the reliability and efficiency of reusing published computational pipelines, fostering scientific reproducibility.