Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.9K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.9K
Genomics02:02

Genomics

36.6K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
36.6K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

2.5K
2.5K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.0K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.0K
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

13.7K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
13.7K
Improving Translational Accuracy02:07

Improving Translational Accuracy

2.7K
2.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Lense: optimizing data preprocessing in single-cell omics using large language models.

Briefings in bioinformatics·2026
Same author

CARES-Net: a channel-attention residual network for multi-disease classification in small-sample <sup>1</sup>H NMR metabolomics data.

Analytica chimica acta·2026
Same author

Lense: Optimizing data preprocessing in single-cell omics using LLMs.

bioRxiv : the preprint server for biology·2026
Same author

Circulating extracellular microRNAs as tissue-specific biomarkers of human health and disease.

Nature communications·2026
Same author

Trajectory-guided dimensionality reduction for multi-sample single-cell RNA-seq data reveals biologically relevant sample-level heterogeneity.

Bioinformatics (Oxford, England)·2026
Same author

Halo: a pretrained model for whole-cell segmentation from nuclei images in spatial transcriptomics.

bioRxiv : the preprint server for biology·2026
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Aug 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

646

Benchmarking large language models for genomic knowledge with GeneTuring.

Wenpin Hou1, Xinyi Shang1, Zhicheng Ji2

  • 1Department of Biostatistics, The Mailman School of Public Health, Columbia University, New York City, NY, USA.

Biorxiv : the Preprint Server for Biology
|March 30, 2023
PubMed
Summary
This summary is machine-generated.

Large language models show promise in genomics, but are not fully reliable. GPT-4o performed best on a genomics Q&A database, yet still made errors.

Keywords:
BenchmarkGenomicsKnowledge baseLarge language model

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

402
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.2K

Related Experiment Videos

Last Updated: Aug 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

646
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

402
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.2K

Area of Science:

  • Genomics
  • Artificial Intelligence
  • Biomedical Research

Background:

  • Large language models (LLMs) show potential in biomedical research.
  • The utility of LLMs as a knowledge base for genomic research is largely unexplored.

Purpose of the Study:

  • To evaluate the performance of leading LLMs in answering genomic research questions.
  • To assess the reliability of LLMs for genomic data inquiry.

Main Methods:

  • Development of GeneTuring, a Q&A database with 1,200 genomics questions.
  • Manual scoring of 25,200 answers generated by six LLMs (including GPT-4o, Claude 3.5, Gemini Advanced).

Main Results:

  • GPT-4o, with web access, demonstrated the highest overall performance.
  • GPT-4o excelled in most genomic question-answering tasks compared to other models.
  • Despite strong performance, GPT-4o did not answer all questions correctly.

Conclusions:

  • LLMs, including advanced models like GPT-4o, are not yet fully reliable for genomic inquiries.
  • Further development is needed to ensure accuracy and completeness of LLM-generated genomic information.
  • LLMs may serve as a supplementary tool but require careful validation in genomic research.