Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Evolutionary Relationships through Genome Comparisons

Evolutionary Relationships through Genome Comparisons

Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...

Genomics

Genomics

Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...

Genome Size and the Evolution of New Genes

Genome Size and the Evolution of New Genes

Genome Annotation and Assembly

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.

Genome-wide Association Studies-GWAS

Genome-wide Association Studies-GWAS

Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...

Improving Translational Accuracy

Improving Translational Accuracy

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Lense: optimizing data preprocessing in single-cell omics using large language models.

Briefings in bioinformatics·2026

Same author

CARES-Net: a channel-attention residual network for multi-disease classification in small-sample <sup>1</sup>H NMR metabolomics data.

Analytica chimica acta·2026

Same author

Lense: Optimizing data preprocessing in single-cell omics using LLMs.

bioRxiv : the preprint server for biology·2026

Same author

Circulating extracellular microRNAs as tissue-specific biomarkers of human health and disease.

Nature communications·2026

Same author

Trajectory-guided dimensionality reduction for multi-sample single-cell RNA-seq data reveals biologically relevant sample-level heterogeneity.

Bioinformatics (Oxford, England)·2026

Same author

Halo: a pretrained model for whole-cell segmentation from nuclei images in spatial transcriptomics.

bioRxiv : the preprint server for biology·2026

Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026

Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026

Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026

Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026

Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026

Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Aug 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Benchmarking large language models for genomic knowledge with GeneTuring.

Wenpin Hou¹, Xinyi Shang¹, Zhicheng Ji²

¹Department of Biostatistics, The Mailman School of Public Health, Columbia University, New York City, NY, USA.

Biorxiv : the Preprint Server for Biology

|March 30, 2023

Summary

This summary is machine-generated.

Large language models show promise in genomics, but are not fully reliable. GPT-4o performed best on a genomics Q&A database, yet still made errors.

Keywords:

Benchmark Genomics Knowledge base Large language model

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

Related Experiment Videos

Last Updated: Aug 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

Area of Science:

Genomics
Artificial Intelligence
Biomedical Research

Background:

Large language models (LLMs) show potential in biomedical research.
The utility of LLMs as a knowledge base for genomic research is largely unexplored.

Purpose of the Study:

To evaluate the performance of leading LLMs in answering genomic research questions.
To assess the reliability of LLMs for genomic data inquiry.

Main Methods:

Development of GeneTuring, a Q&A database with 1,200 genomics questions.
Manual scoring of 25,200 answers generated by six LLMs (including GPT-4o, Claude 3.5, Gemini Advanced).

Main Results:

GPT-4o, with web access, demonstrated the highest overall performance.
GPT-4o excelled in most genomic question-answering tasks compared to other models.
Despite strong performance, GPT-4o did not answer all questions correctly.

Conclusions:

LLMs, including advanced models like GPT-4o, are not yet fully reliable for genomic inquiries.
Further development is needed to ensure accuracy and completeness of LLM-generated genomic information.
LLMs may serve as a supplementary tool but require careful validation in genomic research.