Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

18.8K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
18.8K
Synthetic Biology02:55

Synthetic Biology

4.7K
Synthetic biology is an interdisciplinary science that involves using principles from disciplines such as engineering, molecular biology, cell biology, and systems biology. It involves remodeling existing organisms from nature or constructing completely new synthetic organisms for applications such as protein or enzyme production, bioremediation, value-added macromolecule production, and the addition of desirable traits to crops, to name a few.
Golden rice
Golden rice is a genetically modified...
4.7K
Genetic Lingo01:11

Genetic Lingo

102.7K
Overview
102.7K
Protein Networks02:26

Protein Networks

2.3K
2.3K
Ribosome Profiling02:24

Ribosome Profiling

3.5K
Ribosome profiling or ribo-sequencing is a deep sequencing technique that produces a snapshot of active translation in a cell. It selectively sequences the mRNAs protected by ribosomes to get an insight into a cell’s translation landscape at any given point in time.
Applications of ribosome profiling
Ribosome profiling has many applications, including in vivo monitoring of translation inside a particular organ or tissue type and quantifying new protein synthesis levels.
The technique...
3.5K
Proteomics01:33

Proteomics

7.3K
A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term...
7.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Author Correction: Community benchmarking and evaluation of human unannotated microprotein detection by mass spectrometry based proteomics.

Nature communications·2026
Same author

Expanding the human proteome with microproteins and peptideins.

Nature·2026
Same author

An AI-Ready Phosphorylation Meta-Analysis for <i>Saccharomyces cerevisiae</i>.

Journal of proteome research·2026
Same author

Structure-centric searching enables global mapping of the public metabolome.

Nature biotechnology·2026
Same author

An expanded reference catalog of translated open reading frames for biomedical research.

Nucleic acids research·2026
Same author

The 2025 Westlake Autumn Symposium for Al Proteomics and Virtual Cell.

Genomics, proteomics & bioinformatics·2026
Same journal

Light-Induced Proteomic Changes in Pseudomonas aeruginosa Biofilms.

Proteomics·2026
Same journal

Decade-Resolved Proteomic Profiling of Gastric Cancer FFPE Archives: Evaluating Storage-Associated Shifts and Signal Stability Over 50 Years.

Proteomics·2026
Same journal

Proteome-Scale Mining of Metal-Associated Proteins of Monkeypox Virus.

Proteomics·2026
Same journal

Optimized Sample Handling Minimizes Peptide Adsorption to Plastics to Enable High Sensitivity Evosep Based Chemical Proteomics.

Proteomics·2026
Same journal

Toward Predicting Pandemic Potential: A Comparative Analysis of Virus-Host Interactions Between Diverse Influenza A Viruses and the Human Innate Immune System.

Proteomics·2026
Same journal

Functional Divergence of Mucus in Pacific Oyster (Crassostrea gigas): Insights From Integrated Proteomic and Rheological Study.

Proteomics·2026
See all related articles

Related Experiment Video

Updated: Jun 29, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

554

Open-source large language models in action: A bioinformatics chatbot for PRIDE database.

Jingwen Bai1, Selvakumar Kamatchinathan1, Deepti J Kundu1

  • 1European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.

Proteomics
|March 31, 2024
PubMed
Summary
This summary is machine-generated.

A new chatbot infrastructure enhances user interaction with the PRIDE database, using Large Language Models (LLMs) for documentation search and dataset discovery. This open-source tool improves accessibility and discoverability in bioinformatics resources.

Keywords:
bioinformaticsdataset discoverabilitydocumentationlarge language modelsproteomicspublic datasoftware architecturestraining

More Related Videos

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes
09:10

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes

Published on: May 22, 2018

9.2K
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

9.7K

Related Experiment Videos

Last Updated: Jun 29, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

554
A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes
09:10

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes

Published on: May 22, 2018

9.2K
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

9.7K

Area of Science:

  • Bioinformatics
  • Proteomics
  • Computational Biology

Background:

  • User interaction with large biological databases like PRIDE can be complex.
  • Efficiently searching documentation and datasets is crucial for researchers.

Purpose of the Study:

  • To develop a chatbot assistant infrastructure to simplify user interactions with the PRIDE database.
  • To enhance the discoverability of PRIDE datasets and improve documentation accessibility.

Main Methods:

  • Utilized multiple Large Language Models (LLMs) including llama2, chatglm, mixtral, and openhermes.
  • Implemented a web service API, web interface, and vector database for indexing and management.
  • Integrated an Elo-ranking system for LLM performance benchmarking and documentation improvement.

Main Results:

  • The chatbot facilitates natural language interaction with PRIDE documentation.
  • An LLM-based recommendation system enables effective PRIDE dataset searching and discoverability.
  • The framework demonstrated modularity and adaptability for broader applications.

Conclusions:

  • The developed chatbot infrastructure offers a robust and transferable solution for enhancing user experience in bioinformatics.
  • The integration of LLMs, vector databases, and benchmarking improves data accessibility and resource utilization.
  • The open-source nature of the framework promotes wider adoption and development in related scientific domains.