Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Next-generation Sequencing03:00

Next-generation Sequencing

88.9K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
88.9K
RNA-seq03:21

RNA-seq

10.0K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
10.0K
Synthetic Biology02:55

Synthetic Biology

4.7K
Synthetic biology is an interdisciplinary science that involves using principles from disciplines such as engineering, molecular biology, cell biology, and systems biology. It involves remodeling existing organisms from nature or constructing completely new synthetic organisms for applications such as protein or enzyme production, bioremediation, value-added macromolecule production, and the addition of desirable traits to crops, to name a few.
Golden rice
Golden rice is a genetically modified...
4.7K
Sanger Sequencing01:57

Sanger Sequencing

754.4K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
754.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

HELLO: improved neural network architectures and methodologies for small variant calling.

BMC bioinformatics·2021
Same author

Multimode smartphone biosensing: the transmission, reflection, and intensity spectral (TRI)-analyzer.

Lab on a chip·2017
Same author

Characterization of drug authenticity using thin-layer chromatography imaging with a mobile phone.

Journal of pharmaceutical and biomedical analysis·2016
Same journal

DeepMethylation: A deep learning framework for tissue-specific DNA methylation prediction and functional variant annotation.

PLoS computational biology·2026
Same journal

Redefining and estimating the early-phase reproduction ratio for epidemic outbreaks in spatially structured populations.

PLoS computational biology·2026
Same journal

Optimized phenotype definitions boost GWAS power.

PLoS computational biology·2026
Same journal

Detection, communication, and individual identification with deep audio embeddings: A case study with North Atlantic right whales.

PLoS computational biology·2026
Same journal

Exploring the structural lexicon of the Proteome via Metric Geometry.

PLoS computational biology·2026
Same journal

Linking retinal sampling in neural encoding models to temporal profiles of visual processing in humans.

PLoS computational biology·2026
See all related articles

Related Experiment Video

Updated: Jul 5, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.0K

PandoGen: Generating complete instances of future SARS-CoV-2 sequences using Deep Learning.

Anand Ramachandran1, Steven S Lumetta1, Deming Chen1

  • 1University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America.

Plos Computational Biology
|January 19, 2024
PubMed
Summary
This summary is machine-generated.

Forecasting future viral protein sequences is crucial for pandemic preparedness. A new method, PandoGen, accurately predicts novel variants, outperforming larger models in identifying high-spread sequences.

More Related Videos

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning
04:17

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

Published on: May 10, 2024

774
Unbiased Deep Sequencing of RNA Viruses from Clinical Samples
09:36

Unbiased Deep Sequencing of RNA Viruses from Clinical Samples

Published on: July 2, 2016

17.0K

Related Experiment Videos

Last Updated: Jul 5, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.0K
DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning
04:17

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

Published on: May 10, 2024

774
Unbiased Deep Sequencing of RNA Viruses from Clinical Samples
09:36

Unbiased Deep Sequencing of RNA Viruses from Clinical Samples

Published on: July 2, 2016

17.0K

Area of Science:

  • Computational Biology
  • Virology
  • Machine Learning

Background:

  • Viral pandemics present challenges due to novel variants with altered characteristics.
  • Forecasting future viral sequences enables proactive preparation against emerging threats.

Purpose of the Study:

  • To develop a method for generating future viral protein sequences using protein language models.
  • To enhance pandemic preparedness by predicting undiscovered viral strains.

Main Methods:

  • Developed PandoGen, a novel method for training protein language models for pandemic forecasting.
  • PandoGen integrates synthetic data generation, conditional sequence generation, and reward-based learning.
  • Applied PandoGen to model SARS-CoV-2 Spike protein sequences.

Main Results:

  • PandoGen forecasted twice as many novel sequences with five times the case counts compared to a significantly larger model.
  • The method successfully predicted unseen lineages months in advance.
  • PandoGen accurately forecasted sequences of Variants of Concern within tight sequence budgets.

Conclusions:

  • PandoGen offers a powerful approach for forecasting future viral protein sequences.
  • This method significantly improves pandemic preparedness by enabling early identification of high-risk variants.
  • PandoGen demonstrates superior performance over larger models in predicting novel, high-spread viral sequences.