Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.5K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.5K
Mitochondrial Precursor Proteins01:39

Mitochondrial Precursor Proteins

3.1K
Mitochondrial precursors are partially unfolded or loosely folded polypeptide chains. Newly synthesized precursors are inhibited from spontaneously folding into their native conformation by the cytosolic chaperones, heat shock proteins 70 (Hsp70), and mitochondrial import stimulation factors (MSFs). Precursors bound to MSFs are guided to the TOM70-TOM37 receptors, while precursors bound to Hsp70  chaperones are targetted to TOM20-TOM22 receptor complexes.
Most of the mitochondrial...
3.1K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.4K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.4K
Per-Unit Sequence Models01:26

Per-Unit Sequence Models

185
An ideal Y-Y transformer, grounded through neutral impedances, displays per-unit sequence networks akin to those of a single-phase ideal transformer when subjected to balanced positive- or negative-sequence currents. These currents do not produce neutral currents, and their associated voltage drops.
Zero-sequence currents, which are identical in magnitude and phase, generate a neutral current, resulting in voltage drops across the neutral impedance and the low-voltage winding. If the...
185

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

MGAPep: LLM-Augmented Multimodal Graph Attention for Protein-Peptide Binding Site Prediction and Cross-Domain Transfer.

IEEE journal of biomedical and health informatics·2026
Same author

ReMol: A Chemical Reaction Knowledge-guided Self-supervised Molecular Image Representation Learning Framework.

IEEE journal of biomedical and health informatics·2026
Same author

Learnable dendrite neural P systems and applications in survival prediction of glioblastoma patients.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

A systematic review of molecular representation learning foundation models.

Briefings in bioinformatics·2026
Same author

MaskMol: knowledge-guided molecular image pre-training framework for activity cliffs with pixel masking.

BMC biology·2025
Same author

SSR-DTA: Substructure-aware multi-layer graph neural networks for drug-target binding affinity prediction.

Artificial intelligence in medicine·2024
Same journal

Multi-view knowledge-guided flow subgraphs with substructure initialization for explainable DDI prediction.

Briefings in functional genomics·2026
Same journal

Genetically supported mediators linking peripheral metabolism to cerebral ischemia: a multi-omics characterization of HMGCR, TLR4, and MMP9 in angina pectoris and stroke.

Briefings in functional genomics·2026
Same journal

Language model-based self-training reduces labeled data requirements by 99% for biological sequence classification.

Briefings in functional genomics·2026
Same journal

Whole-transcriptome sequencing reveals hypoxic esophageal squamous cell carcinoma-derived migrasomes driving cancer-associated fibroblast activation.

Briefings in functional genomics·2026
Same journal

An integrative meta-analysis of SARS-CoV-2 RNA-protein interactomes identifies conserved host factors shared with other RNA viruses.

Briefings in functional genomics·2026
Same journal

Retraction and replacement of: An integrated complete-genome sequencing and systems biology approach to predict antimicrobial resistance genes in the virulent bacterial strains of Moraxella catarrhalis.

Briefings in functional genomics·2026
See all related articles

Related Experiment Video

Updated: Nov 4, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.2K

Pretraining model for biological sequence data.

Bosheng Song, Zimeng Li, Xuan Lin

    Briefings in Functional Genomics
    |May 29, 2021
    PubMed
    Summary
    This summary is machine-generated.

    This survey reviews pretraining models for biological sequence data, crucial for disease detection and drug discovery. It covers models like CNN, word2vec, LSTM, and Transformer, and discusses future directions.

    Keywords:
    biological sequencedeep learningpretraining model

    More Related Videos

    DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling
    08:04

    DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling

    Published on: October 8, 2019

    8.8K
    A Practical Guide to Phylogenetics for Nonexperts
    12:00

    A Practical Guide to Phylogenetics for Nonexperts

    Published on: February 5, 2014

    35.6K

    Related Experiment Videos

    Last Updated: Nov 4, 2025

    A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
    09:34

    A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

    Published on: September 25, 2021

    4.2K
    DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling
    08:04

    DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling

    Published on: October 8, 2019

    8.8K
    A Practical Guide to Phylogenetics for Nonexperts
    12:00

    A Practical Guide to Phylogenetics for Nonexperts

    Published on: February 5, 2014

    35.6K

    Area of Science:

    • Bioinformatics and Computational Biology
    • Genomics and Proteomics
    • Machine Learning in Life Sciences

    Background:

    • High-throughput sequencing generates vast biological sequence data, vital for understanding life and disease.
    • The COVID-19 pandemic highlighted the importance of biological sequence data for diagnostics and therapeutics.
    • Pretraining models from natural language processing offer efficient feature extraction and improved performance for biological sequence analysis.

    Purpose of the Study:

    • To provide a comprehensive review of pretraining models for biological sequence data.
    • To introduce biological sequences, datasets, and categorize existing pretraining models.
    • To discuss applications, novel methods, and future challenges in the field.

    Main Methods:

    • Systematic review and categorization of pretraining models (CNN, word2vec, LSTM, Transformer).
    • Introduction of biological sequences, datasets, and their accessibility.
    • Presentation of applications and a novel pretraining scheme for protein sequences.

    Main Results:

    • Biological sequence data is increasingly accessible, driving advancements in disease detection and drug discovery.
    • Pretraining models effectively embed and extract features from large biological sequence corpora.
    • Various pretraining models, including CNN, word2vec, LSTM, and Transformer, are summarized and categorized.

    Conclusions:

    • Pretraining models are essential for advancing biological sequence data analysis and its applications.
    • Novel pretraining schemes and benchmarks are emerging for specific biological sequences like proteins.
    • Addressing current challenges and exploring future directions will further enhance the utility of pretraining models.