Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

While every living organism has a genome of some kind (be it RNA, or DNA), there is considerable variation in the sizes of these blueprints. One major factor that impacts genome size is whether the organism is prokaryotic or eukaryotic. In prokaryotes, the genome contains little to no non-coding sequence, such that genes are tightly clustered in groups or operons sequentially along the chromosome. Conversely, the genes in eukaryotes are punctuated by long stretches of non-coding sequence.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

iKa/Ks: Estimating the Selection Pressure and Evolutionary Rate of Proteins under the Non-neutral Hypothesis of Synonymous Mutations.

Journal of molecular biology·2026
Same author

Isolation of Rhizoma Paridis saponins as novel entry inhibitors of Crimean-Congo hemorrhagic fever virus.

Cell insight·2026
Same author

Bridging antiviral drug discovery with a large language model-powered framework.

Communications biology·2026
Same author

Differential Immune Cell Distribution Between Arterial and Venous Blood in Healthy Rats.

FASEB journal : official publication of the Federation of American Societies for Experimental Biology·2026
Same author

MNS induces antiviral protection and suppresses inflammation.

Annals of medicine·2026
Same author

Betrixaban activates cGAS-STING to promote antitumor immunity without pathological inflammation.

EMBO molecular medicine·2026
Same journal

High-throughput DNA engineering by mating bacteria.

Cell systems·2026
Same journal

Living bacterial reservoir computers for information processing and sensing.

Cell systems·2026
Same journal

A data-driven modeling framework for mapping genotypes to synthetic microbial community functions.

Cell systems·2026
Same journal

Glycoform engineering of a mammalian platform to sculpt a humanized recombinant bioscavenger.

Cell systems·2026
Same journal

Targeted genomic editing of human gut Bacteroides species based on CRISPR-associated transposases.

Cell systems·2026
Same journal

Scalable enumeration and sampling of minimal metabolic pathways for organisms and communities.

Cell systems·2026
See all related articles

Related Experiment Video

Updated: Jul 3, 2026

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms

Published on: May 9, 2017

BulkFormer: A large-scale foundation model for bulk transcriptomes.

Boming Kang1, Rui Fan1, Meizheng Yi2

  • 1Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd., Beijing 100191, China.

Cell Systems
|July 1, 2026
PubMed
Summary
This summary is machine-generated.

Foundation models are advancing transcriptome analysis. BulkFormer, a new model for bulk RNA sequencing data, outperforms existing methods and reduces training costs.

Keywords:
deep learningfoundation modelshuman bulk transcriptomes

Related Experiment Videos

Last Updated: Jul 3, 2026

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms

Published on: May 9, 2017

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Foundation models are revolutionizing transcriptome analysis.
  • Current models often use sparse single-cell RNA sequencing (scRNA-seq) data, limiting analysis to ~3,000 genes.
  • Bulk RNA sequencing (bulk RNA-seq) profiles more genes (~16,000) and is crucial for clinical and tissue-level studies.

Purpose of the Study:

  • To develop a foundation model specifically for bulk transcriptome analysis.
  • To address the limitations of scRNA-seq based models for bulk data.
  • To create a robust framework for bulk RNA-seq data modeling.

Main Methods:

  • Introduction of BulkFormer, a foundation model with ~150 million parameters.
  • Pretraining on 581,503 human bulk RNA-seq profiles, covering 20,010 protein-coding genes.
  • Utilizing a hybrid encoder combining a graph neural network and a Performer module.

Main Results:

  • BulkFormer demonstrated superior performance across five downstream tasks compared to existing scRNA-seq foundation models.
  • Achieved high performance with substantially lower training costs.
  • Highlighted the importance of pretraining data modality for foundation model efficacy.

Conclusions:

  • BulkFormer provides an effective framework for bulk transcriptome analysis.
  • The study underscores the impact of data modality on foundation model performance.
  • Establishes BulkFormer as a valuable tool for clinical and tissue-level transcriptomic studies.