Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

What is Gene Expression?01:42

What is Gene Expression?

196.7K
Overview
Gene expression is the process in which DNA directs the synthesis of functional products, that is, proteins. Cells can regulate gene expression at various stages. It allows organisms to generate different cell types and enables cells to adapt to internal and external factors.
Genetic Information Flows from DNA to RNA to Protein
A gene is a stretch of DNA that serves as the blueprint for functional RNAs and proteins. Since DNA is made up of nucleotides and proteins consist of amino...
196.7K
What is Gene Expression?01:36

What is Gene Expression?

11.4K
A gene is a stretch of DNA that serves as the blueprint for functional RNAs and proteins. Since DNA is comprised  of nucleotides and proteins are comprised of amino acids, a mediator is required to convert the information encoded in DNA into proteins. This mediator is the messenger RNA (mRNA). mRNA copies the blueprint from DNA by a process called transcription. In eukaryotes, transcription occurs in the nucleus by complementary base-pairing with the DNA template. The mRNA is then...
11.4K
Cell Specific Gene Expression01:58

Cell Specific Gene Expression

16.5K
Multicellular organisms contain a variety of structurally and functionally distinct cell types, but the DNA in all the cells originated from the same parent cells. The differences in the cells can be attributed to the differential gene expression. Liver cells, whose functions include detoxification of blood, production of bile to metabolize fats, and synthesis of proteins essential for metabolism, must express a specific set of genes to perform their functions. Gene expression also varies with...
16.5K
Cell Specific Gene Expression01:58

Cell Specific Gene Expression

5.6K
5.6K
Chromatin Position Affects Gene Expression02:35

Chromatin Position Affects Gene Expression

24.8K
Chromatin is the massive complex of DNA and proteins packaged inside the nucleus. The complexity of chromatin folding and how it is packaged inside the nucleus greatly influences  access to genetic information. Generally, the nucleus' periphery is considered transcriptionally repressive, while the cell's interior is considered a transcriptionally active area. 
Topologically Associated Domains (TADs)
The 3-dimensional positioning of chromatin in the nucleus influences the...
24.8K
mRNA Stability and Gene Expression02:51

mRNA Stability and Gene Expression

6.7K
The structure and stability of mRNA molecules regulates gene expression, as mRNAs are a key step in the pathway from gene to protein. In eukaryotes, the half-life of mRNA varies from a few minutes up to several days. mRNA stability is essential in growth and development. The absence of the proteins regulating its stability, such as tristetraprolin in mice, can cause systemic issues, including bone marrow overgrowth, inflammation, and autoimmunity.
Cis-acting Elements involved in mRNA stability
6.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Evaluation of analysis modes for RNA coexpression in single-cell and bulk tissue.

bioRxiv : the preprint server for biology·2026
Same author

Persistent hindrances to data re-use in single-cell genomics.

Scientific data·2026
Same author

Application of large language models to the annotation of cell lines and mouse strains in genomics data.

bioRxiv : the preprint server for biology·2026
Same author

Translating short-form Python exercises to other programming languages using diverse prompting strategies.

GigaScience·2025
Same author

Functional, Pharmacogenomic, and Immune Landscapes of Long Non-Coding RNAs in Cancer.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2025
Same author

The 'Be Prepared' programme: Enhancing local emergency preparedness through public education in Alberta.

Journal of business continuity & emergency planning·2025
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026
Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Feb 4, 2026

Using an Automated Cell Counter to Simplify Gene Expression Studies: siRNA Knockdown of IL-4 Dependent Gene Expression in Namalwa Cells
10:34

Using an Automated Cell Counter to Simplify Gene Expression Studies: siRNA Knockdown of IL-4 Dependent Gene Expression in Namalwa Cells

Published on: April 14, 2010

16.0K

Using semantic search to find publicly available gene-expression datasets.

Grace S Brown1, James Wengler1,2, Aaron Joyce S Fabelico1

  • 1Department of Biology, Brigham Young University, Provo, UT, 84602, United States.

Bioinformatics (Oxford, England)
|February 2, 2026
PubMed
Summary
This summary is machine-generated.

Language models can improve the discovery of relevant scientific datasets by summarizing descriptions into embeddings. This approach often outperforms traditional search engines for finding similar molecular data in repositories like Gene Expression Omnibus (GEO).

More Related Videos

Analyzing Tumor Gene Expression Factors with the CorExplorer Web Portal
08:00

Analyzing Tumor Gene Expression Factors with the CorExplorer Web Portal

Published on: October 11, 2019

7.9K
A Semantic Priming Event-related Potential ERP Task to Study Lexico-semantic and Visuo-semantic Processing in Autism Spectrum Disorder
08:17

A Semantic Priming Event-related Potential ERP Task to Study Lexico-semantic and Visuo-semantic Processing in Autism Spectrum Disorder

Published on: April 12, 2018

11.1K

Related Experiment Videos

Last Updated: Feb 4, 2026

Using an Automated Cell Counter to Simplify Gene Expression Studies: siRNA Knockdown of IL-4 Dependent Gene Expression in Namalwa Cells
10:34

Using an Automated Cell Counter to Simplify Gene Expression Studies: siRNA Knockdown of IL-4 Dependent Gene Expression in Namalwa Cells

Published on: April 14, 2010

16.0K
Analyzing Tumor Gene Expression Factors with the CorExplorer Web Portal
08:00

Analyzing Tumor Gene Expression Factors with the CorExplorer Web Portal

Published on: October 11, 2019

7.9K
A Semantic Priming Event-related Potential ERP Task to Study Lexico-semantic and Visuo-semantic Processing in Autism Spectrum Disorder
08:17

A Semantic Priming Event-related Potential ERP Task to Study Lexico-semantic and Visuo-semantic Processing in Autism Spectrum Disorder

Published on: April 12, 2018

11.1K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Data Science

Background:

  • Millions of high-throughput molecular datasets are publicly available, but finding relevant ones is challenging due to description inconsistencies and scale.
  • Discovering relevant datasets is crucial for data reuse and validation, yet current methods in repositories like Gene Expression Omnibus (GEO) are time-consuming and may miss important data.
  • This challenge directly impacts the FAIR data principles, emphasizing the need for improved dataset discovery mechanisms.

Purpose of the Study:

  • To evaluate the effectiveness of language models in enhancing dataset discovery within the Gene Expression Omnibus (GEO) repository.
  • To determine if language model-generated embeddings can identify relevant datasets more efficiently than existing search tools.
  • To explore the potential of natural language processing techniques for improving access to large-scale molecular data.

Main Methods:

  • Utilized 30 different language models to generate numerical representations (embeddings) of dataset descriptions from the Gene Expression Omnibus (GEO).
  • Focused on six human medical conditions, using datasets curated by humans as a reference.
  • Compared the performance of language model-based similarity searches against GEO's native search engine to find related datasets.

Main Results:

  • Language models, particularly those trained on general corpora using contrastive learning with large embeddings, often improved dataset discovery compared to GEO's search.
  • The approach successfully identified additional relevant datasets for specific medical conditions.
  • While effective, the language model approach was not universally superior to existing search methods in all tested scenarios.

Conclusions:

  • Language models show significant potential to enhance the discovery of scientific datasets by providing semantic summaries (embeddings).
  • This methodology can complement existing search tools, offering a more efficient way to find relevant molecular data.
  • Further development and integration of language models could greatly improve data accessibility and reuse in scientific research.