Short-Context Regulatory DNA Language Models with Motif-Discovery Regularization.

Aman Patel¹, Anshul Kundaje¹,²

  • ¹Department of Computer Science, School of Engineering, Stanford University.

bioRxiv: the preprint server for biology | February 12, 2026
Summary
This summary is machine-generated.

We developed ARSENAL, a short-context masked DNA language model, to improve the identification of regulatory DNA motifs and the prediction of variant effects. The approach advances the understanding of gene regulation and supports the design of functional DNA sequences.

Area of Science:

  • Genomics
  • Computational Biology
  • Bioinformatics

Background:

  • Self-supervised DNA language models (DNALMs) trained on whole genomes struggle with sparse, heterogeneous regulatory sequences and short motifs.
  • Existing annotation-agnostic DNALMs often underperform simpler models on regulatory tasks due to difficulties in learning regulatory syntax.

Purpose of the Study:

  • Introduce ARSENAL, a short-context masked DNA language model optimized for regulatory sequence analysis.
  • Improve the discovery of transcription factor motifs and prediction of regulatory variant effects.
  • Enhance supervised learning tasks like chromatin accessibility prediction and regulatory variant scoring.
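
To make the term "masked DNA language model" concrete, the sketch below shows how a short sequence might be prepared for masked training: some bases are hidden and the model is asked to recover them from the surrounding context. This is a generic illustration, not ARSENAL's actual pipeline. The mask symbol `N`, the 15% default mask rate, and the function name `mask_sequence` are all assumptions introduced here.

```python
import random

NUCLEOTIDES = "ACGT"
MASK = "N"  # hypothetical mask symbol; the summary does not describe the tokenization


def mask_sequence(seq, mask_rate=0.15, rng=None):
    """Hide a random subset of bases.

    Returns (masked_seq, targets), where targets maps each masked
    index to the original base the model would be trained to predict.
    The 15% default is a common masked-LM convention, assumed here.
    """
    rng = rng or random.Random(0)
    masked = list(seq)
    targets = {}
    for i, base in enumerate(seq):
        if base in NUCLEOTIDES and rng.random() < mask_rate:
            targets[i] = base
            masked[i] = MASK
    return "".join(masked), targets


# Usage: mask a short (made-up) regulatory-like sequence.
masked, targets = mask_sequence("TTGACGTCAGGCTAATGACTCATCC", mask_rate=0.2,
                                rng=random.Random(1))
print(masked)
print(targets)
```

The training objective would then score the model's predicted base distribution at each index in `targets` against the hidden base.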

Main Methods:

  • Trained ARSENAL on a functionally enriched regulatory corpus with a novel regularizer promoting motif discovery.
  • Evaluated ARSENAL's performance in zero-shot motif recovery and regulatory variant effect prediction.
  • Integrated ARSENAL embeddings into supervised models for chromatin accessibility prediction and variant scoring.
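
One common way to score a variant zero-shot with a masked language model is a log-likelihood ratio: mask the variant position, ask the model for a distribution over bases, and compare the probability of the alternate allele with that of the reference allele. The summary does not state which scoring rule ARSENAL uses, so the sketch below is only an illustration of that general idea. The function `toy_masked_probs` is a hypothetical stand-in for a trained model, and the motif `TGACTCA` is a toy choice.

```python
import math

BASES = "ACGT"
MOTIF = "TGACTCA"  # toy motif the stand-in "model" happens to know about


def toy_masked_probs(seq, pos):
    """Hypothetical stand-in for a masked DNA LM: P(base at pos | context).

    A base gets 8x weight if placing it at pos completes MOTIF in a
    window overlapping pos; otherwise all bases are equally likely.
    """
    start = max(0, pos - len(MOTIF) + 1)
    weights = {}
    for b in BASES:
        candidate = seq[:pos] + b + seq[pos + 1:]
        window = candidate[start:pos + len(MOTIF)]
        weights[b] = 8.0 if MOTIF in window else 1.0
    total = sum(weights.values())
    return {b: w / total for b, w in weights.items()}


def variant_score(seq, pos, alt, predict=toy_masked_probs):
    """Log-likelihood ratio log P(alt) - log P(ref) at the masked position."""
    probs = predict(seq, pos)
    return math.log(probs[alt] / probs[seq[pos]])


seq = "GGCATGACTCATCGG"              # contains MOTIF at positions 4..10
print(variant_score(seq, 7, "T"))    # variant inside the motif: negative score
print(variant_score(seq, 1, "A"))    # variant in the flank: score of 0.0
```

A strongly negative score means the model finds the alternate allele much less plausible than the reference in that context, which is the signal usually read as a disruptive variant. Swapping `predict` for a real model's masked prediction would leave `variant_score` unchanged.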

Main Results:

  • ARSENAL demonstrated superior recovery of transcription factor motifs *de novo* and improved prediction of regulatory variant effects compared to other DNALMs.
  • Incorporating ARSENAL embeddings significantly boosted supervised chromatin accessibility prediction accuracy across multiple cell types.
  • ARSENAL embeddings led to enhanced regulatory variant scoring and enabled targeted regulatory sequence design.
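
As a rough illustration of how motif-like regions can be read off a masked model's predictions, the sketch below computes the information content (in bits, against a uniform background) of a per-position base distribution and reports contiguous runs of confident positions. The summary does not describe ARSENAL's motif-extraction procedure, so this is a generic heuristic rather than the authors' method. The probability values, the 1-bit threshold, and the minimum run length are made up for the example.

```python
import math


def information_content(probs):
    """Bits of information relative to a uniform background over A, C, G, T."""
    entropy = -sum(p * math.log2(p) for p in probs.values() if p > 0)
    return 2.0 - entropy


def high_information_runs(profile, threshold=1.0, min_len=3):
    """Return (start, end) half-open intervals of consecutive positions
    whose predicted distribution carries at least `threshold` bits."""
    runs, start = [], None
    # A trailing None sentinel closes any run that reaches the end.
    for i, probs in enumerate(profile + [None]):
        confident = probs is not None and information_content(probs) >= threshold
        if confident and start is None:
            start = i
        elif not confident and start is not None:
            if i - start >= min_len:
                runs.append((start, i))
            start = None
    return runs


# Hypothetical per-position predictions: flat flanks around a sharp core.
uniform = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
sharp = {"A": 0.91, "C": 0.03, "G": 0.03, "T": 0.03}
profile = [uniform] * 4 + [sharp] * 5 + [uniform] * 3
print(high_information_runs(profile))
```

Positions where the model is confident about the masked base tend to be those constrained by sequence context, which is why such runs are candidate motif instances worth comparing against a known motif database.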

Conclusions:

  • ARSENAL effectively addresses limitations of large-scale DNALMs in capturing regulatory sequence features.
  • The model provides a powerful tool for motif discovery, variant effect prediction, and functional genomics analysis.
  • ARSENAL facilitates the design of regulatory sequences with specific functional properties.