Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.7K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.7K
DNA as a Genetic Template02:05

DNA as a Genetic Template

21.7K
Two structural features of the DNA molecule provide a basis for the mechanisms of heredity: the four nucleotide bases and its double-stranded nature. The Watson-Crick model of double-helical DNA structure, proposed in 1952, drew heavily upon the X-ray crystallography work of researchers Rosalind Franklin and Maurice Wilkins. Watson, Crick, and Wilkins jointly received the Nobel Prize in Physiology or Medicine for their work in 1962. Franklin was, controversially, excluded from the prize for...
21.7K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

3.9K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
3.9K
Nucleic Acid Structure01:25

Nucleic Acid Structure

5.9K
The pentose sugar in DNA is deoxyribose, while in RNA the pentose sugar is ribose. The difference between the sugars is the presence of the hydroxyl group on the ribose's second carbon and a hydrogen on the deoxyribose's second carbon. The phosphate residue attaches to the hydroxyl group of the 5′ carbon of one sugar and the hydroxyl group of the 3′ carbon of the sugar of the next nucleotide, which forms  a 5′ to 3′ phosphodiester linkage.
DNA Structure
DNA...
5.9K
DNA Microarrays02:34

DNA Microarrays

17.2K
Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
17.2K
The Nucleosome01:19

The Nucleosome

1.3K
Human DNA is almost two meters long. However, it is compressed inside a tiny nucleus measuring only a few microns in diameter. To make this degree of compaction possible, DNA is organized into several sequential levels so that it can fit into such a tiny space. The most compact form of DNA is a chromosome that can be seen under a microscope in a dividing cell.
In a chromosome, DNA is wound twice around a protein complex called a histone octamer core, which consists of 8 histone proteins. This...
1.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

An angular-resolved multi-channel Thomson parabola spectrometer for laser-driven ion measurement.

The Review of scientific instruments·2018
Same author

Framework of Cytochrome/Vitamin B<sub>2</sub> Linker/Graphene for Robust Microbial Electricity Generation.

ACS applied materials & interfaces·2018
Same author

Symbolic time series analysis of fNIRS signals in brain development assessment.

Journal of neural engineering·2018
Same author

Nickel(0)-Catalyzed Hydroalkylation of 1,3-Dienes with Simple Ketones.

Journal of the American Chemical Society·2018
Same author

A morphological classification for vocal fold leukoplakia.

Brazilian journal of otorhinolaryngology·2018
Same author

NCycDB: a curated integrative database for fast and accurate metagenomic profiling of nitrogen cycling genes.

Bioinformatics (Oxford, England)·2018
Same journal

TraNce: Type-aware hypergraph neural network with biological mediators for drug repositioning.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Decentralized ADMM for factorization-based Low-rank matrix estimation.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Memristive neuromorphic circuit design inspired by the neural mechanisms of conditioned fear.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Q-learning based asynchronous Boolean control networks stabilization with data loss.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

New results on prescribed-time synchronization of complex networks via intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Variance-constrained multi-view ensemble broad network for imbalanced data.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Jun 4, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

3.9K

A sparse and wide neural network model for DNA sequences.

Tong Yu1, Lei Cheng1, Ruslan Khalitov1

  • 1Norwegian University of Science and Technology, Trondheim, Norway.

Neural Networks : the Official Journal of the International Neural Network Society
|December 22, 2024
PubMed
Summary
This summary is machine-generated.

SwanDNA, a novel neural network, accurately models long DNA sequences using a sparse, wide architecture and self-supervised learning. This approach requires fewer labels for precise predictions in genomic tasks.

Keywords:
DNA sequence modelingNeural networkSelf-supervised pretraining

More Related Videos

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning
04:17

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

Published on: May 10, 2024

667
Analyzing and Building Nucleic Acid Structures with 3DNA
16:24

Analyzing and Building Nucleic Acid Structures with 3DNA

Published on: April 26, 2013

20.5K

Related Experiment Videos

Last Updated: Jun 4, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

3.9K
DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning
04:17

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

Published on: May 10, 2024

667
Analyzing and Building Nucleic Acid Structures with 3DNA
16:24

Analyzing and Building Nucleic Acid Structures with 3DNA

Published on: April 26, 2013

20.5K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Accurate DNA sequence modeling is crucial for understanding biological functions.
  • Existing deep neural networks struggle with long-range interactions and require extensive labeled data.

Purpose of the Study:

  • To develop a novel neural network model, SwanDNA, capable of modeling long DNA sequences.
  • To overcome the limitations of existing models in capturing distant semantic relationships and reducing reliance on supervised labels.

Main Methods:

  • Utilized a sparse and wide network architecture for efficient processing of long DNA sequences.
  • Integrated a self-supervised learning framework to minimize the need for supervised labels.

Main Results:

  • SwanDNA demonstrated superior performance in predicting human variant effects and detecting open chromatin regions in plant genes.
  • Achieved state-of-the-art results on seven out of eight datasets in the GenomicBenchmarks.
  • Successfully enabled inferences over very long DNA sequences.

Conclusions:

  • SwanDNA offers a powerful and efficient solution for DNA sequence modeling, particularly for long sequences.
  • The self-supervised approach significantly reduces the practical cost associated with obtaining labeled data.
  • The model's performance across diverse tasks highlights its versatility and effectiveness in genomic applications.