Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.7K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.7K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

17.5K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
17.5K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.0K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.0K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

18.8K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
18.8K
RNA-seq03:21

RNA-seq

9.8K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
9.8K
Genetic Variation01:25

Genetic Variation

264
Genetic variation is the diversity in DNA sequences found among individuals of the same species. This diversity is crucial for a species' survival because it helps organisms adapt to environmental changes. Genetic variation begins with fertilization, where an egg and sperm cell merge. Each of these cells carries 23 chromosomes, up to 46 in the fertilized egg. Chromosomes are long DNA strands that contain genes, the basic units of heredity.
Genes exist in different versions called alleles,...
264

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Single-atom cocatalysts engineer proton microenvironments for efficient alkaline hydrogen evolution.

Chemical science·2026
Same author

Adaptive fault-tolerant control of mixed-order vehicles platoon with actuator saturation and disturbances.

Scientific reports·2026
Same author

Occupational health, risk factors, and protection among unmanned aerial vehicle operator in the high-altitude region of China: an observational study.

Frontiers in public health·2026
Same author

HiCMamba: Enhancing Hi-C resolution and identifying 3D genome structures with state space modeling.

PLoS computational biology·2026
Same author

Dynamic hydroxyl cycle of zeolite for short and ultra-short chain PFAS free potable water.

Nature communications·2026
Same author

Development and Validation of Prognostic Models in Patients With Stage IV Thyroid Cancer Undergoing Surgical Treatment.

The American surgeon·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 10, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

3.9K

DNASimCLR: a contrastive learning-based deep learning approach for gene sequence data classification.

Minghao Yang1,2, Zehua Wang2, Zizhuo Yan2

  • 1Shandong University, Weihai, People's Republic of China.

BMC Bioinformatics
|October 14, 2024
PubMed
Summary
This summary is machine-generated.

DNASimCLR, an unsupervised deep learning framework, effectively extracts features from microbial gene sequences. This method surpasses current techniques for gene sequence classification, offering a robust solution for genomics.

Keywords:
Biological sequence dataContrastive learningConvolutional neural networksRepresentation learningSequence classificationSimCLR

More Related Videos

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.0K
Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

8.6K

Related Experiment Videos

Last Updated: Jun 10, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

3.9K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.0K
Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

8.6K

Area of Science:

  • Genomics
  • Bioinformatics
  • Machine Learning

Background:

  • Deep neural networks advance microbial sequence data analysis.
  • Labeled microbial data scarcity hinders supervised learning.
  • Unsupervised learning is crucial for complex biological data.

Purpose of the Study:

  • Introduce DNASimCLR, an unsupervised framework for gene sequence feature extraction.
  • Address limitations of supervised learning with limited labeled microbial data.
  • Enhance the analysis of microbial sequence data.

Main Methods:

  • Utilize convolutional neural networks (CNNs) and the SimCLR framework.
  • Employ contrastive learning for feature extraction from diverse microbial gene sequences.
  • Pre-train on large-scale unlabeled metagenome and viral gene datasets.

Main Results:

  • DNASimCLR achieves performance comparable to state-of-the-art methods.
  • Outperforms existing CNN-based feature extraction techniques.
  • Demonstrates robust adaptability and superior performance across various biological sequence analysis tasks.

Conclusions:

  • DNASimCLR provides a robust, database-agnostic solution for gene sequence classification.
  • Effective for novel and unseen gene sequences, valuable for diverse genomics applications.
  • Advances unsupervised learning in microbial genomics.