Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

17.1K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
17.1K
RNA-seq03:21

RNA-seq

10.8K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
10.8K
Size and Structure of Viral Genomes01:26

Size and Structure of Viral Genomes

310
Viral genomes exhibit remarkable diversity in size, structure, and composition, influencing their replication strategies and interactions with host cells. These genomes consist of either DNA or RNA and may be linear or circular. Additionally, they can be single-stranded or double-stranded, with each configuration affecting how the virus propagates within a host. RNA viruses, for instance, generally have smaller genomes than DNA viruses, a factor that contributes to their high mutation rates and...
310

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Mapping the AI life sciences landscape in Greece: a bibliometric comparison with global patterns.

Scientific reports·2026
Same author

The deubiquitinase activity of CYLD is required for B cell differentiation.

Cell death & disease·2026
Same author

Open and sustainable AI: challenges, opportunities and the road ahead in the life sciences.

Nature methods·2026
Same author

Longitudinal profiling of antigen receptor gene repertoire dynamics in kidney transplant recipients after multiple SARS-CoV-2 vaccinations.

ImmunoHorizons·2026
Same author

An ELIXIR scoping review on domain-specific evaluation metrics for synthetic data in life sciences.

NAR genomics and bioinformatics·2026
Same author

Molecular and functional characterization of the B-cell receptor in chronic lymphocytic leukemia-like monoclonal B-cell lymphocytosis.

Leukemia·2026
Same journal

Potential role of the <i>Trpv4 c.1491+1G>A</i> mutation in pulmonary fibrosis in a gene-edited mouse model.

Frontiers in genetics·2026
Same journal

Utilization of whole exome sequencing to identify hereditary mutations in Palestinian families with hereditary cancers.

Frontiers in genetics·2026
Same journal

Research of N-acetyl-L-cysteine on CD40-CD40L pathway in pulmonary fibrosis induced by silicon dioxide.

Frontiers in genetics·2026
Same journal

Novel variants in LSS related hypotrichosis simplex 14.

Frontiers in genetics·2026
Same journal

Network-based analysis identifies shared mechanisms between ischemic stroke and myocardial infarction and therapeutic ingredients of Buyang Huanwu Decoction.

Frontiers in genetics·2026
Same journal

GWAS analysis of a depression cohort defined by an EHR-phenotyping algorithm reveals the role of immune regulations in depression risk.

Frontiers in genetics·2026
See all related articles

Related Experiment Video

Updated: Nov 2, 2025

In Vitro Selection of Aptamers to Differentiate Infectious from Non-Infectious Viruses
12:23

In Vitro Selection of Aptamers to Differentiate Infectious from Non-Infectious Viruses

Published on: September 7, 2022

1.9K

A Computational Framework for Pattern Detection on Unaligned Sequences: An Application on SARS-CoV-2 Data.

Nikolaos Pechlivanis1,2, Anastasios Togkousidis1, Maria Tsagiopoulou1

  • 1Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece.

Frontiers in Genetics
|June 14, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces a computational framework using unsupervised learning to identify characteristic k-mers in genome sequences. This approach aids in grouping genetic data and studying evolutionary signals without traditional alignment.

Keywords:
SARS-CoV-2feature selectionk-mersphylogeneticsunsupervised learning

More Related Videos

Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example
05:45

Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example

Published on: March 11, 2020

9.1K
Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells
10:34

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Published on: December 9, 2022

4.7K

Related Experiment Videos

Last Updated: Nov 2, 2025

In Vitro Selection of Aptamers to Differentiate Infectious from Non-Infectious Viruses
12:23

In Vitro Selection of Aptamers to Differentiate Infectious from Non-Infectious Viruses

Published on: September 7, 2022

1.9K
Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example
05:45

Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example

Published on: March 11, 2020

9.1K
Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells
10:34

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Published on: December 9, 2022

4.7K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • The increasing volume of genome sequences necessitates advanced pattern detection methods for evolutionary analysis.
  • Traditional phylogenetic reconstruction relies on multiple sequence alignment and positional homology, which can be challenging with large datasets.
  • Delineating species and extracting evolutionary signals from vast biological data remains a complex computational problem.

Purpose of the Study:

  • To propose a novel computational framework for identifying meaningful features in unaligned sequence data.
  • To leverage unsupervised learning techniques for characteristic k-mer discovery across various k-values.
  • To utilize identified k-mers for sequence clustering and analyzing k-mer distribution shifts.

Main Methods:

  • Development of a Python-based computational framework utilizing unsupervised learning.
  • Identification of characteristic k-mers from unaligned sequence data for a range of k-values.
  • Application of k-mers as features for clustering sequences and analyzing k-mer distribution patterns.

Main Results:

  • Successful identification of characteristic k-mers within unaligned genome sequences.
  • Effective clustering of sequence data based on identified k-mer features.
  • Demonstration of analyzing changes in k-mer distributions across different k-values.

Conclusions:

  • The proposed framework offers a promising approach for grouping sequence data based on inherent features.
  • This method facilitates the study of evolutionary signals by analyzing k-mer distribution dynamics.
  • The open-source Python software provides a freely accessible tool for bioinformatics research.