Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.8K
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

836
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
836
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

3.3K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
3.3K
Classification of Systems-I01:26

Classification of Systems-I

742
Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
742
Classification of Systems-II01:31

Classification of Systems-II

651
Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,
651
Applications of Molecular Taxonomy01:20

Applications of Molecular Taxonomy

705
Molecular taxonomy has revolutionized the understanding and classification of bacteria, providing precise insights into their diversity, evolutionary relationships, and ecological roles. By utilizing molecular techniques such as DNA sequencing and fingerprinting, researchers have made significant strides in various fields related to bacterial studies.Resolving Taxonomic AmbiguitiesMolecular taxonomy has been instrumental in distinguishing closely related bacterial species initially thought to...
705

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A TAD-informed aging-brain xQTL atlas of multi-modal and cell-type-resolved regulatory variation.

medRxiv : the preprint server for health sciences·2026
Same author

Basic Science and Pathogenesis.

Alzheimer's & dementia : the journal of the Alzheimer's Association·2025
Same author

Integrated genomic analysis and CRISPRi implicates <i>EGFR</i> in Alzheimer's disease risk.

NPJ dementia·2025
Same author

BTS: a scalable Bayesian Tissue Score for prioritizing GWAS variants and their functional contexts across >1000s of omics datasets.

Bioinformatics (Oxford, England)·2025
Same author

Multi-ancestry genome-wide meta-analysis of 56,241 individuals identifies known and novel cross-population and ancestry-specific associations as novel risk loci for Alzheimer's disease.

Genome biology·2025
Same author

Integrated genomic analysis and CRISPRi implicates <i>EGFR</i> in Alzheimer's disease risk.

medRxiv : the preprint server for health sciences·2025
Same journal

circ2DGNN: circRNA-Disease Association Prediction via Transformer-Based Graph Neural Network.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Hierarchical Hypergraph Learning in Association- Weighted Heterogeneous Network for miRNA- Disease Association Identification.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Discriminative Domain Adaption Network for Simultaneously Removing Batch Effects and Annotating Cell Types in Single-Cell RNA-Seq.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

MLW-BFECF: A Multi-Weighted Dynamic Cascade Forest Based on Bilinear Feature Extraction for Predicting the Stage of Kidney Renal Clear Cell Carcinoma on Multi-Modal Gene Data.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

An End-to-End Knowledge Graph Fused Graph Neural Network for Accurate Protein-Protein Interactions Prediction.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Generative Biomedical Event Extraction With Constrained Decoding Strategy.

IEEE/ACM transactions on computational biology and bioinformatics·2024
See all related articles

Related Experiment Video

Updated: May 4, 2026

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

3.2K

Biological sequence classification with multivariate string kernels.

Pavel P Kuksa1

  • 1NEC Laboratories America Inc, Princeton.

IEEE/ACM Transactions on Computational Biology and Bioinformatics
|January 4, 2014
PubMed
Summary
This summary is machine-generated.

New multivariate string kernels improve biological sequence classification. These methods enhance analysis of complex protein sequences, achieving significant performance gains over existing techniques.

More Related Videos

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles
10:23

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles

Published on: July 11, 2025

720
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K

Related Experiment Videos

Last Updated: May 4, 2026

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

3.2K
A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles
10:23

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles

Published on: July 11, 2025

720
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K

Area of Science:

  • Bioinformatics
  • Machine Learning
  • Computational Biology

Background:

  • String kernel methods excel in structured/sequential data analysis, including biological sequence classification.
  • Current methods primarily analyze discrete 1D string data, limiting analysis of complex biological sequences.
  • State-of-the-art performance is achieved in tasks like remote homology detection and protein superfamily prediction.

Purpose of the Study:

  • To develop and evaluate novel multivariate string kernels for multiclass biological sequence classification.
  • To leverage multivariate representations, such as sequences of feature vectors (e.g., protein profiles, physicochemical descriptors), for enhanced sequence analysis.
  • To improve upon existing sequence classification methods by utilizing richer data representations.

Main Methods:

  • Utilized multivariate representations of biological sequences, including sequences of feature vectors.
  • Developed and applied a class of multivariate string kernels designed to exploit these representations.
  • Tested the proposed methods on three distinct protein sequence classification tasks.

Main Results:

  • The proposed multivariate representations and kernels demonstrated significant performance improvements.
  • Achieved 15-20 percent accuracy gains compared to existing state-of-the-art sequence classification methods.
  • Effectively addressed multiclass biological sequence classification challenges.

Conclusions:

  • Multivariate string kernels offer a powerful approach for analyzing complex biological sequence data.
  • The developed methods provide a substantial advancement over traditional 1D string kernel techniques.
  • This work advances the field of biological sequence classification with improved accuracy and efficiency.