Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Per-Unit Sequence Models01:26

Per-Unit Sequence Models

129
An ideal Y-Y transformer, grounded through neutral impedances, displays per-unit sequence networks akin to those of a single-phase ideal transformer when subjected to balanced positive- or negative-sequence currents. These currents do not produce neutral currents, and their associated voltage drops.
Zero-sequence currents, which are identical in magnitude and phase, generate a neutral current, resulting in voltage drops across the neutral impedance and the low-voltage winding. If the...
129
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.3K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.3K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.3K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.3K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.4K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.4K
RNA-seq03:21

RNA-seq

10.5K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
10.5K
Conserved Binding Sites01:49

Conserved Binding Sites

4.5K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
4.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Interpretable machine learning for personalized breast cancer screening recommendations.

Health care management science·2026
Same author

Predicting offer burden to optimize batch sizes in simultaneously expiring kidney offers.

Frontiers in artificial intelligence·2025
Same author

Association Between Delayed Discharge From Acute Care and Rehabilitation Outcomes and Length of Stay: A Retrospective Cohort Study.

Archives of physical medicine and rehabilitation·2022
Same author

[Collateral supply in patients with severe carotid stenosis].

Zhonghua yi xue za zhi·2007
Same author

[Changes of sleep architecture in patients with narcolepsy].

Zhonghua yi xue za zhi·2007
Same author

[Combined anterior and posterior approach for cervical fracture-dislocation with ankylosing spondylitis].

Zhonghua wai ke za zhi [Chinese journal of surgery]·2007
Same journal

Designing an mHealth App for Defecatory Function Rehabilitation of Post-Surgical Colorectal Cancer Survivors.

Journal of healthcare informatics research·2026
Same journal

A Literature Review on Example-Based Explanations in Medical Image Analysis.

Journal of healthcare informatics research·2026
Same journal

A Scoping Review of Synthetic Data Generation by Language Models in Biomedical Research and Application: Data Utility and Quality Perspectives.

Journal of healthcare informatics research·2026
Same journal

Towards Accurate and Reliable ICU Outcome Prediction: A Multimodal Learning Framework Based on Belief Function Theory using Structured EHRs and Free-Text Notes.

Journal of healthcare informatics research·2026
Same journal

Multimodal Federated Learning in Healthcare: A Review.

Journal of healthcare informatics research·2026
Same journal

A Hybrid Language Framework for Ontology-Based Clinical Concept Extraction.

Journal of healthcare informatics research·2026
See all related articles

Related Experiment Video

Updated: Sep 27, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.1K

A Data Adaptive Biological Sequence Representation for Supervised Learning.

Hande Cakin1, Berk Gorgulu1, Mustafa Gokce Baydogan1

  • 1Department of Industrial Engineering, Boğaziçi University, İstanbul, Turkey.

Journal of Healthcare Informatics Research
|April 13, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces SW-RF, a novel machine learning approach for analyzing biological sequences. SW-RF effectively represents DNA and protein sequences, improving gene expression prediction and handling missing data in microarrays.

Keywords:
Biological sequencesCategoricalClassificationGene expressionRepresentation learningTime series

More Related Videos

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

8.8K
Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.3K

Related Experiment Videos

Last Updated: Sep 27, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.1K
Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

8.8K
Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.3K

Area of Science:

  • Genomics
  • Bioinformatics
  • Machine Learning

Background:

  • Gene expression is crucial for organism function, and DNA microarray technology enables large-scale monitoring.
  • Understanding gene regulation mechanisms requires analyzing the relationship between gene expression and nucleotide sequences.
  • Identifying local DNA elements (motifs) is key to inferring these regulatory relationships.

Purpose of the Study:

  • To propose a novel data-adaptive representation approach for supervised learning on biological sequences.
  • To develop a method for predicting biological responses based on sequence data.
  • To address challenges in high-dimensionality and missing values common in biological sequence analysis.

Main Methods:

  • The study introduces the Sliding Window-Random Forest (SW-RF) method for categorical sequence representation.
  • SW-RF represents sequences using overlapping subsequences and a tree-based learner to create a bag-of-words-like representation.
  • A lasso logistic regression classifier is trained on the learned representation to identify important patterns.

Main Results:

  • The SW-RF approach demonstrated significantly improved accuracy on synthetic and DNA promoter sequence data.
  • The method efficiently handles missing values in microarray datasets, a common challenge.
  • The learned representation allows for the application of various classifiers for pattern identification.

Conclusions:

  • SW-RF provides an effective feature-based representation for categorical sequences, particularly in bioinformatics.
  • The approach enhances the accuracy of predicting biological responses from sequence data.
  • SW-RF's flexibility makes it applicable to diverse categorical sequence data beyond biological applications.