Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

How Data are Classified: Numerical Data00:59

How Data are Classified: Numerical Data

37.0K
Data that are countable or measurable in specific units are called numerical or quantitative data. Quantitative data are always numbers. Quantitative data are the result of counting or measuring the attributes of a population. Amount of money, pulse rate, weight, number of people living in a town, and number of students who opt for statistics are examples of quantitative data.
Quantitative data may be either discrete or continuous. All quantitative data that take on only specific numerical...
37.0K
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

43.0K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
43.0K
Classifying Matter by Composition03:35

Classifying Matter by Composition

89.8K
Matter: Pure Substances and Mixtures
According to its composition, the matter can be classified into two broad categories — pure substances and mixtures. 
A pure substance is a form of matter that has a constant composition throughout with uniform properties. For example, any sample of sucrose has the same composition and same physical properties, such as melting point, color, and sweetness, regardless of the source from which it is isolated. 
A mixture is composed of two or...
89.8K
Classifying Matter by State02:49

Classifying Matter by State

102.7K
Chemistry is the study of matter and the changes it undergoes. Matter is anything that has mass and occupies space. Matter is all around us; the air, water, soil, mountains, even our bodies are all examples of matter. Matter is divided into three states — solid, liquid, and gas — that are commonly found on earth. The fourth state of matter, plasma, occurs naturally in the interiors of stars. 
102.7K
Cis-regulatory Sequences02:02

Cis-regulatory Sequences

11.6K
Cis-regulatory sequences are short fragments of non-coding DNA that are present on the same chromosomes as the genes that they regulate. These fragments serve as binding sites for transcriptional regulators, proteins that are responsible for controlling gene transcription and differential gene expression across cell types in eukaryotes. Cis-regulatory sequences can be close to the gene of interest or thousands of bases away in the DNA sequence; however, those sequences that are further away are...
11.6K
Trial and Error and Algorithm01:12

Trial and Error and Algorithm

399
A problem-solving strategy is a plan of action used to find a solution. Different strategies have distinct action plans. Trial and error involves trying different solutions until one works. For instance, to fix a broken printer, you might check ink levels, ensure the paper tray isn't jammed, and verify the printer's connection to your laptop. This method can be time-consuming but is commonly used. Thomas Edison, for example, used trial and error to find a suitable filament for the light...
399

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adverse and adaptive placental DNA methylation changes linking prenatal air pollution exposure to child lung function: findings from the SEPAGES cohort.

EBioMedicine·2026
Same author

A sorghum pangenome reference improves global crop trait discovery.

Nature·2026
Same author

SeqManager: a web-based tool for efficient sequencing data storage management and duplicate detection.

Bioinformatics advances·2025
Same author

DNA methylation predicts lung function and pulmonary exacerbation in sputum samples from patients with cystic fibrosis.

Clinical epigenetics·2025
Same author

Targeting FOXA1 and FOXA2 disrupts the lineage-specific oncogenic output program in prostate cancer.

Cell reports·2025
Same author

Immediate and durable effects of maternal tobacco consumption on placental DNA methylation: a replication and discovery study.

Environmental epigenetics·2025
Same journal

Ciliary flow and morphology shape mass transport at the surface and within gastrovascular cavities of black corals.

Communications biology·2026
Same journal

Virus-mediated prokaryotic community adaptation dynamics under thermal stress in municipal organic solid waste microbiomes.

Communications biology·2026
Same journal

Multi-omics insights into the woolly trait of Saussurea medusa and the plant's coordinated regulation of flavonoid biosynthesis.

Communications biology·2026
Same journal

Loss contexts enhance dorsolateral prefrontal interpersonal neural synchrony during successful human deceptive recommendations.

Communications biology·2026
Same journal

Neuro-regulator role of H<sub>2</sub>S in astrocyte activation and its effects on neurological damage and behavior of VPA-exposed rats.

Communications biology·2026
Same journal

Temporal orchestration of transcriptional and epigenomic programming underlying maternal embryonic diapause in a cricket model.

Communications biology·2026
See all related articles

Related Experiment Video

Updated: Jan 23, 2026

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.5K

GECKO is a genetic algorithm to classify and explore high throughput sequencing data.

Aubin Thomas1, Sylvain Barriere1, Lucile Broseus1

  • 11Institute of Human Genetics, CNRS UPR1142, Machine learning and gene regulation, University of Montpellier, Montpellier, France.

Communications Biology
|June 27, 2019
PubMed
Summary
This summary is machine-generated.

This study introduces GECKO, a novel bioinformatics approach for analyzing high throughput sequencing data. GECKO classifies and discovers genetic sequences without mapping, reducing bias and data loss in cancer and other studies.

Keywords:
Machine learningPredictive medicine

More Related Videos

Two Algorithms for High-throughput and Multi-parametric Quantification of Drosophila Neuromuscular Junction Morphology
12:29

Two Algorithms for High-throughput and Multi-parametric Quantification of Drosophila Neuromuscular Junction Morphology

Published on: May 3, 2017

11.0K
High-Throughput Analysis of Optical Mapping Data Using ElectroMap
07:36

High-Throughput Analysis of Optical Mapping Data Using ElectroMap

Published on: June 4, 2019

10.0K

Related Experiment Videos

Last Updated: Jan 23, 2026

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.5K
Two Algorithms for High-throughput and Multi-parametric Quantification of Drosophila Neuromuscular Junction Morphology
12:29

Two Algorithms for High-throughput and Multi-parametric Quantification of Drosophila Neuromuscular Junction Morphology

Published on: May 3, 2017

11.0K
High-Throughput Analysis of Optical Mapping Data Using ElectroMap
07:36

High-Throughput Analysis of Optical Mapping Data Using ElectroMap

Published on: June 4, 2019

10.0K

Area of Science:

  • Bioinformatics
  • Genomics
  • Computational Biology

Background:

  • Comparative analysis of high throughput sequencing data typically requires mapping reads to a reference genome, which can introduce bias and data loss.
  • This mapping step is particularly problematic in studies involving patient data, such as cancer genomics, where individual variations from reference genomes are common.

Purpose of the Study:

  • To develop a novel bioinformatics approach and software that bypasses the need for read mapping in high throughput sequencing data analysis.
  • To enable comprehensive exploration, classification, and discovery of genetic sequences without relying on intensive bioinformatics pipelines.

Main Methods:

  • Utilizes advances in genetic algorithms and feature selection for data exploration.
  • Introduces GECKO (GEnetic Classification using k-mer Optimization), a software tool designed for sequence analysis.
  • Applies the approach to diverse sequencing data types, including mRNA, microRNA, and DNA methylome data.

Main Results:

  • Demonstrates the effectiveness of GECKO in classifying and extracting meaningful sequences from complex datasets.
  • Shows that the GECKO approach significantly reduces bias and potential data loss associated with traditional mapping methods.
  • Successfully applied to multiple sequencing data types, highlighting its versatility.

Conclusions:

  • GECKO offers a robust and efficient alternative to traditional mapping-based methods for high throughput sequencing data analysis.
  • The approach is effective across various sequencing modalities, including transcriptomic and epigenomic data.
  • This method has the potential to improve the accuracy and comprehensiveness of genetic sequence discovery and classification, particularly in clinical applications like cancer research.