Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

Types of Aggregate Grading

Types of Aggregate Grading

Aggregate grading is crucial in economically obtaining a concrete mix with adequate strength, reasonable workability, and minimal segregation. There are four types of aggregate gradation: well-graded, uniformly (or one-sized) graded, gap-graded, and open-graded.
Well-graded aggregates include a complete range of necessary size fractions that fit together to create a dense matrix with minimal voids, represented by a smooth, continuous gradation curve. This type of grading ensures good...

Mass Analyzers: Common Types

Mass Analyzers: Common Types

The quadrupole mass analyzer consists of four cylindrical metal rods arranged in a diamond carrying a DC voltage and a radio-frequency AC voltage. The motion of ions through the quadrupole depends on the field strength, causing only ions of a certain m/z to resonate successfully and strike the detector at a given field strength. Though the transmission rate for these analyzers is high, the exact elemental composition of the sample is not determined because of low resolution; however, they are...

Data Collection by Observations

Data Collection by Observations

Data collection refers to a systematic way of obtaining, observing, measuring, and analyzing accurate information. Observational studies are one of the most widely used methods of data collection. It involves collecting data by observing the behavior and physical characteristics of a sample without making any modifications to the sample.
An astronomer viewing the motion and brightness of stars in the sky and recording the data is an example of observational data collection. A botanist recording...

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

How Data are Classified: Categorical Data

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Origin and Evolution of Very Large Extracellular Proteins in Fructophilic Lactic Acid Bacteria.

Genome biology and evolution·2026

Same author

Origin and Evolution of Key Enzymes in the Anammox Pathway Revisited.

Genome biology and evolution·2025

Same author

Host-Specific Adaptation of Legionella pneumophila to Single and Multiple Hosts.

Molecular biology and evolution·2025

Same author

Host-bacteria interactions: ecological and evolutionary insights from ancient, professional endosymbionts.

FEMS microbiology reviews·2024

Same author

Phylogeny and Expansion of Serine/Threonine Kinases in Phagocytotic Bacteria in the Phylum Planctomycetota.

Genome biology and evolution·2024

Same author

<i>Apilactobacillus kunkeei</i> releases RNA-associated membrane vesicles and proteinaceous nanoparticles.

microLife·2023

Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026

Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026

Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026

Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026

Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026

Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 9, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

TADA: taxonomy-aware dataset aggregator.

Emil Hägglund¹, Siv G E Andersson¹, Lionel Guy²

¹Molecular Evolution, Department of Cell and Molecular Biology, Science for Life Laboratory, Biomedical Centre, Uppsala University, SE-751 24 Uppsala, Sweden.

Bioinformatics (Oxford, England)

|December 7, 2023

Summary

This summary is machine-generated.

Selecting representative genomes for bacterial and archaeal phylogenetic analysis is crucial. TADA (Taxonomic-Aware Dataset selection) is a new workflow that automates this process, ensuring quality and diversity in genomic datasets.

More Related Videos

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

Related Experiment Videos

Last Updated: Jul 9, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

Area of Science:

Genomics
Bioinformatics
Evolutionary Biology

Background:

The increasing number of sequenced bacterial and archaeal genomes enables advanced phylogenetic and comparative genomic studies.
However, utilizing all available genomic data for phylogenetic reconstruction is computationally challenging and can introduce biases due to uneven distribution across diversity.

Purpose of the Study:

To develop a user-friendly software solution for efficient and reliable subsampling of prokaryotic genomes for phylogenetic analysis.
To address the need for automated taxonomic-aware dataset selection in large-scale genomic studies.

Main Methods:

Implementation of TADA as a Snakemake workflow.
Development of a taxonomic-aware dataset selection process with adjustable granularity.
Inclusion of genome quality control and branch-balancing parameters.

Main Results:

TADA facilitates the selection of representative genomic subsets from diverse prokaryotic lineages.
The workflow allows for user-defined sampling strategies across prokaryotic diversity.
Constraints on genome quality and phylogenetic balance are integrated into the selection process.

Conclusions:

TADA provides a practical solution for constructing high-quality, diverse genomic datasets for phylogenetic inference.
This tool enhances the feasibility of large-scale phylogenetic analyses in prokaryotes.
Automated, taxonomic-aware subsampling improves the efficiency and accuracy of comparative genomics.