Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Randomized Experiments

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Benchmarking reliability and calibration of LLMs for multi-cancer early detection test communication.

JAMIA open·2026

Same author

Pan-Cancer Genomic Scars of Alternative End Joining and Single-Strand Annealing.

bioRxiv : the preprint server for biology·2026

Same author

Multivariate causal effects: a Bayesian causal regression factor model.

Biometrics·2026

Same author

A Longitudinal Comprehensive Biospecimen and Clinical Data Repository for Cancer Early Detection: The InAdvance Study.

Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology·2026

Same author

A Gene Expression Tumor Signature Optimizing Partial Area-Under-the-Curve (pAUC) to Improve Specificity for Indolent Prostate Cancer.

The Prostate·2026

Same author

Web-Based User Interface for Fam3PRO: A Multigene, Multicancer Risk Prediction Model for Families With Cancer History.

JCO clinical cancer informatics·2026

Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026

Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026

Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026

Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026

Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026

Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Nov 28, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Robustifying genomic classifiers to batch effects via ensemble learning.

Yuqing Zhang¹, Prasad Patil², W Evan Johnson^2,3

¹Clinical Bioinformatics, Gilead Sciences, Inc., Foster City, CA 94404, USA.

Bioinformatics (Oxford, England)

|November 27, 2020

Summary

This summary is machine-generated.

Batch effects in genomic data can skew results. An ensemble learning approach offers robust performance for genomic classification, especially when batch effects are severe, outperforming traditional merging and adjustment methods.

More Related Videos

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Identification and Classification of Position-specific GABA_A Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Related Experiment Videos

Last Updated: Nov 28, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Identification and Classification of Position-specific GABA_A Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Area of Science:

Genomics
Bioinformatics
Machine Learning

Background:

Genomic data often exhibits batch effects due to production in separate batches, impacting downstream analyses.
Traditional methods merge data and adjust for batch effects, but may not be optimal for all scenarios.

Purpose of the Study:

To propose and evaluate an ensemble learning strategy for handling batch effects in genomic data.
To compare the proposed ensemble method against traditional batch adjustment techniques.

Main Methods:

Developed prediction models within individual batches.
Integrated batch-specific models using ensemble weighting methods.
Systematically compared ensemble learning with merging and batch adjustment using tuberculosis genomic data.

Main Results:

Ensemble learning demonstrated more robust performance in genomic classification, particularly with high batch effect severity.
Merging followed by batch adjustment showed better discrimination at low heterogeneity levels.
The study provides practical guidelines for handling batch effects in genomic classifier development.

Conclusions:

Ensemble learning offers a powerful alternative for managing batch effects in genomic data.
The choice of method depends on the severity of batch effects and desired robustness.
This work contributes to more reliable genomic data analysis and prediction.