Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

3.7K
3.7K
Improving Translational Accuracy02:07

Improving Translational Accuracy

15.4K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
15.4K
Aggregates Classification01:29

Aggregates Classification

1.1K
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Machine Learning-Based Frailty Prediction and Classification in Community-Dwelling Older Adults: A Systematic Review of Validation, Explainability, and Implementation Readiness.

Healthcare (Basel, Switzerland)·2026
Same author

GeoFusion-3D: Multi-Scale Geomorphic Feature Fusion for Landslide Scar Detection Using UAV-Mounted LiDAR.

Sensors (Basel, Switzerland)·2026
Same author

Development of machine learning models with explainable AI for frailty risk prediction and their web-based application in community public health.

Frontiers in public health·2025
Same author

A Cross-Sectional Social Network Analysis of Decision-Making About Recruiting a Living Donor for Kidney Transplantation.

Kidney medicine·2025
Same author

Using Machine Learning to Predict Treatment Outcome in a Concatenated Dataset of Youth Anxiety Treatments.

Child psychiatry and human development·2025
Same author

Leveraging multi-modal data for early prediction of severity in forced transmission outages with hierarchical spatiotemporal multiplex networks.

PloS one·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Mar 15, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

A robust data scaling algorithm to improve classification accuracies in biomedical data.

Xi Hang Cao1, Ivan Stojkovic1,2, Zoran Obradovic3

  • 1Center for Data Analytics and Biomedical Informatics, College of Science and Technology, Temple University, 1925 North 12th Street, Philadelphia, 19122, USA.

BMC Bioinformatics
|September 11, 2016
PubMed
Summary
This summary is machine-generated.

A new Generalized Logistic (GL) algorithm offers effective data scaling for machine learning in biomedical informatics. This method improves classification accuracy and is robust to outliers, outperforming standard scaling techniques.

Keywords:
Classification modelData normalizationData scalingEmpirical cumulative distribution functionGeneralized logistic functionOutlier

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K

Related Experiment Videos

Last Updated: Mar 15, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K

Area of Science:

  • Biomedical Informatics
  • Machine Learning
  • Data Science

Background:

  • Machine learning models are increasingly used in biomedical research for knowledge discovery and decision support.
  • Data preprocessing, particularly data scaling, is crucial but often overlooked in favor of model accuracy.
  • Existing scaling methods like Min-max and Z-score may not be optimal for all biomedical datasets, especially those with outliers or limited samples.

Purpose of the Study:

  • To introduce and evaluate the Generalized Logistic (GL) algorithm for data scaling in machine learning.
  • To demonstrate the effectiveness of the GL algorithm in improving classification accuracy for biomedical applications.
  • To compare the GL algorithm against commonly used data scaling methods.

Main Methods:

  • The Generalized Logistic (GL) algorithm was developed to scale data uniformly by fitting a generalized logistic function to the empirical cumulative distribution function.
  • The GL algorithm is designed to be intrinsically robust to outliers and performs nonlinear data scaling.
  • Experiments were conducted on 16 binary classification tasks across various applications to assess performance.

Main Results:

  • Models trained on data scaled using the GL algorithm demonstrated superior performance compared to Min-max and Z-score scaling.
  • Performance was evaluated using the area under the receiver operating characteristic curve (AUROC) and percentage of correct classification.
  • The GL algorithm showed improved accuracy in classification tasks, particularly beneficial for small sample sizes common in clinical settings.

Conclusions:

  • The proposed GL algorithm is a simple yet effective data preprocessing technique for machine learning in biomedical informatics.
  • Its robustness to outliers eliminates the need for separate denoising or outlier detection steps.
  • Empirical results confirm that the GL algorithm enhances model accuracy compared to traditional scaling methods.