Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

An ECG foundation model for generalizable cardiac function prediction across the lifespan.

medRxiv : the preprint server for health sciences·2026
Same author

K<sub>2</sub>O Encapsulation-Decomposition Mechanism: Unlocking Closed-Pore Engineering in Hard Carbon Anode for Sodium-Ion Batteries.

Inorganic chemistry·2026
Same author

Towards symbolic regression for interpretable clinical decision scores.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026
Same author

Automated Echocardiographic Detection of Congenital Heart Disease Using Artificial Intelligence.

Circulation·2026
Same author

The Benefit of the Doubt Phenomenon in Emergency Triage Assignment Disparities.

medRxiv : the preprint server for health sciences·2026
Same author

Deep Learning-Based Automated Echocardiographic Measurements in Pediatric and Congenital Heart Disease.

medRxiv : the preprint server for health sciences·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026
Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026
Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026
Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Oct 16, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K

PMLB v1.0: an open-source dataset collection for benchmarking machine learning methods.

Joseph D Romano1,2, Trang T Le1, William La Cava1

  • 1Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA 19104, USA.

Bioinformatics (Oxford, England)
|October 22, 2021
PubMed
Summary
This summary is machine-generated.

The Penn Machine Learning Benchmarks (PMLB) offers a comprehensive, user-friendly collection of benchmark datasets for evaluating machine learning and data science methods. This updated release enhances accessibility and integration into data science workflows.

More Related Videos

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

1.2K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.0K

Related Experiment Videos

Last Updated: Oct 16, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

1.2K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.0K

Area of Science:

  • Machine Learning
  • Data Science
  • Statistical Modeling

Background:

  • Standardized benchmark datasets are crucial for comparing novel machine learning and statistical modeling methods.
  • Existing tools lack a unified, user-friendly interface for accessing diverse benchmark datasets.
  • Integration with popular data science workflows is often limited.

Purpose of the Study:

  • To introduce the Penn Machine Learning Benchmarks (PMLB) v1.0, providing the largest collection of diverse, public benchmark datasets.
  • To offer a standardized, user-friendly interface for accessing and utilizing benchmark datasets.
  • To improve the evaluation of new machine learning and data science methods through enhanced accessibility and integration.

Main Methods:

  • Aggregation of a large number of diverse, public benchmark datasets.
  • Development of standardized, user-friendly interfaces for data access.
  • Integration with popular data science workflows and programming languages (Python and R).

Main Results:

  • PMLB v1.0 is the largest collection of diverse, public benchmark datasets available in one location.
  • Introduced critical improvements based on community feedback.
  • Provides Python and R interfaces for easy installation and use.

Conclusions:

  • PMLB facilitates standardized comparisons of machine learning and data science methods.
  • The v1.0 release significantly enhances the accessibility and usability of benchmark datasets.
  • PMLB supports reproducible research and accelerates the development of new algorithms.