Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Prediction Intervals01:03

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
The...
Survival Tree01:19

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a survival tree begins...
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Development of the authentication and authorization processes for the iAgree portal, a platform for patient-controlled data sharing across health systems.

JAMIA open·2026
Same author

Foundation Model-Guided Synthetic EHR Release: Performance Enhancement with Privacy Preservation.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

Large Models for Small Tables: Adapting Tabular Foundation Models to EHR Data.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

Memorization in large language models in medicine prevalence characteristics and implications.

Nature communications·2026
Same author

Privacy-enhancing sequential learning under heterogeneous selection bias in multi-site electronic health records data.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

Large Language Models for Summarizing Advance Care Planning Information From Goals of Care Notes in the EHR.

Learning health systems·2026
Same journal

Sensitivity Analyses of a Scoring System for a Contraception Decision Aid.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Improving electronic health record processing of large language models via retrieval-augmented generation: A case study on dietary supplements.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Developing a User-Centered Mobile Application Prototype: Bridging Lower-Limb Fracture Care from Skilled Nursing Facility and Back to the Community.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Automating Adjudication of Cardiovascular Events Using Large Language Models.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Predictive Factors and State-Level Barriers to Postpartum Birth Control Usage in the United States: Insights from PRAMS Phase 8.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
See all related articles

Related Experiment Videos

Improving predictions in imbalanced data using Pairwise Expanded Logistic Regression.

Xiaoqian Jiang1, Robert El-Kareh, Lucila Ohno-Machado

  • 1Division of Biomedical Informatics, University of California, San Diego, La Jolla, CA, USA. x1jiang@ucsd.edu

AMIA ... Annual Symposium Proceedings. AMIA Symposium
|December 24, 2011
PubMed
Summary
This summary is machine-generated.

This study introduces Pairwise Expanded Logistic Regression to address imbalanced medical datasets. The novel method improves classification accuracy and discrimination for rare events, outperforming existing techniques.

Related Experiment Videos

Area of Science:

  • Machine Learning
  • Medical Data Analysis
  • Bioinformatics

Background:

  • Medical classification tasks frequently encounter imbalanced datasets, where rare events are critical.
  • Standard algorithms like Logistic Regression (LR) and Support Vector Machines (SVM) struggle with imbalanced data, leading to poor discrimination.
  • Existing strategies for imbalanced training data are often ineffective, compromising model performance.

Purpose of the Study:

  • To propose a novel approach for estimating class memberships in imbalanced medical datasets.
  • To enhance the discrimination capabilities of classification models when dealing with rare but significant events.
  • To introduce Pairwise Expanded Logistic Regression (PELR) as a solution for imbalanced data challenges.

Main Methods:

  • Developed a new method, Pairwise Expanded Logistic Regression (PELR), to estimate class memberships.
  • Focused on evaluating pairwise relationships within the training data to improve classification.
  • Tested PELR on two imbalanced datasets to assess its effectiveness.

Main Results:

  • Pairwise Expanded Logistic Regression demonstrated improved discrimination compared to existing methods.
  • The proposed method achieved higher accuracy on imbalanced datasets.
  • PELR showed significant promise in addressing the challenges posed by rare medical events.

Conclusions:

  • Pairwise Expanded Logistic Regression offers a promising solution for medical classification problems with imbalanced datasets.
  • The method effectively improves model discrimination and accuracy for rare event detection.
  • PELR represents a valuable advancement in handling imbalanced data for medical machine learning applications.