Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

127
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
127
Introduction To Survival Analysis01:18

Introduction To Survival Analysis

236
Survival analysis is a statistical method used to study time-to-event data, where the "event" might represent outcomes like death, disease relapse, system failure, or recovery. A unique feature of survival data is censoring, which occurs when the event of interest has not been observed for some individuals during the study period. This requires specialized techniques to handle incomplete data effectively.
The primary goal of survival analysis is to estimate survival time—the time...
236
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

186
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
186
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
Truncation in Survival Analysis01:09

Truncation in Survival Analysis

208
Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are...
208
Biostatistics: Overview01:20

Biostatistics: Overview

241
Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...
241

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Gradient boosting-based discrete failure time model for selecting time-varying effects and interactions.

Lifetime data analysis·2026
Same author

α/β for Hepatocellular Carcinoma Tumors Treated With Radiation Therapy.

International journal of radiation oncology, biology, physics·2026
Same author

KULLBACK-LEIBLER-BASED DISCRETE FAILURE TIME MODELS FOR INTEGRATION OF PUBLISHED PREDICTION MODELS WITH NEW TIME-TO-EVENT DATASET.

The annals of applied statistics·2026
Same author

Impact of neighbor-level disadvantage on survival outcomes in head and neck squamous cell carcinoma.

Cancer causes & control : CCC·2025
Same author

A Multiple Imputation Approach for the Cumulative Incidence, with Implications for Variance Estimation.

The American statistician·2025
Same author

Revisiting the hazards of hazard ratios through simulations and case studies.

European journal of epidemiology·2025
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
Same journal

A Bayesian phase I/II platform design with data augmentation accounting for delayed outcomes.

Biometrics·2026
See all related articles
  1. Home
  2. Robust Data Integration From Multiple External Sources For Generalized Linear Models With Binary Outcomes.
  1. Home
  2. Robust Data Integration From Multiple External Sources For Generalized Linear Models With Binary Outcomes.

Related Experiment Video

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K

Robust data integration from multiple external sources for generalized linear models with binary outcomes.

Kyuseong Choi1, Jeremy M G Taylor2, Peisong Han2

  • 1Department of Statistics and Data Science, Cornell University, Ithaca, NY 14853, United States.

Biometrics
|February 16, 2024

View abstract on PubMed

Summary
This summary is machine-generated.

This study introduces an adaptive penalization method to improve generalized linear model (GLM) parameter estimation using external study data. The novel approach enhances efficiency and robustness, outperforming direct maximum likelihood estimation.

Keywords:
adaptive weightsgeneralized information criterionpenalizationratio of parametersrobustness

More Related Videos

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.2K

Related Experiment Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.2K

Area of Science:

  • Statistical modeling
  • Biostatistics
  • Machine learning

Background:

  • Generalized linear models (GLMs) are widely used for analyzing various data types.
  • Integrating external study data can improve parameter estimation in internal studies.
  • Challenges exist in effectively leveraging heterogeneous external summary information.

Purpose of the Study:

  • To develop an adaptive penalization method for GLM parameter estimation.
  • To enhance estimation efficiency and robustness by incorporating external GLM summary information.
  • To provide a computationally efficient approach for complex statistical modeling.

Main Methods:

  • An adaptive penalization technique is proposed, utilizing external parameter estimates from GLMs.
  • The method exploits relationships between GLM parameters and downweights incompatible external data.
  • Computational efficiency is achieved through adaptive weights and information criteria for tuning parameter selection.
  • Main Results:

    • Simulation studies demonstrate the proposed estimator's robustness to population distribution heterogeneity.
    • The method shows significant efficiency gains compared to direct maximum likelihood estimation.
    • The approach was successfully applied to a prostate cancer prediction model using external data.

    Conclusions:

    • The adaptive penalization method effectively integrates external GLM summary information for improved internal study parameter estimation.
    • The proposed technique offers a robust, efficient, and computationally feasible solution for complex statistical modeling tasks.
    • This method holds promise for enhancing predictive models in various scientific domains, including medical research.