Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

83
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
83
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

115
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
115
Censoring Survival Data01:09

Censoring Survival Data

55
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
55
Kaplan-Meier Approach01:24

Kaplan-Meier Approach

74
The Kaplan-Meier estimator is a non-parametric method used to estimate the survival function from time-to-event data. In medical research, it is frequently employed to measure the proportion of patients surviving for a certain period after treatment. This estimator is fundamental in analyzing time-to-event data, making it indispensable in clinical trials, epidemiological studies, and reliability engineering. By estimating survival probabilities, researchers can evaluate treatment effectiveness,...
74
McNemar's Test01:23

McNemar's Test

118
McNemar's Test is a nonparametric statistical test used to determine if there is a significant difference in proportions between two related groups when the outcome is binary (e.g., yes/no, success/failure). It is beneficial when we have paired data, such as pre-test/post-test designs, where the same subjects are measured under two different conditions. The test is named after the statistician Quinn McNemar, who introduced it in 1947. It is commonly used in situations where subjects are...
118
Crossover Experiments01:16

Crossover Experiments

2.7K
Crossover experiments, also called the repeated-measurements design, is a study design in which all experimental units are exposed to all treatments in different periods. Crossover experiments are generally used in psychology, the pharmaceutical industry, agriculture, and medicine.
Crossover designs are performed even with smaller sample sizes since the samples can act as their controls. These are better than simple randomized trials since patients are exposed to all the treatments.
2.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Mid-treatment MRI-based tumor volume reduction rate as a continuous prognostic factor after chemoradiation for cervical cancer: development and two-center internal-external validation.

Journal of radiation research·2026
Same author

Adverse events in exercise interventions among older adults with frailty: a study protocol of systematic review and meta-analysis.

BMJ open·2026
Same author

Association of the Fibrosis-4 Index With Indices of Atherosclerosis in Patients With Type 2 Diabetes: An Exploratory Subanalysis of a Prospective Observational Cohort Study.

Journal of diabetes research·2026
Same author

Proton Beam Therapy Versus Photon Radiotherapy for Pediatric Neuroblastoma: A Systematic Review and Meta-Analysis (TRP-2025 Neuroblastoma).

Cancer medicine·2026
Same author

Associations of Physical Activity With Incidence of and Recovery From Knee Pain Among Older Adults: A 3-Year Longitudinal Study.

Journal of aging and physical activity·2026
Same author

Pre-Treatment With Potassium Iodide and Higher Radioiodine Dose Are the Risk Factors of Mid-to-Late-Phase Transient Thyrotoxicosis Following Radioiodine Therapy for Graves' Disease in Japanese Patients: A Retrospective Cohort Study.

Clinical endocrinology·2026
Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026
Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026
Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026
Same journal

Beyond Fixed Thresholds: Optimizing Summaries of Wearable Device Data via Piecewise Linearization of Quantile Functions.

Statistics in medicine·2026
Same journal

A Causal Framework for Evaluating the Total Effect of Strategies Aiming to Expand Screening and to Improve Outcomes.

Statistics in medicine·2026
Same journal

Causal Effects on Nonterminal Event Time With Application to Antibiotic Usage and Future Resistance.

Statistics in medicine·2026
See all related articles

Related Experiment Video

Updated: May 24, 2025

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
20:24

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Published on: January 31, 2014

16.4K

Statistical Inference for a Two-Stage Adaptive Seamless Design Using Different Binary Endpoints.

Ryota Ishii1, Kenichi Takahashi2, Kazushi Maruo1

  • 1Department of Biostatistics, Institute of Medicine, University of Tsukuba, Ibaraki, Japan.

Statistics in Medicine
|March 6, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces new statistical methods, conditional mean-adjusted estimator (CMAE) and uniformly minimum variance conditional unbiased estimator (UMVCUE), to reduce bias in adaptive seamless trial designs. These methods improve treatment effect estimation in drug development.

Area of Science:

  • Biostatistics and Clinical Trial Methodology
  • Pharmaceutical Development and Adaptive Seamless Design
  • Statistical Inference for Binary Endpoints

Background:

Adaptive seamless design represents a sophisticated and increasingly popular paradigm in pharmaceutical research that merges the exploratory Phase II and confirmatory Phase III stages of clinical trials into a single, continuous protocol. Prior research has shown that this integrated approach can significantly reduce the total time and financial resources required for drug development by eliminating the traditional hiatus between distinct trial phases. In the initial stage of these structures, investigators typically evaluate multiple treatment arms or dosages to identify the most promising candidates for further investigation based on preliminary data. The subsequent phase then focuses on a rigorous comparison between the selected treatment groups and a control group to establish definitive clinical efficacy and safety profiles. A common challenge arises when the selection process utilizes a short-term binary endpoint, such as a rapid biomarker response, while the final evaluation relies on a different, long-term binary outcome like overall survival. This absence of evidence motivated a deeper investigation into how these differing endpoints influence the statistical validity and potential bias of the final treatment effect estimates.

Keywords:
bias adjustmentconfidence intervalseamless phase II/III designshort‐term endpoint

Frequently Asked Questions

More Related Videos

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences
00:08

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences

Published on: September 4, 2019

6.9K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K

Related Experiment Videos

Last Updated: May 24, 2025

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
20:24

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Published on: January 31, 2014

16.4K
A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences
00:08

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences

Published on: September 4, 2019

6.9K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K

Purpose Of The Study:

This investigation develops specialized statistical tools to correct for the inherent upward bias introduced by treatment selection in two-stage adaptive seamless designs. The scientists addressed a specific and complex scenario where the criteria for choosing a treatment arm differ significantly from the primary binary endpoint used for the final efficacy comparison. The research effort aimed to replace the conventional maximum likelihood estimator, which often fails to account for the conditional nature of the second-stage data following a selection event. By focusing on binary endpoints, the work sought to provide a robust and reliable framework for trials where outcomes are measured as discrete successes or failures. The objective included the construction of precise confidence intervals using the Clopper-Pearson method to enhance the overall inferential quality of the adaptive trial results. The project intended to identify which specific combination of analytical frameworks and statistical tests provides the most accurate, unbiased, and well-controlled results for clinical practitioners.

Main Methods:

The investigators proposed the conditional mean-adjusted estimator (CMAE) as a primary tool for reducing the systematic upward bias typically observed in adaptive seamless designs. They further developed the uniformly minimum variance conditional unbiased estimator (UMVCUE) to provide a more mathematically precise and unbiased assessment of the treatment effect in this setting. The statistical architecture incorporated both the exact test and the mid-p test to evaluate the significance of the findings during the second stage of the trial. To determine the performance and reliability of these techniques, the team designed and executed comprehensive simulation studies covering a wide range of trial parameters and scenarios. These simulations evaluated six distinct inference methods formed by pairing the three computational models—Maximum Likelihood Estimator (MLE), CMAE, and UMVCUE—with the two different statistical tests. The methodology specifically accounted for the transition from a short-term selection endpoint in the first stage to a long-term binary endpoint for the final comparison in the second stage.

Main Results:

Simulation data revealed that the conventional Maximum Likelihood Estimator (MLE) consistently produced a significant upward bias in the estimated treatment effect during the second stage of the trial. The proposed Conditional Mean-Adjusted Estimator (CMAE) and the Uniformly Minimum Variance Conditional Unbiased Estimator (UMVCUE) both achieved a substantial and significant reduction in this selection-induced inaccuracy. The exact test proved to be overly conservative in this context, frequently resulting in type-I error rates that were considerably lower than the specified nominal significance level. In contrast, the mid-p test showed superior performance by yielding type-I error rates that remained consistently close to the desired nominal level across various simulations. The results indicated that the bias in the MLE was a direct and measurable consequence of the treatment selection process occurring after the first stage. The study found that the combination of the mid-p test with either the CMAE or the UMVCUE provided the most reliable and accurate statistical inference for these adaptive designs.

Conclusions:

The authors recommend the implementation of the CMAE or UMVCUE in conjunction with the mid-p test for analyzing two-stage adaptive seamless designs using binary endpoints. These findings highlight the functional necessity of using conditional estimators to ensure the accuracy and integrity of drug efficacy claims in modern clinical research. The work shows that traditional maximum likelihood methods are prone to overestimating therapeutic benefits when treatment selection is involved in the trial design. Adopting these new statistical tools can improve the efficiency of drug development by providing more realistic and unbiased assessments of a drug's true clinical impact. The researchers conclude that the mid-p test offers a better balance between statistical sensitivity and error control than the more conservative exact test. This work establishes a validated and rigorous statistical foundation for trials that utilize different binary endpoints for selection and final efficacy comparison.

According to the study's authors, selecting treatment groups based on first-stage performance leads to an upward bias in the conventional Maximum Likelihood Estimator (MLE). This occurs because the second-stage data is conditional on the selection criteria, which the standard MLE fails to account for.

The researchers found that the exact test was conservative, while the mid-p test yielded results close to the nominal level for type-I error rates. This suggests the mid-p test provides more accurate significance testing when evaluating long-term binary endpoints after a selection phase.

The study used these estimators to address the systematic bias inherent in the Maximum Likelihood Estimator (MLE) during adaptive trials. The CMAE and UMVCUE specifically adjust for the selection process, providing more accurate point estimates for the treatment effect in the second stage.

The findings are confined to two-stage adaptive seamless designs where treatment selection is based on a short-term binary endpoint and comparison uses a long-term binary endpoint. The authors specifically focused on addressing the bias resulting from this selection-to-comparison transition in binary outcomes.

The study's authors propose that researchers should utilize either the CMAE or UMVCUE combined with the mid-p test. This recommendation aims to ensure unbiased treatment effect estimation and appropriate type-I error control in trials integrating Phase II and Phase III.