Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

27
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
27
Multiple Regression01:25

Multiple Regression

2.9K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
2.9K
Biostatistics: Overview01:20

Biostatistics: Overview

220
Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...
220
Regression Analysis01:11

Regression Analysis

5.6K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.6K
Statistical Software for Data Analysis and Clinical Trials01:12

Statistical Software for Data Analysis and Clinical Trials

493
Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...
493
Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

97
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
97

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Preferences for HIV testing among partners of transgender women in Lima, Peru: a discrete choice experiment.

Journal of acquired immune deficiency syndromes (1999)·2026
Same author

Assessing VirScan serosurvey epitope profiling variability between in-clinic venous blood draw and capillary blood self-sampling device.

Microbiology spectrum·2026
Same author

Influence of B cell-lineage targeted CAR-T cell therapy on humoral immunity and vaccine-induced antibody response.

Nature communications·2026
Same author

Cytomegalovirus DNAemia in Hospitalized Adults With SARS-CoV-2 Infection Requiring Supplemental Oxygen: Virologic and Clinical Characteristics and Association With Outcomes.

The Journal of infectious diseases·2026
Same author

Sexual behaviors and access to HIV services during the COVID-19 pandemic among cisgender men who have sex with men in Lima, Peru.

BMC public health·2025
Same author

CD4-mimetics sensitize HIV-infected cells to ADCC mediated by plasma from persons with early-stage HIV-1 infection.

Journal of virology·2025
Same journal

Practical Considerations for Variable Screening in the Super Learner.

The New England Journal of Statistics in Data Science·2025
Same journal

Meta-analysis of Censored Adverse Events.

The New England Journal of Statistics in Data Science·2025
Same journal

Effect of model space priors on statistical inference with model uncertainty.

The New England Journal of Statistics in Data Science·2024
Same journal

Effects of stopping criterion on the growth of trees in regression random forests.

The New England Journal of Statistics in Data Science·2023
Same journal

Modeling Multivariate Spatial Dependencies Using Graphical Models.

The New England Journal of Statistics in Data Science·2023
See all related articles

Related Experiment Video

Updated: Jun 6, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K

Biomarker Panel Development Using Logic Regression in the Presence of Missing Data.

Ying Huang1, Sayan Dasgupta1

  • 1Vaccine & Infectious Disease Division, Fred Hutchinson Cancer Center, US.

The New England Journal of Statistics in Data Science
|December 3, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces novel logic regression methods for early cancer detection using biomarker panels, effectively handling missing data. The approach outperforms existing methods in classifying pancreatic cysts and predicting malignancy.

Keywords:
62P10BiomarkerLogic regressionMissing data

More Related Videos

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.6K
Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

954

Related Experiment Videos

Last Updated: Jun 6, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.6K
Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

954

Area of Science:

  • Biostatistics
  • Bioinformatics
  • Computational Biology

Background:

  • Developing accurate biomarker panels for early cancer detection is crucial.
  • Missing data in biomarker studies presents significant analytical challenges.
  • Pancreatic cysts require reliable methods for subtype classification and malignancy prediction.

Purpose of the Study:

  • To develop flexible and parsimonious biomarker combinations for early cancer detection.
  • To address variable missingness at random using multiple imputation.
  • To construct interpretable logic rules for biomarker panel selection.

Main Methods:

  • Logic regression for feature selection and rule construction.
  • Multiple imputation framework to handle missing data.
  • Ensemble and single decision trees for classification.

Main Results:

  • Proposed methods demonstrate superior performance over complete-case and single imputation.
  • Effective identification of biomarker panels for pancreatic cyst classification.
  • Successful prediction of malignant potential in pancreatic cysts.

Conclusions:

  • Logic regression with multiple imputation offers a robust approach for biomarker panel development.
  • The methods provide interpretable decision trees for clinical application.
  • This strategy enhances early cancer detection and risk stratification.