Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Prediction Intervals01:03

Prediction Intervals

3.1K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
3.1K
Relative Risk01:12

Relative Risk

1.7K
Relative risk (RR) is a statistical measure commonly used in epidemiology to compare the likelihood of a particular event occurring between two groups. This metric is important for evaluating the relationship between exposure to a specific risk factor and the probability of a particular outcome. It plays a crucial role in medical research, public health studies, and risk assessment. Relative risk quantifies how much more (or less) likely an event is to occur in an exposed group compared to an...
1.7K
Survival Tree01:19

Survival Tree

369
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
369
Drug Concentration Versus Time Correlation01:15

Drug Concentration Versus Time Correlation

1.9K
The plasma drug concentration-time curve is a crucial tool in pharmacokinetics, representing the drug's concentration in plasma at different time intervals post-administration. This curve illustrates the drug's journey from absorption into the systemic circulation, distribution to body tissues, and eventual elimination through excretion or biotransformation.
Two pivotal parameters are the minimum effective concentration (MEC) and the minimum toxic concentration (MTC). The MEC is the...
1.9K
Introduction To Survival Analysis01:18

Introduction To Survival Analysis

710
Survival analysis is a statistical method used to study time-to-event data, where the "event" might represent outcomes like death, disease relapse, system failure, or recovery. A unique feature of survival data is censoring, which occurs when the event of interest has not been observed for some individuals during the study period. This requires specialized techniques to handle incomplete data effectively.
The primary goal of survival analysis is to estimate survival time—the time...
710
Kaplan-Meier Approach01:24

Kaplan-Meier Approach

530
The Kaplan-Meier estimator is a non-parametric method used to estimate the survival function from time-to-event data. In medical research, it is frequently employed to measure the proportion of patients surviving for a certain period after treatment. This estimator is fundamental in analyzing time-to-event data, making it indispensable in clinical trials, epidemiological studies, and reliability engineering. By estimating survival probabilities, researchers can evaluate treatment effectiveness,...
530

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Automated calculation of background parenchymal enhancement as a biomarker of treatment responses and recurrence-free survival in breast cancer.

Breast cancer research and treatment·2026
Same author

Review of Prognostic Significance of Quantitative BPE Measurements.

Diagnostics (Basel, Switzerland)·2026
Same author

Amyotrophic Lateral Sclerosis, the Endocannabinoid System, and Exogenous Cannabinoids: Current State and Clinical Implications.

Muscle & nerve·2025
Same author

Substrate Channeling in Compartmentalized Nanoreactors.

Macromolecules·2024
Same author

Current distribution monitoring enables quench and damage detection in superconducting fusion magnets.

Scientific reports·2022
Same author

Biomedical Literature Mining for Repurposing Laboratory Tests.

Methods in molecular biology (Clifton, N.J.)·2022
Same journal

HIV Transmission Dynamics in Greater Mexico City are Shaped by Dense Spatial Mixing.

Research square·2026
Same journal

A UCP1-IRES-Cre Knock-In Mouse Enables Specific Brown Adipocyte Targeting Without CNS Off-Target Expression.

Research square·2026
Same journal

Precision RNAi for Fibrodysplasia Ossificans Progressiva: a combinatorial, unimolecular, allele selective approach.

Research square·2026
Same journal

Perceptions of end-of-life care quality among bereaved closest contacts of community-dwelling older Australians: a cross-sectional survey of the ASPREE cohort.

Research square·2026
Same journal

Heavy-chain immune repertoire sequencing enables language-model prediction of antigen-specific antibodies.

Research square·2026
Same journal

25+ Years of TRPV4: From Discovery to Translational Horizons.

Research square·2026
See all related articles

Related Experiment Video

Updated: Jan 7, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.2K

MIMIC-IV-Ext-22MCTS: A 22 Million-Event Temporal Clinical Time-Series Dataset for Risk Prediction.

Jing Wang1, Xing Niu2, Juyong Kim3

  • 1National Library of Medicine, Bethesda, MD, USA.

Research Square
|December 31, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces MIMIC-IV-Ext-22MCTS, a large dataset of clinical time series events extracted from medical notes. This dataset enhances machine learning models for improved healthcare applications and clinical risk prediction.

Keywords:
Clinical eventClinical trialContextual BM25Contextual semantic searchLarge language modelMIMICNatural language processingQuestion answeringTemporal informationTime series

More Related Videos

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data
10:46

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

10.9K
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

7.5K

Related Experiment Videos

Last Updated: Jan 7, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.2K
A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data
10:46

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

10.9K
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

7.5K

Area of Science:

  • Medical Informatics
  • Machine Learning
  • Natural Language Processing

Background:

  • High-quality time series clinical events are essential for reliable machine learning-based healthcare risk prediction.
  • Existing datasets like MIMIC-IV-Note contain unstructured discharge summaries, posing challenges for event extraction and temporal information retrieval.

Purpose of the Study:

  • To create a comprehensive dataset of clinical time series events (MIMIC-IV-Ext-22MCTS) with extracted temporal information.
  • To develop a novel framework for processing lengthy discharge summaries and inferring timestamps for clinical events.
  • To improve the performance of machine learning models in healthcare applications through fine-tuning on the new dataset.

Main Methods:

  • Developed a framework to break down long discharge summaries into manageable text chunks.
  • Utilized contextual BM25 and semantic search to identify relevant text chunks containing clinical events.
  • Employed carefully designed prompts with the Llama-3.1-8B model to extract or infer event timestamps.
  • Extracted 22,588,586 clinical events and their associated temporal information from MIMIC-IV-Note.

Main Results:

  • The MIMIC-IV-Ext-22MCTS dataset provides informative and transparent clinical time series data.
  • Fine-tuning BERT on this dataset resulted in a 10% accuracy improvement in medical question answering and a 3% improvement in clinical trial matching.
  • GPT-2 models fine-tuned on the dataset demonstrated more clinically reliable results for clinical questions.

Conclusions:

  • The proposed framework effectively extracts clinical events and temporal information from unstructured discharge summaries.
  • The MIMIC-IV-Ext-22MCTS dataset significantly enhances the performance of machine learning models in various healthcare tasks.
  • This work facilitates the development of more accurate and reliable clinical risk prediction tools.