Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Software for Data Analysis and Clinical Trials01:12

Statistical Software for Data Analysis and Clinical Trials

Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...
Data Collection I01:30

Data Collection I

Data collection gathers information needed to make accurate judgments about a patient's present condition. During a health history interview, subjective data is collected from the patient, their caregivers, or family members, and objective data is collected through observations and physical assessment. Patients are the primary source of subjective data. Thus information gathered from patients through interviews, observations, and physical examination is primary data. Secondary sources of data...
Clinical Trials01:16

Clinical Trials

Clinical trials are prospective experimental studies conducted on humans to determine the safety and efficacy of treatments, drugs, diet methods, and medical devices. Using statistics in clinical trials enables researchers to derive reasonable and accurate conclusions from the collected data, allowing them to make wise decisions in uncertain situations. In medical research, statistical methods are crucial for preventing errors and bias.
There are four phases in a clinical trial. A phase one...
Types of Biopharmaceutical Studies: Controlled and Non-Controlled Approaches01:23

Types of Biopharmaceutical Studies: Controlled and Non-Controlled Approaches

Biopharmaceutical studies constitute a vital field aiming to enhance drug delivery methods and refine therapeutic approaches, drawing upon diverse interdisciplinary knowledge. In research methodologies, the choice between controlled and non-controlled studies significantly influences the study's reliability and accuracy.
Non-controlled studies, commonly employed for initial exploration, lack a control group, rendering them susceptible to biases and external influences. In contrast, controlled...
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
Data Collection II01:29

Data Collection II

The nursing history captures and records the patient's health status, so that a care plan evolves to meet the patient's individual needs. The nursing health history is a part of the initial assessment. A comprehensive history covers all health dimensions and plays a significant role in the assessment process. A comprehensive history includes the patient's biographical information, reasons for seeking health care, expectations, present and past health history, medications, and family,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Measuring the Quality of Datasets: Development of the IDEFIM Indicator Set for Empirical Health Research.

Journal of medical Internet research·2026
Same author

A Data-Driven Visit Windowing Approach Applied to Cochlear Implant Follow-Up Data.

Studies in health technology and informatics·2026
Same author

Visualization of Clinical Case Similarities in Pediatric Cardiology Using Structured Case Conference Reports.

Studies in health technology and informatics·2026
Same author

Factors Influencing Hearing Preservation in Cochlear Implant Patients: A Predictive Modelling Approach.

Studies in health technology and informatics·2025
Same author

On the Harmonisation of Time Series Data for the Optimisation of Machine Learning Using the Example of Rejection Prediction After Kidney Transplantation.

Studies in health technology and informatics·2025
Same author

Visualizing Study Profiles Using the GMDS Competency Catalog for Medical Informatics Programs.

Studies in health technology and informatics·2024
Same journal

The Essential Components and Critical Conditions for Success in a Learning Health System in Oncology.

Studies in health technology and informatics·2026
Same journal

Use of Artificial Intelligence in Screening for Adolescent Idiopathic Scoliosis: A Scoping Review.

Studies in health technology and informatics·2026
Same journal

Movement Related Biomechanics in Adolescent Idiopathic Scoliosis: A Review of Reviews.

Studies in health technology and informatics·2026
Same journal

The Impact of Surgical Correction of Adolescent Idiopathic Scoliosis Using Posterior Spinal Fusion on Selected Radiological Parameters and Respiratory Function.

Studies in health technology and informatics·2026
Same journal

Acute Effect of Physio-logic® Exercises on Muscle Tone and Stiffness in Adolescent Idiopathic Scoliosis Patients: A Preliminary Study.

Studies in health technology and informatics·2026
Same journal

Effects of Integrated Music and Occupational Therapy on Motor and Autonomic Function in Children with Neurogenic Scoliosis.

Studies in health technology and informatics·2026
See all related articles

Related Experiment Video

Updated: May 24, 2026

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

Bridging Routine Data and Clinical Research: A Structured Approach to Data Preprocessing.

Matthias Katzensteiner1, Darian Liehr1, Oliver J Bott1

  • 1Data|H Institute for Applied Data Science, Faculty III - Media, Information and Design, University of Applied Sciences and Arts Hanover, Germany.

Studies in Health Technology and Informatics
|May 23, 2026
PubMed
Summary
This summary is machine-generated.

Routine clinical data is often messy, hindering research. Our new framework structures this data, making it usable for statistical analysis and machine learning in clinical research.

Keywords:
Clinical routine dataData preprocessingMachine learningReal-world evidenceTime series harmonisation

More Related Videos

TBase - an Integrated Electronic Health Record and Research Database for Kidney Transplant Recipients
09:00

TBase - an Integrated Electronic Health Record and Research Database for Kidney Transplant Recipients

Published on: April 13, 2021

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Related Experiment Videos

Last Updated: May 24, 2026

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

TBase - an Integrated Electronic Health Record and Research Database for Kidney Transplant Recipients
09:00

TBase - an Integrated Electronic Health Record and Research Database for Kidney Transplant Recipients

Published on: April 13, 2021

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Area of Science:

  • Health Informatics
  • Clinical Data Management
  • Biomedical Data Science

Background:

  • Routine clinical data presents challenges for research due to its heterogeneous and irregular nature.
  • Secondary data use is crucial for advancing clinical research and generating real-world evidence.
  • Existing methods often struggle to harmonize diverse clinical datasets for analytical purposes.

Purpose of the Study:

  • To present a structured three-step preprocessing framework for transforming routine healthcare data into research-compatible datasets.
  • To address the challenges of data heterogeneity and irregularity in secondary data use.
  • To enable the generation of high-quality datasets for statistical analysis and machine learning in clinical research.

Main Methods:

  • A three-step preprocessing framework involving recursive event classification, context-sensitive annotation, and time series harmonization.
  • Construction of indication-specific timelines and cohort-wide comparable data.
  • Application of the framework to a real-world use case in kidney transplant research.

Main Results:

  • Successfully transformed heterogeneous routine healthcare data into structured, research-compatible datasets.
  • Generated indication-specific timelines and cohort-wide comparable data for kidney transplant research.
  • Demonstrated the suitability of the generated datasets for statistical analysis and machine learning.

Conclusions:

  • The proposed framework effectively enhances the usability of routine clinical data for research purposes.
  • This methodology provides a transferable foundation for improving real-world evidence generation.
  • The structured datasets facilitate advanced analytical approaches, including machine learning, in clinical research.