Federated Analysis With Differential Privacy in Oncology Research: Longitudinal Observational Study Across Hospital Data Warehouses
View abstract on PubMed
Summary
This summary is machine-generated.Federated analytics combined with differential privacy enabled a real-world oncology study across multiple centers. This approach ensures data privacy while facilitating collaborative research on patient data.
Area Of Science
- Health Informatics
- Medical Oncology
- Data Privacy
Background
- Federated analytics allows statistical queries on remote health datasets without accessing raw data, addressing privacy and governance concerns.
- This approach is crucial for multi-center studies but remains underutilized in real-world settings.
- Combining federated analytics with differential privacy (DP) has not been previously explored in real-world studies.
Purpose Of The Study
- To deploy a federated architecture in a real-world clinical setting for a multicenter oncology study.
- To evaluate the practicality and utility of differential privacy (DP) as a privacy-enhancing technology within this federated framework.
Main Methods
- Established a federated architecture across three healthcare centers (Toulouse, Reims, Foch).
- Harmonized data within each center and performed statistical analyses using DataSHIELD.
- Implemented a novel open-source differential privacy package (dsPrivacy) for DataSHIELD.
Main Results
- Successfully conducted a multicenter study on metastatic non-small cell lung cancer patient data without centralizing raw information.
- Demonstrated the practicality of DataSHIELD for efficient, secure, multi-center data analysis.
- Showcased the feasibility of implementing differential privacy with acceptable privacy-utility trade-offs.
Conclusions
- The federated architecture successfully supported a real-world, multicenter oncology study with robust privacy protections.
- Differential privacy integration proved practical, offering a viable solution for enhancing data security in federated health research.
Related Concept Videos
Cancer survival analysis focuses on quantifying and interpreting the time from a key starting point, such as diagnosis or the initiation of treatment, to a specific endpoint, such as remission or death. This analysis provides critical insights into treatment effectiveness and factors that influence patient outcomes, helping to shape clinical decisions and guide prognostic evaluations. A cornerstone of oncology research, survival analysis tackles the challenges of skewed, non-normally...
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
Descriptive Statistics: These provide basic...
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
Noncompartmental analyses offer an alternative method for describing drug pharmacokinetics without relying on a specific compartmental model. In this approach, the drug's pharmacokinetics are assumed to be linear, with the terminal phase log-linear. This assumption allows for simplified analysis and interpretation of the drug's behavior in the body.
One important characteristic of noncompartmental analyses is that drug exposure increases proportionally with increasing doses. This...
Analysis of population pharmacokinetic data involves studying the behavior of drugs within diverse populations to understand their pharmacokinetic parameters. Traditional pharmacokinetic methods typically involve collecting samples from a few individuals and estimating these parameters. While these methods are commonly used, they have limitations in capturing the variability in drug response among individuals or heterogeneous populations. Population pharmacokinetics is employed to address these...
Sometimes we want to see how people change over time, as in studies of human development and lifespan. When we test the same group of individuals repeatedly over an extended period of time, we are conducting longitudinal research. Longitudinal research is a research design in which data-gathering is administered repeatedly over an extended period of time. For example, we may survey a group of individuals about their dietary habits at age 20, retest them a decade later at age 30, and then again...

