Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Issues And Trends In Healthcare Delivery System01:29

Issues And Trends In Healthcare Delivery System

5.6K
The issues and trends in healthcare delivery are constantly changing. The COVID-19 pandemic is one recent issue that wreaked havoc on healthcare systems, causing a shortage of healthcare workers, high demand for medicines and supplies, and increased medical expenditure due to a lack of insurance. Other issues include rising healthcare costs and care fragmentation.
Cost Containment
Payment for healthcare services has historically promoted adoption of costly and often unnecessary or inefficient...
5.6K
Health Information Technology and Healthcare Information System01:30

Health Information Technology and Healthcare Information System

798
Health Information Technology (HIT)
Health Information Technology, commonly called HIT, integrates advanced information systems and technology in healthcare settings. Its primary functions include:
798
Overview of Biostatistics in Health Sciences01:19

Overview of Biostatistics in Health Sciences

311
Biostatistics involves the application of statistical techniques to scientific research in health-related fields, including biology and public health. These techniques are essential for designing studies, collecting data, and analyzing it to draw meaningful conclusions. Given the complexity of biological processes, particularly in studies involving human subjects, biostatistical methods are crucial for effectively organizing and interpreting data that might otherwise obscure underlying patterns...
311
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

294
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
294
Statistical Software for Data Analysis and Clinical Trials01:12

Statistical Software for Data Analysis and Clinical Trials

485
Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...
485
Synthetic Biology02:55

Synthetic Biology

4.7K
Synthetic biology is an interdisciplinary science that involves using principles from disciplines such as engineering, molecular biology, cell biology, and systems biology. It involves remodeling existing organisms from nature or constructing completely new synthetic organisms for applications such as protein or enzyme production, bioremediation, value-added macromolecule production, and the addition of desirable traits to crops, to name a few.
Golden rice
Golden rice is a genetically modified...
4.7K
  1. Home
  2. Research Domains
  3. Information And Computing Sciences
  4. Data Management And Data Science
  5. Query Processing And Optimisation
  6. Preserving Information While Respecting Privacy Through An Information Theoretic Framework For Synthetic Health Data Generation

Preserving information while respecting privacy through an information theoretic framework for synthetic health data generation

Nadir Sella1, Florent Guinot2, Nikita Lagrange3

  • 1Institut Roche, Boulogne-Billancourt, France. nadirsella@gmail.com.

NPJ Digital Medicine
|January 22, 2025

Related Experiment Videos

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans
09:23

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

Published on: August 16, 2017

8.0K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.4K
Methodology for Establishing a Community-Wide Life Laboratory for Capturing Unobtrusive and Continuous Remote Activity and Health Data
11:21

Methodology for Establishing a Community-Wide Life Laboratory for Capturing Unobtrusive and Continuous Remote Activity and Health Data

Published on: July 27, 2018

8.2K

View abstract on PubMed

Summary
This summary is machine-generated.

This study introduces MIIC-SDG, a novel algorithm for generating high-quality, privacy-preserving synthetic medical data using a multivariate information framework. It also presents a new metric to evaluate the trade-off between data quality and privacy in synthetic data generation.

Area of Science:

  • Medical Informatics
  • Data Science
  • Biostatistics

Background:

  • Generating synthetic medical data is challenging due to privacy concerns.
  • Existing methods often fail to jointly evaluate data quality and privacy.
  • Assessing multivariate associations is crucial for accurate synthetic data.

Purpose of the Study:

  • Introduce a novel algorithm (MIIC-SDG) for synthetic data generation from electronic health records.
  • Propose a new metric (QPS) to quantify the quality-privacy trade-off.
  • Demonstrate the performance of MIIC-SDG against state-of-the-art methods.

Main Methods:

  • Developed MIIC-SDG based on multivariate information theory and Bayesian networks.
  • Proposed a Quality and Privacy Score (QPS) metric for evaluation.

Related Experiment Videos

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans
09:23

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

Published on: August 16, 2017

8.0K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.4K
Methodology for Establishing a Community-Wide Life Laboratory for Capturing Unobtrusive and Continuous Remote Activity and Health Data
11:21

Methodology for Establishing a Community-Wide Life Laboratory for Capturing Unobtrusive and Continuous Remote Activity and Health Data

Published on: July 27, 2018

8.2K
  • Validated MIIC-SDG on diverse clinical datasets.
  • Main Results:

    • MIIC-SDG effectively generates synthetic data preserving multivariate associations.
    • The QPS metric provides a quantitative measure of the quality-privacy balance.
    • MIIC-SDG demonstrates superior performance compared to existing methods.

    Conclusions:

    • MIIC-SDG offers a robust approach for generating high-fidelity, privacy-preserving synthetic medical data.
    • The QPS metric facilitates objective comparison of synthetic data generation techniques.
    • This work advances the field of secure and reliable health data synthesis.