Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Feb 22, 2026

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates
08:56

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

2.9K

Scalable Iterative Classification for Sanitizing Large-Scale Datasets.

Bo Li1, Yevgeniy Vorobeychik1, Muqun Li1

  • 1Vanderbilt University.

IEEE Transactions on Knowledge and Data Engineering
|September 26, 2017
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

Aggregates Classification01:29

Aggregates Classification

1.1K
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Identification of the gene cluster for the dithiolopyrrolone antibiotic holomycin in Streptomyces clavuligerus.

Proceedings of the National Academy of Sciences of the United States of America·2010
Same author

Safety evaluation of tea (Camellia sinensis (L.) O. Kuntze) flower extract: assessment of mutagenicity, and acute and subchronic toxicity in rats.

Journal of ethnopharmacology·2010
Same author

Influences of soil properties and leaching on nickel toxicity to barley root elongation.

Ecotoxicology and environmental safety·2010
Same author

Effects of CO2 insufflation on cerebrum during endoscopic thyroidectomy in a porcine model.

Surgical endoscopy·2010
Same author

Plants' use of different nitrogen forms in response to crude oil contamination.

Environmental pollution (Barking, Essex : 1987)·2010
Same author

An unusual ten-connected self-penetrating metal-organic framework based on tetranuclear cobalt clusters.

Chemical communications (Cambridge, England)·2010
Same journal

STORM: Exploiting Spatiotemporal Continuity for Trajectory Similarity Learning in Road Networks.

IEEE transactions on knowledge and data engineering·2026
Same journal

Hierarchical Active Learning with Label Proportions on Data Regions.

IEEE transactions on knowledge and data engineering·2025
Same journal

Data Synthesis Reinvented: Preserving Missing Patterns for Enhanced Analysis.

IEEE transactions on knowledge and data engineering·2025
Same journal

Cafe: Improved Federated Data Imputation by Leveraging Missing Data Heterogeneity.

IEEE transactions on knowledge and data engineering·2025
Same journal

A Neural Database for Answering Aggregate Queries on Incomplete Relational Data.

IEEE transactions on knowledge and data engineering·2024
Same journal

Weakly Supervised Concept Map Generation through Task-Guided Graph Translation.

IEEE transactions on knowledge and data engineering·2024
See all related articles

This study introduces a novel game model for data sanitization, balancing data utility and privacy risks. A fast algorithm effectively removes sensitive information, preserving over 93% of original data with minimal risk of identifier leakage.

Area of Science:

  • Computer Science
  • Data Privacy
  • Machine Learning

Background:

  • Ubiquitous computing generates vast personal data, necessitating data sharing while protecting privacy.
  • Current machine learning methods for de-identifying data are imperfect, risking sensitive information leakage.
  • Balancing data utility with the risk of adversary-driven identifier discovery is crucial.

Purpose of the Study:

  • To model data sanitization as a game between a data publisher and an adversary.
  • To develop an efficient algorithm for publishers to minimize the risk of leaked identifiers.
  • To quantify the trade-off between data utility and privacy protection.

Main Methods:

  • A game-theoretic model was developed for data sanitization.
  • A fast iterative greedy algorithm was introduced for the publisher's strategy.
Keywords:
Privacy preservinggame theoryweak structured data sanitization

Related Experiment Videos

Last Updated: Feb 22, 2026

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates
08:56

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

2.9K
  • The algorithm's effectiveness was evaluated on five text datasets against state-of-the-art learning algorithms.
  • Main Results:

    • The proposed algorithm ensures low utility for resource-limited adversaries.
    • Over 93% of original data was shared across five text datasets.
    • Virtually no automatically identifiable sensitive instances remained for advanced learning algorithms.

    Conclusions:

    • The developed algorithm effectively balances data utility and privacy.
    • The iterative greedy approach provides a practical solution for data sanitization.
    • This method significantly reduces the risk of identifier leakage in shared datasets.