Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Video

Updated: Feb 22, 2026

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

Scalable Iterative Classification for Sanitizing Large-Scale Datasets.

Bo Li¹, Yevgeniy Vorobeychik¹, Muqun Li¹

¹Vanderbilt University.

IEEE Transactions on Knowledge and Data Engineering

|September 26, 2017

Summary

This summary is machine-generated.

Related Concept Videos

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Identification of the gene cluster for the dithiolopyrrolone antibiotic holomycin in Streptomyces clavuligerus.

Proceedings of the National Academy of Sciences of the United States of America·2010

Same author

Safety evaluation of tea (Camellia sinensis (L.) O. Kuntze) flower extract: assessment of mutagenicity, and acute and subchronic toxicity in rats.

Journal of ethnopharmacology·2010

Same author

Influences of soil properties and leaching on nickel toxicity to barley root elongation.

Ecotoxicology and environmental safety·2010

Same author

Effects of CO2 insufflation on cerebrum during endoscopic thyroidectomy in a porcine model.

Surgical endoscopy·2010

Same author

Plants' use of different nitrogen forms in response to crude oil contamination.

Environmental pollution (Barking, Essex : 1987)·2010

Same author

An unusual ten-connected self-penetrating metal-organic framework based on tetranuclear cobalt clusters.

Chemical communications (Cambridge, England)·2010

Same journal

STORM: Exploiting Spatiotemporal Continuity for Trajectory Similarity Learning in Road Networks.

IEEE transactions on knowledge and data engineering·2026

Same journal

Hierarchical Active Learning with Label Proportions on Data Regions.

IEEE transactions on knowledge and data engineering·2025

Same journal

Data Synthesis Reinvented: Preserving Missing Patterns for Enhanced Analysis.

IEEE transactions on knowledge and data engineering·2025

Same journal

Cafe: Improved Federated Data Imputation by Leveraging Missing Data Heterogeneity.

IEEE transactions on knowledge and data engineering·2025

Same journal

A Neural Database for Answering Aggregate Queries on Incomplete Relational Data.

IEEE transactions on knowledge and data engineering·2024

Same journal

Weakly Supervised Concept Map Generation through Task-Guided Graph Translation.

IEEE transactions on knowledge and data engineering·2024

See all related articles

This study introduces a novel game model for data sanitization, balancing data utility and privacy risks. A fast algorithm effectively removes sensitive information, preserving over 93% of original data with minimal risk of identifier leakage.

Area of Science:

Computer Science
Data Privacy
Machine Learning

Background:

Ubiquitous computing generates vast personal data, necessitating data sharing while protecting privacy.
Current machine learning methods for de-identifying data are imperfect, risking sensitive information leakage.
Balancing data utility with the risk of adversary-driven identifier discovery is crucial.

Purpose of the Study:

To model data sanitization as a game between a data publisher and an adversary.
To develop an efficient algorithm for publishers to minimize the risk of leaked identifiers.
To quantify the trade-off between data utility and privacy protection.

Main Methods:

A game-theoretic model was developed for data sanitization.
A fast iterative greedy algorithm was introduced for the publisher's strategy.

Keywords:

Privacy preserving game theory weak structured data sanitization

Related Experiment Videos

Last Updated: Feb 22, 2026

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

The algorithm's effectiveness was evaluated on five text datasets against state-of-the-art learning algorithms.

Main Results:

The proposed algorithm ensures low utility for resource-limited adversaries.
Over 93% of original data was shared across five text datasets.
Virtually no automatically identifiable sensitive instances remained for advanced learning algorithms.

Conclusions:

The developed algorithm effectively balances data utility and privacy.
The iterative greedy approach provides a practical solution for data sanitization.
This method significantly reduces the risk of identifier leakage in shared datasets.