Injecting utility into anonymized datasets

Daniel Kifer, Johannes Gehrke

Research output: Chapter in Book/Report/Conference proceedingConference contribution

183 Citations (Scopus)

Abstract

Limiting disclosure in data publishing requires a careful balance between privacy and utility. Information about individuals must not be revealed, but a dataset should still be useful for studying the characteristics of a population. Privacy requirements such as k-anonymity and l-diversity are designed to thwart attacks that attempt to identify individuals in the data and to discover their sensitive information. On the other hand, the utility of such data has not been well-studied.In this paper we will discuss the shortcomings of current heuristic approaches to measuring utility and we will introduce a formal approach to measuring utility. Armed with this utility metric, we will show how to inject additional information into k-anonymous and l-diverse tables. This information has an intuitive semantic meaning, it increases the utility beyond what is possible in the original k-anonymity and l-diversity frameworks, and it maintains the privacy guarantees of k-anonymity and l-diversity.

Original languageEnglish (US)
Title of host publicationSIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data
Pages217-228
Number of pages12
DOIs
StatePublished - Dec 1 2006
Event2006 ACM SIGMOD International Conference on Management of Data - Chicago, IL, United States
Duration: Jun 27 2006Jun 29 2006

Publication series

NameProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)0730-8078

Other

Other2006 ACM SIGMOD International Conference on Management of Data
CountryUnited States
CityChicago, IL
Period6/27/066/29/06

Fingerprint

Semantics

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems

Cite this

Kifer, D., & Gehrke, J. (2006). Injecting utility into anonymized datasets. In SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 217-228). (Proceedings of the ACM SIGMOD International Conference on Management of Data). https://doi.org/10.1145/1142473.1142499
Kifer, Daniel ; Gehrke, Johannes. / Injecting utility into anonymized datasets. SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data. 2006. pp. 217-228 (Proceedings of the ACM SIGMOD International Conference on Management of Data).
@inproceedings{a702eb9e6b034c7ca565b4d31155f0f3,
title = "Injecting utility into anonymized datasets",
abstract = "Limiting disclosure in data publishing requires a careful balance between privacy and utility. Information about individuals must not be revealed, but a dataset should still be useful for studying the characteristics of a population. Privacy requirements such as k-anonymity and l-diversity are designed to thwart attacks that attempt to identify individuals in the data and to discover their sensitive information. On the other hand, the utility of such data has not been well-studied.In this paper we will discuss the shortcomings of current heuristic approaches to measuring utility and we will introduce a formal approach to measuring utility. Armed with this utility metric, we will show how to inject additional information into k-anonymous and l-diverse tables. This information has an intuitive semantic meaning, it increases the utility beyond what is possible in the original k-anonymity and l-diversity frameworks, and it maintains the privacy guarantees of k-anonymity and l-diversity.",
author = "Daniel Kifer and Johannes Gehrke",
year = "2006",
month = "12",
day = "1",
doi = "10.1145/1142473.1142499",
language = "English (US)",
isbn = "1595934340",
series = "Proceedings of the ACM SIGMOD International Conference on Management of Data",
pages = "217--228",
booktitle = "SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data",

}

Kifer, D & Gehrke, J 2006, Injecting utility into anonymized datasets. in SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data. Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 217-228, 2006 ACM SIGMOD International Conference on Management of Data, Chicago, IL, United States, 6/27/06. https://doi.org/10.1145/1142473.1142499

Injecting utility into anonymized datasets. / Kifer, Daniel; Gehrke, Johannes.

SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data. 2006. p. 217-228 (Proceedings of the ACM SIGMOD International Conference on Management of Data).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Injecting utility into anonymized datasets

AU - Kifer, Daniel

AU - Gehrke, Johannes

PY - 2006/12/1

Y1 - 2006/12/1

N2 - Limiting disclosure in data publishing requires a careful balance between privacy and utility. Information about individuals must not be revealed, but a dataset should still be useful for studying the characteristics of a population. Privacy requirements such as k-anonymity and l-diversity are designed to thwart attacks that attempt to identify individuals in the data and to discover their sensitive information. On the other hand, the utility of such data has not been well-studied.In this paper we will discuss the shortcomings of current heuristic approaches to measuring utility and we will introduce a formal approach to measuring utility. Armed with this utility metric, we will show how to inject additional information into k-anonymous and l-diverse tables. This information has an intuitive semantic meaning, it increases the utility beyond what is possible in the original k-anonymity and l-diversity frameworks, and it maintains the privacy guarantees of k-anonymity and l-diversity.

AB - Limiting disclosure in data publishing requires a careful balance between privacy and utility. Information about individuals must not be revealed, but a dataset should still be useful for studying the characteristics of a population. Privacy requirements such as k-anonymity and l-diversity are designed to thwart attacks that attempt to identify individuals in the data and to discover their sensitive information. On the other hand, the utility of such data has not been well-studied.In this paper we will discuss the shortcomings of current heuristic approaches to measuring utility and we will introduce a formal approach to measuring utility. Armed with this utility metric, we will show how to inject additional information into k-anonymous and l-diverse tables. This information has an intuitive semantic meaning, it increases the utility beyond what is possible in the original k-anonymity and l-diversity frameworks, and it maintains the privacy guarantees of k-anonymity and l-diversity.

UR - http://www.scopus.com/inward/record.url?scp=34250673244&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34250673244&partnerID=8YFLogxK

U2 - 10.1145/1142473.1142499

DO - 10.1145/1142473.1142499

M3 - Conference contribution

AN - SCOPUS:34250673244

SN - 1595934340

SN - 9781595934345

T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data

SP - 217

EP - 228

BT - SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data

ER -

Kifer D, Gehrke J. Injecting utility into anonymized datasets. In SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data. 2006. p. 217-228. (Proceedings of the ACM SIGMOD International Conference on Management of Data). https://doi.org/10.1145/1142473.1142499