File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1145/1538909.1538911
- Scopus: eid_2-s2.0-68649129111
- WOS: WOS:000268472600002
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: A framework for efficient data anonymization under privacy and accuracy constraints
Title | A framework for efficient data anonymization under privacy and accuracy constraints |
---|---|
Authors | |
Keywords | Anonymity Privacy |
Issue Date | 2009 |
Publisher | Association for Computing Machinery, Inc |
Citation | ACM Transactions On Database Systems, 2009, v. 34 n. 2, article no. 9 How to Cite? |
Abstract | Recent research studied the problem of publishing microdata without revealing sensitive information, leading to the privacy-preserving paradigms of k-anonymity and l-diversity. k-anonymity protects against the identification of an individual's record. l-diversity, in addition, safeguards against the association of an individual with specific sensitive information. However, existing approaches suffer from at least one of the following drawbacks: (i) l-diversification is solved by techniques developed for the simpler k-anonymization problem, causing unnecessary information loss. (ii) The anonymization process is inefficient in terms of computational and I/O cost. (iii) Previous research focused exclusively on the privacy-constrained problem and ignored the equally important accuracy-constrained (or dual) anonymization problem. In this article, we propose a framework for efficient anonymization of microdata that addresses these deficiencies. First, we focus on one-dimensional (i.e., single-attribute) quasi-identifiers, and study the properties of optimal solutions under the k-anonymity and l-diversity models for the privacy-constrained (i.e., direct) and the accuracy-constrained (i.e., dual) anonymization problems. Guided by these properties, we develop efficient heuristics to solve the one-dimensional problems in linear time. Finally, we generalize our solutions to multidimensional quasi-identifiers using space-mapping techniques. Extensive experimental evaluation shows that our techniques clearly outperform the existing approaches in terms of execution time and information loss. © 2009 ACM. |
Persistent Identifier | http://hdl.handle.net/10722/60618 |
ISSN | 2023 Impact Factor: 2.2 2023 SCImago Journal Rankings: 1.730 |
ISI Accession Number ID | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ghinita, G | en_HK |
dc.contributor.author | Karras, P | en_HK |
dc.contributor.author | Kalnis, P | en_HK |
dc.contributor.author | Mamoulis, N | en_HK |
dc.date.accessioned | 2010-05-31T04:15:05Z | - |
dc.date.available | 2010-05-31T04:15:05Z | - |
dc.date.issued | 2009 | en_HK |
dc.identifier.citation | ACM Transactions On Database Systems, 2009, v. 34 n. 2, article no. 9 | en_HK |
dc.identifier.issn | 0362-5915 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/60618 | - |
dc.description.abstract | Recent research studied the problem of publishing microdata without revealing sensitive information, leading to the privacy-preserving paradigms of k-anonymity and l-diversity. k-anonymity protects against the identification of an individual's record. l-diversity, in addition, safeguards against the association of an individual with specific sensitive information. However, existing approaches suffer from at least one of the following drawbacks: (i) l-diversification is solved by techniques developed for the simpler k-anonymization problem, causing unnecessary information loss. (ii) The anonymization process is inefficient in terms of computational and I/O cost. (iii) Previous research focused exclusively on the privacy-constrained problem and ignored the equally important accuracy-constrained (or dual) anonymization problem. In this article, we propose a framework for efficient anonymization of microdata that addresses these deficiencies. First, we focus on one-dimensional (i.e., single-attribute) quasi-identifiers, and study the properties of optimal solutions under the k-anonymity and l-diversity models for the privacy-constrained (i.e., direct) and the accuracy-constrained (i.e., dual) anonymization problems. Guided by these properties, we develop efficient heuristics to solve the one-dimensional problems in linear time. Finally, we generalize our solutions to multidimensional quasi-identifiers using space-mapping techniques. Extensive experimental evaluation shows that our techniques clearly outperform the existing approaches in terms of execution time and information loss. © 2009 ACM. | en_HK |
dc.language | eng | en_HK |
dc.publisher | Association for Computing Machinery, Inc | en_HK |
dc.relation.ispartof | ACM Transactions on Database Systems | en_HK |
dc.rights | ACM Transactions on Database Systems. Copyright © Association for Computing Machinery, Inc. | en_HK |
dc.subject | Anonymity | en_HK |
dc.subject | Privacy | en_HK |
dc.title | A framework for efficient data anonymization under privacy and accuracy constraints | en_HK |
dc.type | Article | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=0362-5915&volume=34&issue=2&spage=47&epage=&date=2009&atitle=A+Framework+for+Efficient+Data+Anonymization+under+Privacy+and+Accuracy+Constraints | en_HK |
dc.identifier.email | Mamoulis, N:nikos@cs.hku.hk | en_HK |
dc.identifier.authority | Mamoulis, N=rp00155 | en_HK |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1145/1538909.1538911 | en_HK |
dc.identifier.scopus | eid_2-s2.0-68649129111 | en_HK |
dc.identifier.hkuros | 166338 | en_HK |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-68649129111&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 34 | en_HK |
dc.identifier.issue | 2 | en_HK |
dc.identifier.spage | article no. 9 | - |
dc.identifier.epage | article no. 9 | - |
dc.identifier.isi | WOS:000268472600002 | - |
dc.publisher.place | United States | en_HK |
dc.identifier.scopusauthorid | Ghinita, G=22834343300 | en_HK |
dc.identifier.scopusauthorid | Karras, P=14028488200 | en_HK |
dc.identifier.scopusauthorid | Kalnis, P=6603477534 | en_HK |
dc.identifier.scopusauthorid | Mamoulis, N=6701782749 | en_HK |
dc.identifier.issnl | 0362-5915 | - |