File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1145/1807167.1807248
- Scopus: eid_2-s2.0-77954730854
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Non-homogeneous generalization in privacy preserving data publishing
Title | Non-homogeneous generalization in privacy preserving data publishing |
---|---|
Authors | |
Keywords | anonymization non-homogeneous generalization privacy |
Issue Date | 2010 |
Publisher | Association for Computing Machinery, Inc. The Journal's web site is located at http://www.acm.org/sigmod |
Citation | The 2010 International Conference on Management of Data (SIGMOD '10), Indianapolis, IN., 6-11 June 2010. In Proceedings of the ACM Conference on Management of Data, 2010, p. 747-758 How to Cite? |
Abstract | Most previous research on privacy-preserving data publishing, based on the k-anonymity model, has followed the simplistic approach of homogeneously giving the same generalized value in all quasi-identifiers within a partition. We observe that the anonymization error can be reduced if we follow a non-homogeneous generalization approach for groups of size larger than k. Such an approach would allow tuples within a partition to take different generalized quasi-identifier values. Anonymization following this model is not trivial, as its direct application can easily violate k-anonymity. In addition, non-homogeneous generalization allows for additional types of attack, which should be considered in the process. We provide a methodology for verifying whether a non-homogeneous generalization violates k-anonymity. Then, we propose a technique that generates a non-homogeneous generalization for a partition and show that its result satisfies k-anonymity, however by straightforwardly applying it, privacy can be compromised if the attacker knows the anonymization algorithm. Based on this, we propose a randomization method that prevents this type of attack and show that k-anonymity is not compromised by it. Nonhomogeneous generalization can be used on top of any existing partitioning approach to improve its utility. In addition, we show that a new partitioning technique tailored for non-homogeneous generalization can further improve quality. A thorough experimental evaluation demonstrates that our methodology greatly improves the utility of anonymized data in practice. © 2010 ACM. |
Persistent Identifier | http://hdl.handle.net/10722/129569 |
ISBN | |
ISSN | 2023 SCImago Journal Rankings: 2.640 |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wong, WK | en_HK |
dc.contributor.author | Mamoulis, N | en_HK |
dc.contributor.author | Cheung, DWL | en_HK |
dc.date.accessioned | 2010-12-23T08:39:22Z | - |
dc.date.available | 2010-12-23T08:39:22Z | - |
dc.date.issued | 2010 | en_HK |
dc.identifier.citation | The 2010 International Conference on Management of Data (SIGMOD '10), Indianapolis, IN., 6-11 June 2010. In Proceedings of the ACM Conference on Management of Data, 2010, p. 747-758 | en_HK |
dc.identifier.isbn | 978-1-4503-0032-2 | - |
dc.identifier.issn | 0730-8078 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/129569 | - |
dc.description.abstract | Most previous research on privacy-preserving data publishing, based on the k-anonymity model, has followed the simplistic approach of homogeneously giving the same generalized value in all quasi-identifiers within a partition. We observe that the anonymization error can be reduced if we follow a non-homogeneous generalization approach for groups of size larger than k. Such an approach would allow tuples within a partition to take different generalized quasi-identifier values. Anonymization following this model is not trivial, as its direct application can easily violate k-anonymity. In addition, non-homogeneous generalization allows for additional types of attack, which should be considered in the process. We provide a methodology for verifying whether a non-homogeneous generalization violates k-anonymity. Then, we propose a technique that generates a non-homogeneous generalization for a partition and show that its result satisfies k-anonymity, however by straightforwardly applying it, privacy can be compromised if the attacker knows the anonymization algorithm. Based on this, we propose a randomization method that prevents this type of attack and show that k-anonymity is not compromised by it. Nonhomogeneous generalization can be used on top of any existing partitioning approach to improve its utility. In addition, we show that a new partitioning technique tailored for non-homogeneous generalization can further improve quality. A thorough experimental evaluation demonstrates that our methodology greatly improves the utility of anonymized data in practice. © 2010 ACM. | en_HK |
dc.language | eng | en_US |
dc.publisher | Association for Computing Machinery, Inc. The Journal's web site is located at http://www.acm.org/sigmod | en_HK |
dc.relation.ispartof | Proceedings of the ACM SIGMOD International Conference on Management of Data | en_HK |
dc.rights | Proceedings of the ACM Conference on Management of Data. Copyright © Association for Computing Machinery. | - |
dc.subject | anonymization | en_HK |
dc.subject | non-homogeneous generalization | en_HK |
dc.subject | privacy | en_HK |
dc.title | Non-homogeneous generalization in privacy preserving data publishing | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.email | Mamoulis, N:nikos@cs.hku.hk | en_HK |
dc.identifier.email | Cheung, DWL:dcheung@cs.hku.hk | en_HK |
dc.identifier.authority | Mamoulis, N=rp00155 | en_HK |
dc.identifier.authority | Cheung, DWL=rp00101 | en_HK |
dc.description.nature | link_to_OA_fulltext | - |
dc.identifier.doi | 10.1145/1807167.1807248 | en_HK |
dc.identifier.scopus | eid_2-s2.0-77954730854 | en_HK |
dc.identifier.hkuros | 176422 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-77954730854&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 747 | en_HK |
dc.identifier.epage | 758 | en_HK |
dc.publisher.place | United States | en_HK |
dc.description.other | The 2010 International Conference on Management of Data (SIGMOD '10), Indianapolis, IN., 6-11 June 2010. In Proceedings of the ACM Conference on Management of Data, 2010, p. 747-758 | - |
dc.identifier.scopusauthorid | Wong, WK=8835876000 | en_HK |
dc.identifier.scopusauthorid | Mamoulis, N=6701782749 | en_HK |
dc.identifier.scopusauthorid | Cheung, DWL=34567902600 | en_HK |
dc.identifier.issnl | 0730-8078 | - |