File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

postgraduate thesis: Collaborative privacy-preserving data mining in the cloud

TitleCollaborative privacy-preserving data mining in the cloud
Authors
Advisors
Advisor(s):Yiu, SM
Issue Date2018
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Zhang, J. [张君]. (2018). Collaborative privacy-preserving data mining in the cloud. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractWith the popularity of cloud computing, an increasing number of institutions outsource their data to a third-party cloud system. In such an environment, demands exist for both the privacy and mining of data. To protect data privacy, different institutions encrypt their data with different keys. To discover knowledge from existing data, mining techniques are used widely but computationally intensive, especially for large datasets. Combining data from different institutions for a big and varied training set helps enhance data mining performance, which motivates the study of collaborative privacy-preserving data mining. The integration of cloud computing with collaborative privacy-preserving data mining provides a promising solution to the above needs. In this thesis, we study three important issues: secure dot product, privacy-preserving support vector machine and privacy-preserving elastic net. First, the dot product is a fundamental operation in data mining algorithms. We design a secure dot product scheme for encrypted vectors under different keys on one cloud server. We extend the application scenario of an integer vector encryption that supports dot product computation from one institution to multiple institutions. We prove our extension is secure. To allow the cloud to decrypt an encrypted dot product, we generate a key-switching matrix to transform the encrypted dot product. Our secure dot product scheme does not require the institutions to stay online. The experimental results confirm that our system is efficient. Second, the support vector machine is one of the most popular classifiers. We present a privacy-preserving support vector machine protocol, which runs on encrypted data under different keys on one cloud server. As training a support vector machine requires the knowledge of a gram matrix that consists of dot products, we base our scheme on the secure dot product protocol. The institutions could stay offline in our system. To enhance efficiency and security, we adopt a reduced support vector machine with a secure kernel matrix. Each element of the secure kernel matrix is a dot product between a training sample and a random vector. We prove the security of our protocol and the experimental results highlight its practicality. Finally, the elastic net is an essential regression tool that has important applications. We construct the first privacy-preserving elastic net protocol for encrypted data under different keys on two non-colluding cloud servers. We reduce the elastic net regression to a support vector machine. As reduction leads to dataset transformation, we propose a cryptosystem that supports one multiplication and multiple addition operations. We employ secret sharing to keep the institutions offline. Our scheme is proven to be secure. The experimental results demonstrate that our system can be applied in practice. To conclude, we solved three issues in collaborative privacy-preserving data mining in the cloud. We plan to leverage parallel computing to accelerate our schemes, design privacy-preserving schemes for other data mining algorithms, and utilize verifiable computing to prevent malicious cloud in the future.
DegreeDoctor of Philosophy
SubjectData protection
Data mining
Cloud computing
Dept/ProgramComputer Science
Persistent Identifierhttp://hdl.handle.net/10722/276468

 

DC FieldValueLanguage
dc.contributor.advisorYiu, SM-
dc.contributor.authorZhang, Jun-
dc.contributor.author张君-
dc.date.accessioned2019-09-17T04:54:58Z-
dc.date.available2019-09-17T04:54:58Z-
dc.date.issued2018-
dc.identifier.citationZhang, J. [张君]. (2018). Collaborative privacy-preserving data mining in the cloud. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/276468-
dc.description.abstractWith the popularity of cloud computing, an increasing number of institutions outsource their data to a third-party cloud system. In such an environment, demands exist for both the privacy and mining of data. To protect data privacy, different institutions encrypt their data with different keys. To discover knowledge from existing data, mining techniques are used widely but computationally intensive, especially for large datasets. Combining data from different institutions for a big and varied training set helps enhance data mining performance, which motivates the study of collaborative privacy-preserving data mining. The integration of cloud computing with collaborative privacy-preserving data mining provides a promising solution to the above needs. In this thesis, we study three important issues: secure dot product, privacy-preserving support vector machine and privacy-preserving elastic net. First, the dot product is a fundamental operation in data mining algorithms. We design a secure dot product scheme for encrypted vectors under different keys on one cloud server. We extend the application scenario of an integer vector encryption that supports dot product computation from one institution to multiple institutions. We prove our extension is secure. To allow the cloud to decrypt an encrypted dot product, we generate a key-switching matrix to transform the encrypted dot product. Our secure dot product scheme does not require the institutions to stay online. The experimental results confirm that our system is efficient. Second, the support vector machine is one of the most popular classifiers. We present a privacy-preserving support vector machine protocol, which runs on encrypted data under different keys on one cloud server. As training a support vector machine requires the knowledge of a gram matrix that consists of dot products, we base our scheme on the secure dot product protocol. The institutions could stay offline in our system. To enhance efficiency and security, we adopt a reduced support vector machine with a secure kernel matrix. Each element of the secure kernel matrix is a dot product between a training sample and a random vector. We prove the security of our protocol and the experimental results highlight its practicality. Finally, the elastic net is an essential regression tool that has important applications. We construct the first privacy-preserving elastic net protocol for encrypted data under different keys on two non-colluding cloud servers. We reduce the elastic net regression to a support vector machine. As reduction leads to dataset transformation, we propose a cryptosystem that supports one multiplication and multiple addition operations. We employ secret sharing to keep the institutions offline. Our scheme is proven to be secure. The experimental results demonstrate that our system can be applied in practice. To conclude, we solved three issues in collaborative privacy-preserving data mining in the cloud. We plan to leverage parallel computing to accelerate our schemes, design privacy-preserving schemes for other data mining algorithms, and utilize verifiable computing to prevent malicious cloud in the future.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshData protection-
dc.subject.lcshData mining-
dc.subject.lcshCloud computing-
dc.titleCollaborative privacy-preserving data mining in the cloud-
dc.typePG_Thesis-
dc.description.thesisnameDoctor of Philosophy-
dc.description.thesislevelDoctoral-
dc.description.thesisdisciplineComputer Science-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.5353/th_991044058183903414-
dc.date.hkucongregation2018-
dc.identifier.mmsid991044058183903414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats