File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Scalable and feasible learning and retrieval from matrix data
Title | Scalable and feasible learning and retrieval from matrix data |
---|---|
Authors | |
Advisors | |
Issue Date | 2018 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Li, H. [李輝]. (2018). Scalable and feasible learning and retrieval from matrix data. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | Matrix data are commonly found in AI applications. They differ from traditional tables in relational database, as they only contain numerical values and the interaction between matrices involves linear algebraic operations instead of relational operations. Matrix data management plays an important role in the age of big data and artificial intelligence, as it is directly related to the performance of many AI systems. In this thesis, we develop a number of techniques that improve the scalability and accuracy of two AI applications: recommender systems and knowledge bases.
We first conduct an experimental study which demonstrates that the bottleneck of matrix factorization in recommender systems is the retrieval phase. Based on this observation, we design an exact inner product retrieval framework FEXIPRO, which can retrieve top-k results from learned factors extremely fast. Then, we extend the inner product retrieval problem to a multi-matrix product retrieval problem, which is related to the evaluation of knowledge base completion. For this problem, we present a sampling based evaluation framework WedgeEval, which can give a fast estimation of model performance. Lastly, we investigate the cross-domain application of sequential recommendation. We present CTransRec, which adapts ideas from knowledge base completion and utilizes auxiliary information to improve the quality of sequential recommendation.
The frameworks presented in this thesis not only assist researchers and developers in fast model selection and hyper-parameter tuning, but also help the systems give quick and accurate feedback to users' queries. Although the learning and retrieval algorithms that we propose in this thesis mainly focus on two applications, recommender systems and knowledge bases, they can be easily extended to apply in other applications where similar search operations exist. |
Degree | Doctor of Philosophy |
Subject | Matrices - Data processing |
Dept/Program | Computer Science |
Persistent Identifier | http://hdl.handle.net/10722/265402 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kao, CM | - |
dc.contributor.advisor | Mamoulis, N | - |
dc.contributor.author | Li, Hui | - |
dc.contributor.author | 李輝 | - |
dc.date.accessioned | 2018-11-29T06:22:35Z | - |
dc.date.available | 2018-11-29T06:22:35Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | Li, H. [李輝]. (2018). Scalable and feasible learning and retrieval from matrix data. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/265402 | - |
dc.description.abstract | Matrix data are commonly found in AI applications. They differ from traditional tables in relational database, as they only contain numerical values and the interaction between matrices involves linear algebraic operations instead of relational operations. Matrix data management plays an important role in the age of big data and artificial intelligence, as it is directly related to the performance of many AI systems. In this thesis, we develop a number of techniques that improve the scalability and accuracy of two AI applications: recommender systems and knowledge bases. We first conduct an experimental study which demonstrates that the bottleneck of matrix factorization in recommender systems is the retrieval phase. Based on this observation, we design an exact inner product retrieval framework FEXIPRO, which can retrieve top-k results from learned factors extremely fast. Then, we extend the inner product retrieval problem to a multi-matrix product retrieval problem, which is related to the evaluation of knowledge base completion. For this problem, we present a sampling based evaluation framework WedgeEval, which can give a fast estimation of model performance. Lastly, we investigate the cross-domain application of sequential recommendation. We present CTransRec, which adapts ideas from knowledge base completion and utilizes auxiliary information to improve the quality of sequential recommendation. The frameworks presented in this thesis not only assist researchers and developers in fast model selection and hyper-parameter tuning, but also help the systems give quick and accurate feedback to users' queries. Although the learning and retrieval algorithms that we propose in this thesis mainly focus on two applications, recommender systems and knowledge bases, they can be easily extended to apply in other applications where similar search operations exist. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Matrices - Data processing | - |
dc.title | Scalable and feasible learning and retrieval from matrix data | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Computer Science | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.5353/th_991044058176303414 | - |
dc.date.hkucongregation | 2018 | - |
dc.identifier.mmsid | 991044058176303414 | - |