Scalable and feasible learning and retrieval from matrix data

Li, Hui; 李輝

File Download

FullText.pdf

Links for fulltext

(May Require Subscription)

DOI: 10.5353/th_991044058176303414

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Computer Science: Theses

postgraduate thesis: Scalable and feasible learning and retrieval from matrix data

Title	Scalable and feasible learning and retrieval from matrix data
Authors	Li, Hui 李輝
Advisors	Advisor(s):Kao, CM Mamoulis, N
Issue Date	2018
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Li, H. [李輝]. (2018). Scalable and feasible learning and retrieval from matrix data. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Matrix data are commonly found in AI applications. They differ from traditional tables in relational database, as they only contain numerical values and the interaction between matrices involves linear algebraic operations instead of relational operations. Matrix data management plays an important role in the age of big data and artificial intelligence, as it is directly related to the performance of many AI systems. In this thesis, we develop a number of techniques that improve the scalability and accuracy of two AI applications: recommender systems and knowledge bases. We first conduct an experimental study which demonstrates that the bottleneck of matrix factorization in recommender systems is the retrieval phase. Based on this observation, we design an exact inner product retrieval framework FEXIPRO, which can retrieve top-k results from learned factors extremely fast. Then, we extend the inner product retrieval problem to a multi-matrix product retrieval problem, which is related to the evaluation of knowledge base completion. For this problem, we present a sampling based evaluation framework WedgeEval, which can give a fast estimation of model performance. Lastly, we investigate the cross-domain application of sequential recommendation. We present CTransRec, which adapts ideas from knowledge base completion and utilizes auxiliary information to improve the quality of sequential recommendation. The frameworks presented in this thesis not only assist researchers and developers in fast model selection and hyper-parameter tuning, but also help the systems give quick and accurate feedback to users' queries. Although the learning and retrieval algorithms that we propose in this thesis mainly focus on two applications, recommender systems and knowledge bases, they can be easily extended to apply in other applications where similar search operations exist.
Degree	Doctor of Philosophy
Subject	Matrices - Data processing
Dept/Program	Computer Science
Persistent Identifier	http://hdl.handle.net/10722/265402

DC Field	Value	Language
dc.contributor.advisor	Kao, CM	-
dc.contributor.advisor	Mamoulis, N	-
dc.contributor.author	Li, Hui	-
dc.contributor.author	李輝	-
dc.date.accessioned	2018-11-29T06:22:35Z	-
dc.date.available	2018-11-29T06:22:35Z	-
dc.date.issued	2018	-
dc.identifier.citation	Li, H. [李輝]. (2018). Scalable and feasible learning and retrieval from matrix data. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/265402	-
dc.description.abstract	Matrix data are commonly found in AI applications. They differ from traditional tables in relational database, as they only contain numerical values and the interaction between matrices involves linear algebraic operations instead of relational operations. Matrix data management plays an important role in the age of big data and artificial intelligence, as it is directly related to the performance of many AI systems. In this thesis, we develop a number of techniques that improve the scalability and accuracy of two AI applications: recommender systems and knowledge bases. We first conduct an experimental study which demonstrates that the bottleneck of matrix factorization in recommender systems is the retrieval phase. Based on this observation, we design an exact inner product retrieval framework FEXIPRO, which can retrieve top-k results from learned factors extremely fast. Then, we extend the inner product retrieval problem to a multi-matrix product retrieval problem, which is related to the evaluation of knowledge base completion. For this problem, we present a sampling based evaluation framework WedgeEval, which can give a fast estimation of model performance. Lastly, we investigate the cross-domain application of sequential recommendation. We present CTransRec, which adapts ideas from knowledge base completion and utilizes auxiliary information to improve the quality of sequential recommendation. The frameworks presented in this thesis not only assist researchers and developers in fast model selection and hyper-parameter tuning, but also help the systems give quick and accurate feedback to users' queries. Although the learning and retrieval algorithms that we propose in this thesis mainly focus on two applications, recommender systems and knowledge bases, they can be easily extended to apply in other applications where similar search operations exist.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Matrices - Data processing	-
dc.title	Scalable and feasible learning and retrieval from matrix data	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Computer Science	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.5353/th_991044058176303414	-
dc.date.hkucongregation	2018	-
dc.identifier.mmsid	991044058176303414	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

postgraduate thesis: Scalable and feasible learning and retrieval from matrix data

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats