File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Discovering meta-paths in large knowledge bases
Title | Discovering meta-paths in large knowledge bases |
---|---|
Authors | |
Issue Date | 2014 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Meng, C. [蒙昌平]. (2014). Discovering meta-paths in large knowledge bases. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5435672 |
Abstract | A knowledge base, such as Yago or DBpedia, can be modeled as a large graph with nodes and edges annotated with class and relationship labels. Recent work has studied how to make use of these rich information sources. In particular, meta-paths, which represent sequences of node classes and edge types between two nodes in a knowledge base, have been proposed for such tasks as information retrieval, decision making, and product recommendation. Current methods assume meta-paths are found by domain experts. However, in a large and complex knowledge base, retrieving meta-paths manually can be tedious and difficult. We thus study how to discover meta-paths automatically. Specifically, users are asked to provide example pairs of nodes that exhibit high proximity. We then investigate how to generate meta-paths that can best explain the relationship between these node pairs. Since this problem is computationally intractable, we propose a greedy algorithm to select the most relevant meta-paths. We also present a data structure to enable efficient execution of this algorithm. We further incorporate hierarchical relationships among node classes in our solutions. Finally, we propose an effective similarity join algorithm in order to generate more node pairs using these meta-paths. Extensive experiments on real knowledge bases show that our approach captures important meta-paths in an efficient and scalable manner. |
Degree | Master of Philosophy |
Subject | Data mining Knowledge management |
Dept/Program | Computer Science |
Persistent Identifier | http://hdl.handle.net/10722/209504 |
HKU Library Item ID | b5435672 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Meng, Changping | - |
dc.contributor.author | 蒙昌平 | - |
dc.date.accessioned | 2015-04-23T23:10:55Z | - |
dc.date.available | 2015-04-23T23:10:55Z | - |
dc.date.issued | 2014 | - |
dc.identifier.citation | Meng, C. [蒙昌平]. (2014). Discovering meta-paths in large knowledge bases. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5435672 | - |
dc.identifier.uri | http://hdl.handle.net/10722/209504 | - |
dc.description.abstract | A knowledge base, such as Yago or DBpedia, can be modeled as a large graph with nodes and edges annotated with class and relationship labels. Recent work has studied how to make use of these rich information sources. In particular, meta-paths, which represent sequences of node classes and edge types between two nodes in a knowledge base, have been proposed for such tasks as information retrieval, decision making, and product recommendation. Current methods assume meta-paths are found by domain experts. However, in a large and complex knowledge base, retrieving meta-paths manually can be tedious and difficult. We thus study how to discover meta-paths automatically. Specifically, users are asked to provide example pairs of nodes that exhibit high proximity. We then investigate how to generate meta-paths that can best explain the relationship between these node pairs. Since this problem is computationally intractable, we propose a greedy algorithm to select the most relevant meta-paths. We also present a data structure to enable efficient execution of this algorithm. We further incorporate hierarchical relationships among node classes in our solutions. Finally, we propose an effective similarity join algorithm in order to generate more node pairs using these meta-paths. Extensive experiments on real knowledge bases show that our approach captures important meta-paths in an efficient and scalable manner. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Data mining | - |
dc.subject.lcsh | Knowledge management | - |
dc.title | Discovering meta-paths in large knowledge bases | - |
dc.type | PG_Thesis | - |
dc.identifier.hkul | b5435672 | - |
dc.description.thesisname | Master of Philosophy | - |
dc.description.thesislevel | Master | - |
dc.description.thesisdiscipline | Computer Science | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.5353/th_b5435672 | - |
dc.identifier.mmsid | 991003168679703414 | - |