File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

postgraduate thesis: Effective and efficient discovery of top-k meta path in heterogeneous information networks

TitleEffective and efficient discovery of top-k meta path in heterogeneous information networks
Authors
Advisors
Advisor(s):Cheng, CK
Issue Date2019
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Zhu, Z. [朱子晨]. (2019). Effective and efficient discovery of top-k meta path in heterogeneous information networks. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract\textit{Heterogeneous information networks (HINs)}, which are typed graphs with labeled nodes and edges, have attracted tremendous interest from academia and industry. Given two HIN nodes $s$ and $t$, and a natural number $k$, we study the discovery of the $k$ most important paths in real time. The returned paths can be used to support friend search, product recommendation, anomaly detection, and graph clustering. Although related algorithms have been proposed before, they were primarily designed to return the $k$ shortest paths from unlabeled graphs. This leads to two problems: (1) there are often many shortest paths between $s$ and $t$, and so it is not easy to choose the $k$ best ones; and (2) it is arguable whether a shorter path implies a more crucial one. To address these issues, we study the {\it top-$k$ meta path query} for an HIN. A meta path is essentially a sequence of node and edge types between $s$ and $t$ (e.g., {\it Author}$\xrightarrow{write}${\it Paper}$\xrightarrow{written-by}${\it Author} denotes the coauthor relationship of $s$ and $t$). A meta path abstracts multiple path instances into a high-level path pattern, thereby giving more insight between two nodes. We further study several ranking functions that evaluate the \textit{importance} of meta paths based on \textit{frequency} and \textit{rarity}, rather than on path length. Also, we propose a solution that seamlessly integrates these functions into a multi-step searching framework. In addition, we combine bidirectional searching algorithm with this framework to further boost up the efficiency performance. The experiment on different datasets shows that the proposed method outperforms state-of-the-art algorithms in terms of effectiveness with reasonable response time.
DegreeMaster of Philosophy
SubjectInformation networks
Dept/ProgramComputer Science
Persistent Identifierhttp://hdl.handle.net/10722/278449

 

DC FieldValueLanguage
dc.contributor.advisorCheng, CK-
dc.contributor.authorZhu, Zichen-
dc.contributor.author朱子晨-
dc.date.accessioned2019-10-09T01:17:46Z-
dc.date.available2019-10-09T01:17:46Z-
dc.date.issued2019-
dc.identifier.citationZhu, Z. [朱子晨]. (2019). Effective and efficient discovery of top-k meta path in heterogeneous information networks. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/278449-
dc.description.abstract\textit{Heterogeneous information networks (HINs)}, which are typed graphs with labeled nodes and edges, have attracted tremendous interest from academia and industry. Given two HIN nodes $s$ and $t$, and a natural number $k$, we study the discovery of the $k$ most important paths in real time. The returned paths can be used to support friend search, product recommendation, anomaly detection, and graph clustering. Although related algorithms have been proposed before, they were primarily designed to return the $k$ shortest paths from unlabeled graphs. This leads to two problems: (1) there are often many shortest paths between $s$ and $t$, and so it is not easy to choose the $k$ best ones; and (2) it is arguable whether a shorter path implies a more crucial one. To address these issues, we study the {\it top-$k$ meta path query} for an HIN. A meta path is essentially a sequence of node and edge types between $s$ and $t$ (e.g., {\it Author}$\xrightarrow{write}${\it Paper}$\xrightarrow{written-by}${\it Author} denotes the coauthor relationship of $s$ and $t$). A meta path abstracts multiple path instances into a high-level path pattern, thereby giving more insight between two nodes. We further study several ranking functions that evaluate the \textit{importance} of meta paths based on \textit{frequency} and \textit{rarity}, rather than on path length. Also, we propose a solution that seamlessly integrates these functions into a multi-step searching framework. In addition, we combine bidirectional searching algorithm with this framework to further boost up the efficiency performance. The experiment on different datasets shows that the proposed method outperforms state-of-the-art algorithms in terms of effectiveness with reasonable response time.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshInformation networks-
dc.titleEffective and efficient discovery of top-k meta path in heterogeneous information networks-
dc.typePG_Thesis-
dc.description.thesisnameMaster of Philosophy-
dc.description.thesislevelMaster-
dc.description.thesisdisciplineComputer Science-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.5353/th_991044146578203414-
dc.date.hkucongregation2019-
dc.identifier.mmsid991044146578203414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats