File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Tight and fast generalization error bound of graph embedding in metric space

TitleTight and fast generalization error bound of graph embedding in metric space
Authors
Issue Date2023
Citation
Proceedings of Machine Learning Research, 2023, v. 202, p. 33268-33284 How to Cite?
AbstractRecent studies have experimentally shown that we can achieve in non-Euclidean metric space effective and efficient graph embedding, which aims to obtain the vertices' representations reflecting the graph's structure in the metric space. Specifically, graph embedding in hyperbolic space has experimentally succeeded in embedding graphs with hierarchical-tree structure, e.g., data in natural languages, social networks, and knowledge bases. However, recent theoretical analyses have shown a much higher upper bound on non-Euclidean graph embedding's generalization error than Euclidean one's, where a high generalization error indicates that the incompleteness and noise in the data can significantly damage learning performance. It implies that the existing bound cannot guarantee the success of graph embedding in non-Euclidean metric space in a practical training data size, which can prevent non-Euclidean graph embedding's application in real problems. This paper provides a novel upper bound of graph embedding's generalization error by evaluating the local Rademacher complexity of the model as a function set of the distances of representation couples. Our bound clarifies that the performance of graph embedding in non-Euclidean metric space, including hyperbolic space, is better than the existing upper bounds suggest. Specifically, our new upper bound is polynomial in the metric space's geometric radius R and can be O(S1 ) at the fastest, where S is the training data size. Our bound is significantly tighter and faster than the existing one, which can be exponential in R and O(√1S ) at the fastest. Specific calculations on example cases show that graph embedding in non-Euclidean metric space can outperform that in Euclidean space with much smaller training data than the existing bound has suggested.
Persistent Identifierhttp://hdl.handle.net/10722/354298

 

DC FieldValueLanguage
dc.contributor.authorSuzuki, Atsushi-
dc.contributor.authorNitanda, Atsushi-
dc.contributor.authorSuzuki, Taiji-
dc.contributor.authorWang, Jing-
dc.contributor.authorTian, Feng-
dc.contributor.authorYamanishi, Kenji-
dc.date.accessioned2025-02-07T08:47:45Z-
dc.date.available2025-02-07T08:47:45Z-
dc.date.issued2023-
dc.identifier.citationProceedings of Machine Learning Research, 2023, v. 202, p. 33268-33284-
dc.identifier.urihttp://hdl.handle.net/10722/354298-
dc.description.abstractRecent studies have experimentally shown that we can achieve in non-Euclidean metric space effective and efficient graph embedding, which aims to obtain the vertices' representations reflecting the graph's structure in the metric space. Specifically, graph embedding in hyperbolic space has experimentally succeeded in embedding graphs with hierarchical-tree structure, e.g., data in natural languages, social networks, and knowledge bases. However, recent theoretical analyses have shown a much higher upper bound on non-Euclidean graph embedding's generalization error than Euclidean one's, where a high generalization error indicates that the incompleteness and noise in the data can significantly damage learning performance. It implies that the existing bound cannot guarantee the success of graph embedding in non-Euclidean metric space in a practical training data size, which can prevent non-Euclidean graph embedding's application in real problems. This paper provides a novel upper bound of graph embedding's generalization error by evaluating the local Rademacher complexity of the model as a function set of the distances of representation couples. Our bound clarifies that the performance of graph embedding in non-Euclidean metric space, including hyperbolic space, is better than the existing upper bounds suggest. Specifically, our new upper bound is polynomial in the metric space's geometric radius R and can be O(S1 ) at the fastest, where S is the training data size. Our bound is significantly tighter and faster than the existing one, which can be exponential in R and O(√1S ) at the fastest. Specific calculations on example cases show that graph embedding in non-Euclidean metric space can outperform that in Euclidean space with much smaller training data than the existing bound has suggested.-
dc.languageeng-
dc.relation.ispartofProceedings of Machine Learning Research-
dc.titleTight and fast generalization error bound of graph embedding in metric space-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.scopuseid_2-s2.0-85174415354-
dc.identifier.volume202-
dc.identifier.spage33268-
dc.identifier.epage33284-
dc.identifier.eissn2640-3498-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats