论文标题
通过异质匿名步行对异质结构的表示形式学习
Representation Learning on Heterostructures via Heterogeneous Anonymous Walks
论文作者
论文摘要
捕获结构相似性一直是网络嵌入领域的热门话题,因为它有助于理解节点功能和行为。但是,现有作品非常关注同质网络上的学习结构,而对异质网络的相关研究仍然是一个无效的。在本文中,我们试图迈出代表方面学习异质结构的第一步,由于它们的节点类型和基础结构的高度组合,这非常具有挑战性。为了有效地区分各种异质结构,我们首先提出了一种称为异质匿名步行(HAW)及其变异的粗haw(Chaw)的理论保证的技术。然后,我们以数据驱动的方式设计了异质的匿名步行嵌入(HAWE)及其变体的粗hawe,以通过预测每个节点附近发生的步行,并使用大量可能的步行和火车嵌入。最后,我们在合成和现实世界网络上设计并应用了广泛的说明性实验,以在异质结构学习上建立基准,并评估我们方法的有效性。结果表明,与均质和异质经典方法相比,我们的方法实现了出色的性能,并且可以应用于大规模网络。
Capturing structural similarity has been a hot topic in the field of network embedding recently due to its great help in understanding the node functions and behaviors. However, existing works have paid very much attention to learning structures on homogeneous networks while the related study on heterogeneous networks is still a void. In this paper, we try to take the first step for representation learning on heterostructures, which is very challenging due to their highly diverse combinations of node types and underlying structures. To effectively distinguish diverse heterostructures, we firstly propose a theoretically guaranteed technique called heterogeneous anonymous walk (HAW) and its variant coarse HAW (CHAW). Then, we devise the heterogeneous anonymous walk embedding (HAWE) and its variant coarse HAWE in a data-driven manner to circumvent using an extremely large number of possible walks and train embeddings by predicting occurring walks in the neighborhood of each node. Finally, we design and apply extensive and illustrative experiments on synthetic and real-world networks to build a benchmark on heterostructure learning and evaluate the effectiveness of our methods. The results demonstrate our methods achieve outstanding performance compared with both homogeneous and heterogeneous classic methods, and can be applied on large-scale networks.