论文标题
了解图形嵌入方法及其应用
Understanding graph embedding methods and their applications
论文作者
论文摘要
图形分析可以使对复杂网络的更好的定量理解和控制,但是传统方法具有高度计算成本和与工业大小网络的高差异性和异质特征相关的过度记忆要求。图嵌入技术可以有效地将高维稀疏图转换为低维,密集和连续的矢量空间,从而最大程度地保留图形结构属性。嵌入的另一种新兴图嵌入方式采用重要的不确定性估计的基于高斯分布的图嵌入。图形嵌入方法的主要目标是将每个节点的属性包装到具有较小维度的向量中,因此,可以使用标准指标在嵌入式矢量空间中轻松量化原始复杂不规则空间中的节点相似性。潜在空间中生成的非线性和高度信息的图形嵌入可以方便地用于解决不同的下游图分析任务(例如,节点分类,链接预测,社区检测,可视化等)。在这篇综述中,我们介绍了图分析和图形嵌入方法中的一些基本概念,尤其是基于随机步行和基于神经网络的方法。我们还讨论了新兴的基于学习的动态图嵌入方法。我们重点介绍了四个不同应用程序中图形嵌入方法的独特优势,并介绍了附录中对开源软件的实现详细信息和参考,并提供了数据库,供有兴趣的读者开始探索图形分析。
Graph analytics can lead to better quantitative understanding and control of complex networks, but traditional methods suffer from high computational cost and excessive memory requirements associated with the high-dimensionality and heterogeneous characteristics of industrial size networks. Graph embedding techniques can be effective in converting high-dimensional sparse graphs into low-dimensional, dense and continuous vector spaces, preserving maximally the graph structure properties. Another type of emerging graph embedding employs Gaussian distribution-based graph embedding with important uncertainty estimation. The main goal of graph embedding methods is to pack every node's properties into a vector with a smaller dimension, hence, node similarity in the original complex irregular spaces can be easily quantified in the embedded vector spaces using standard metrics. The generated nonlinear and highly informative graph embeddings in the latent space can be conveniently used to address different downstream graph analytics tasks (e.g., node classification, link prediction, community detection, visualization, etc.). In this Review, we present some fundamental concepts in graph analytics and graph embedding methods, focusing in particular on random walk-based and neural network-based methods. We also discuss the emerging deep learning-based dynamic graph embedding methods. We highlight the distinct advantages of graph embedding methods in four diverse applications, and present implementation details and references to open-source software as well as available databases in the Appendix for the interested readers to start their exploration into graph analytics.