论文标题
学习互补建议的组成视觉连贯性
Learning the Compositional Visual Coherence for Complementary Recommendations
论文作者
论文摘要
旨在为用户提供补充且与其获得的物品兼容的产品建议的补充建议已成为近年来学术界和行业的热门话题。但是,由于其复杂性和主观性,它具有挑战性。现有的工作主要集中于建模两个项目之间的共购买关系,但是项目收集的组成关联在很大程度上没有探索。实际上,当用户选择购买产品的互补项目时,除了全球印象外,她还会考虑视觉语义连贯性(例如颜色搭配,纹理兼容性)。为此,在本文中,我们提出了一个新颖的内容专注的神经网络(CANN),以模拟全球内容和语义内容的全面构图相干性。具体而言,我们首先提出了基于多头的注意力,以建模全局组成相干性,提出\ textit {全局相干学习}(GCL)模块。然后,我们从不同语义区域生成语义 - 焦点表示,并设计\ textit {焦点连贯学习}(FCL)模块,以从不同的语义 - 核表示。最后,我们在新颖的组成优化策略中优化了CANN。与几种最先进的方法相比,大规模现实世界数据的广泛实验清楚地表明了CANN的有效性。
Complementary recommendations, which aim at providing users product suggestions that are supplementary and compatible with their obtained items, have become a hot topic in both academia and industry in recent years. %However, it is challenging due to its complexity and subjectivity. Existing work mainly focused on modeling the co-purchased relations between two items, but the compositional associations of item collections are largely unexplored. Actually, when a user chooses the complementary items for the purchased products, it is intuitive that she will consider the visual semantic coherence (such as color collocations, texture compatibilities) in addition to global impressions. Towards this end, in this paper, we propose a novel Content Attentive Neural Network (CANN) to model the comprehensive compositional coherence on both global contents and semantic contents. Specifically, we first propose a \textit{Global Coherence Learning} (GCL) module based on multi-heads attention to model the global compositional coherence. Then, we generate the semantic-focal representations from different semantic regions and design a \textit{Focal Coherence Learning} (FCL) module to learn the focal compositional coherence from different semantic-focal representations. Finally, we optimize the CANN in a novel compositional optimization strategy. Extensive experiments on the large-scale real-world data clearly demonstrate the effectiveness of CANN compared with several state-of-the-art methods.