人群计数的相关区域预测

论文标题

人群计数的相关区域预测

Relevant Region Prediction for Crowd Counting

论文作者

Chen, Xinya, Bin, Yanrui, Gao, Changxin, Sang, Nong, Tang, Hao

论文摘要

人群计数是计算机视觉中关注且具有挑战性的任务。现有的密度图方法过度专注于个人的本地化，这会损害人群在高度拥挤的场景中的表现。另外，也忽略了不同密度区域之间的依赖性。在本文中，我们提出了有关人群计数的相关区域预测（RRP），该预测由计数图和区域关系感知模块（RRAM）组成。计数图中的每个像素代表输入图像中落入相应局部区域的头部数量，这丢弃了详细的空间信息，并且迫使网络更多地关注计数而不是本地化个体。基于图形卷积网络（GCN），提出了区域关系感知模块以捕获和利用重要区域依赖性。该模块在不同密度的区域之间建立一个完全连接的有向图，其中每个节点（区域）由加权全局汇总特征表示，并且学会了GCN将该区域图映射到一组关系感知的区域表示。三个数据集的实验结果表明，我们的方法显然优于其他现有最新方法。

Crowd counting is a concerned and challenging task in computer vision. Existing density map based methods excessively focus on the individuals' localization which harms the crowd counting performance in highly congested scenes. In addition, the dependency between the regions of different density is also ignored. In this paper, we propose Relevant Region Prediction (RRP) for crowd counting, which consists of the Count Map and the Region Relation-Aware Module (RRAM). Each pixel in the count map represents the number of heads falling into the corresponding local area in the input image, which discards the detailed spatial information and forces the network pay more attention to counting rather than localizing individuals. Based on the Graph Convolutional Network (GCN), Region Relation-Aware Module is proposed to capture and exploit the important region dependency. The module builds a fully connected directed graph between the regions of different density where each node (region) is represented by weighted global pooled feature, and GCN is learned to map this region graph to a set of relation-aware regions representations. Experimental results on three datasets show that our method obviously outperforms other existing state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题