论文标题

多模式驱动程序引用:指向车内和外部对象的指向

Multimodal Driver Referencing: A Comparison of Pointing to Objects Inside and Outside the Vehicle

论文作者

Aftab, Abdul Rafey, von der Beeck, Michael

论文摘要

先进的卡宾内传感技术,尤其是基于视觉的方法,在车辆内部取得了巨大进步,为自然用户互动的新应用铺平了道路。正如人类使用多种模式相互交流一样,我们遵循一种方法,其特征是同时使用多种模态来实现特定任务的自然人机相互作用:指向或朝内部的物体以及在车辆外部瞥见Deictic Reforence的物体。通过跟踪眼睛注视,头和手指的运动,我们使用深层神经网络设计了多模式融合体系结构,以精确识别驾驶员的引用意图。此外,我们将语音命令用作触发器来分开每个引用事件。我们观察到两个指向用例(即内部和外部对象)中驾驶员行为的差异,尤其是在分析三种方式眼睛,头部和手指的精确性时。我们得出的结论是,没有单一的模态完全适用于所有情况,因为每种方式都揭示了某些局限性。多种模态的融合利用了每种模态的相关特征,因此克服了每个单个模式的案例依赖性局限性。最终,我们提出了一种基于预测的指向方向,驾驶员的引用对象是否位于车辆内部或外部。

Advanced in-cabin sensing technologies, especially vision based approaches, have tremendously progressed user interaction inside the vehicle, paving the way for new applications of natural user interaction. Just as humans use multiple modes to communicate with each other, we follow an approach which is characterized by simultaneously using multiple modalities to achieve natural human-machine interaction for a specific task: pointing to or glancing towards objects inside as well as outside the vehicle for deictic references. By tracking the movements of eye-gaze, head and finger, we design a multimodal fusion architecture using a deep neural network to precisely identify the driver's referencing intent. Additionally, we use a speech command as a trigger to separate each referencing event. We observe differences in driver behavior in the two pointing use cases (i.e. for inside and outside objects), especially when analyzing the preciseness of the three modalities eye, head, and finger. We conclude that there is no single modality that is solely optimal for all cases as each modality reveals certain limitations. Fusion of multiple modalities exploits the relevant characteristics of each modality, hence overcoming the case dependent limitations of each individual modality. Ultimately, we propose a method to identity whether the driver's referenced object lies inside or outside the vehicle, based on the predicted pointing direction.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源