论文标题
Kyrix-S:大数据的创作可扩展散点图可视化
Kyrix-S: Authoring Scalable Scatterplot Visualizations of Big Data
论文作者
论文摘要
静态散点图通常会在对象重叠会导致不良视觉混乱的大数据集上遭受透支问题。在散点图中使用缩放可以帮助减轻此问题。有了多个缩放水平,可以提供更多屏幕房地产,从而使对象以较少拥挤的方式放置。我们称这种类型的可视化可伸缩散点图可视化或简称SSV。尽管有SSV的潜力,但由于三个限制,现有系统和工具包在支持SSV的作者方面缺乏。首先,假设数据适合一台计算机的内存,许多系统的可扩展性有限。其次,需要过多的开发人员工作,例如,使用自定义代码来生成标记布局或渲染对象。第三,许多系统仅关注SSV设计空间的一小部分(例如,支持特定类型的视觉标记)。为了解决这些局限性,我们开发了Kyrix-S,这是一种用于SSV在大规模上的系统的系统。 Kyrix-S得出了一种声明性的语法,该语法可以根据现有的散点图任务和设计的调查,以几十行代码中的各种SSV进行规范。声明性的语法由分布式布局算法支持,该算法自动将视觉标记放在缩放级别上。我们将数据存储在多节点数据库中,并使用多节点空间索引来实现大型SSV的交互式浏览。广泛的实验表明,1)Kyrix-S可以使数十亿个对象的SSV进行交互式浏览,而响应时间低于500ms和2)Kyrix-S与最先进的作者相比,规格的降低了4x-9x。
Static scatterplots often suffer from the overdraw problem on big datasets where object overlap causes undesirable visual clutter. The use of zooming in scatterplots can help alleviate this problem. With multiple zoom levels, more screen real estate is available, allowing objects to be placed in a less crowded way. We call this type of visualization scalable scatterplot visualizations, or SSV for short. Despite the potential of SSVs, existing systems and toolkits fall short in supporting the authoring of SSVs due to three limitations. First, many systems have limited scalability, assuming that data fits in the memory of one computer. Second, too much developer work, e.g., using custom code to generate mark layouts or render objects, is required. Third, many systems focus on only a small subset of the SSV design space (e.g. supporting a specific type of visual marks). To address these limitations, we have developed Kyrix-S, a system for easy authoring of SSVs at scale. Kyrix-S derives a declarative grammar that enables specification of a variety of SSVs in a few tens of lines of code, based on an existing survey of scatterplot tasks and designs. The declarative grammar is supported by a distributed layout algorithm which automatically places visual marks onto zoom levels. We store data in a multi-node database and use multi-node spatial indexes to achieve interactive browsing of large SSVs. Extensive experiments show that 1) Kyrix-S enables interactive browsing of SSVs of billions of objects, with response times under 500ms and 2) Kyrix-S achieves 4X-9X reduction in specification compared to a state-of-the-art authoring system.