论文标题

在线旅行社搜索的反馈聚类:案例研究

Feedback Clustering for Online Travel Agencies Searches: a Case Study

论文作者

Scaramuccia, Sara, Nanty, Simon, Masseglia, Florent

论文摘要

了解在线客户执行的选择是旅游行业越来越多的需求。在许多实际情况下,唯一可用的信息是客户执行的飞行搜索查询,而没有其他个人资料知识。通常,客户航班预订是由价格,持续时间,连接数等驱动的。但是,并非所有客户都可能对每个标准都具有相同的重要性。需要将所有类型的客户执行的所有飞行搜索分组在一起,即具有相同的预订标准。对于单个集群,可以根据历史上执行的预订数来衡量某些建议的有效性。这种有效性度量起反馈的作用,即外部知识,可以重新组合到迭代地获得最终的分割。在本文中,我们描述了我们的在线旅行社(OTA)飞行搜索用例,并突出显示其特定功能。我们通过提出一种称为Split-Or-Merge(S/M)的新型算法来解决上述飞行搜索细分问题。该算法是拆分过程(SME)方法的变体。 SME方法已经在社区中引入了一个迭代过程,该过程通过分裂和合并群集对反馈独立的评估来更新K-Means算法给出的聚类。据我们所知,在文献中没有将SME方法的先前应用在现实词数据中的应用。在这里,我们向中小型企业和S/M方法提供了对现实世界数据的实验评估。对中小型企业和S/M方法获得的对我们特定领域的指标的影响表明,在处理OTA飞行搜索域的处理中,反馈聚类技术可能非常有前途。

Understanding choices performed by online customers is a growing need in the travel industry. In many practical situations, the only available information is the flight search query performed by the customer with no additional profile knowledge. In general, customer flight bookings are driven by prices, duration, number of connections, and so on. However, not all customers might assign the same importance to each of those criteria. Here comes the need of grouping together all flight searches performed by the same kind of customer, that is having the same booking criteria. The effectiveness of some set of recommendations, for a single cluster, can be measured in terms of the number of bookings historically performed. This effectiveness measure plays the role of a feedback, that is an external knowledge which can be recombined to iteratively obtain a final segmentation. In this paper, we describe our Online Travel Agencies (OTA) flight search use case and highlight its specific features. We address the flight search segmentation problem motivated above by proposing a novel algorithm called Split-or-Merge (S/M). This algorithm is a variation of the Split-Merge-Evolve (SME) method. The SME method has already been introduced in the community as an iterative process updating a clustering given by the K-means algorithm by splitting and merging clusters subject to feedback independent evaluations. No previous application of the SME method to the real-word data is reported in literature to the best of our knowledge. Here, we provide experimental evaluations over real-world data to the SME and the S/M methods. The impact on our domain-specific metrics obtained under the SME and the S/M methods suggests that feedback clustering techniques can be very promising in the handling of the domain of OTA flight searches.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源