论文标题
余弦模型与合奏蒸馏的水印
Cosine Model Watermarking Against Ensemble Distillation
论文作者
论文摘要
已经开发了许多模型水印方法,以防止有价值的部署商业模型被模型蒸馏偷偷偷偷摸摸。但是,大多数现有模型水印方法产生的水印可以通过集成蒸馏很容易避免,因为平均多个结合模型的输出可以显着减少甚至删除水印。在本文中,我们专注于应对防御集成蒸馏的具有挑战性的任务。我们提出了一种名为COSWM的新型水印技术,以实现针对整体蒸馏的出色模型水印性能。 CoSWM不仅在设计方面优雅,而且还具有理想的理论保证。我们对公共数据集的广泛实验表明,COSWM的出色表现及其优势在最先进的基准中。
Many model watermarking methods have been developed to prevent valuable deployed commercial models from being stealthily stolen by model distillations. However, watermarks produced by most existing model watermarking methods can be easily evaded by ensemble distillation, because averaging the outputs of multiple ensembled models can significantly reduce or even erase the watermarks. In this paper, we focus on tackling the challenging task of defending against ensemble distillation. We propose a novel watermarking technique named CosWM to achieve outstanding model watermarking performance against ensemble distillation. CosWM is not only elegant in design, but also comes with desirable theoretical guarantees. Our extensive experiments on public data sets demonstrate the excellent performance of CosWM and its advantages over the state-of-the-art baselines.