Paper title

A very preliminary analysis of DALL-E 2

Authors

Gary Marcus, Ernest Davis, Scott Aaronson

Abstract

The DALL-E 2 system generates original synthetic images corresponding to an input text as caption. We report here on the outcome of fourteen tests of this system designed to assess its common sense, reasoning and ability to understand complex texts. All of our prompts were intentionally much more challenging than the typical ones that have been showcased in recent weeks. Nevertheless, for 5 out of the 14 prompts, at least one of the ten images fully satisfied our requests. On the other hand, on no prompt did all of the ten images satisfy our requests.
