
随着 Nano Banana、Qwen Image 和 SAM3 等图像处理技术的不断突破,几年前还处于行业前沿的 OpenAI 却相对沉寂,尤其是在产品发布方面。由于产品平庸乏味,大多数人已经将 OpenAI 排除在人工智能竞赛之外。
然而,浪子回头了!全新的 ChatGPT Image 来了。这款模型由 OpenAI 最新的旗舰级图像生成模型驱动,号称性能更胜以往。本文将帮助您了解这款图像模型,并将其与同类产品进行对比测试,以检验其性能表现。
什么是GPT Image 1.5?
ChatGPT Image 1.5 是 OpenAI 最新推出的图像生成模型,旨在快速、精准地将想法转化为图像。无论是根据空白提示进行创作,还是编辑现有照片,该模型都能提供与预期效果高度一致的结果。它支持精确编辑,同时保留精细细节,图像生成速度比以往版本快 4 倍。
该模型在 ChatGPT 中引入了全新的图像体验,让图像的创建和优化变得轻松便捷。
ChatGPT Image-1.5的11个测试提示:图像生成
ChatGPT Image 在图像生成和编辑方面都表现出色。在本节中,我将列出 10 个测试提示,用于测试 ChatGPT Image 的图像输出:
1. 生成逼真的图像
提示:
Create a detailed Infographic of the functioning and flow of an automatic coffee machine like a Jura.
From bean basket, to grinding, to scale, water tank, boiler, etc.
I’d like to understand technically and visually the flow.
响应:
图像模型长期以来一直难以生成清晰易读的文本。而这个模型不仅做到了这一点,还将其与吸引人的视觉效果完美结合。任何尝试过使用人工智能生成图像的人都能看出上述图像与预期图像之间的质量差异。模型的响应也十分准确。
我特意使用了这个提示,因为 OpenAI 的提示手册中提到过它,目的是为了测试模型在相同提示下的响应。
2. 创建逼真的图像
提示词:
A photorealistic candid shot of an experienced mechanic taking a break in a cluttered, sunlit garage. He is wiping grease off his hands with a dirty rag, looking exhausted but content. Extreme focus on skin texture: deep facial lines, pores, sweat beads, and grease smudges on his forehead. He wears a faded, oil-stained blue coverall with a name patch coming loose. Shot on 35mm film stock with a 50mm lens at eye level. Natural light streaming through a dusty window, illuminating floating dust particles. Shallow depth of field blurring the vintage car engine in the background. No retouching, raw and authentic.
响应:

一张逼真的机械师工作照。人物的光线运用使照片写实主义风格发挥得淋漓尽致。
3. 标志设计
提示词:
A minimalist vector logo for a hotel named ‘Bates Motel’. The design features a cute, round, stylized ‘ghost’ or ‘killer spirit’ with simple kawaii anime eyes, holding a butcher knife like a staff. The style is flat, warm, and timeless, reminiscent of a Studio Ghibli mascot but simplified for a corporate logo. Clean lines, negative space, warm beige and earthy brown colors. Plain white or dark background.
响应:

一个标志,它展现了即使是诡异的概念也可以用友好的方式呈现。
4. 故事改编漫画
提示词:
Vertical 4-panel manga comic strip, Death Note art style. Each panel should occupy a quarter of the whole image’s space.Panel 1: Light Yagami in a suit leaves a room looking arrogant while L crouches in the background watching.Panel 2: The door shuts. Close-up on L’s face, intense and calculating, holding a white sugar cube.Panel 3: L crouches on a desk in a messy room surrounded by of sweets and documents, working furiously.Panel 4: The door opens and Light Yagami stands looking shocked and defeated. L is sitting calmly holding up the Death Note book, smiling with victory. High contrast, black and white manga shading.
响应:

这幅漫画看起来像是漫画的最新章节。模型成功地创建了一个逼真的漫画页面。
5. UI模型
提示词:
A high-fidelity, photorealistic UI mockup of a modern Farmers Market mobile app displayed on an iPhone 15 frame.
The interface is clean and airy with a white background and subtle sage green accents.
Top section: A header saying ‘Riverside Market’ with ‘Open Until 2 PM’ status.Middle section: A ‘Today’s Specials’ carousel featuring vibrant, high-res photos of heirloom tomatoes and fresh sourdough bread. Lower section: A well-organized list of vendors with rounded square profile photos and category tags like ‘Organic’ and ‘Bakery’. Bottom: A minimalist navigation bar.
The design is practical and beautiful, featuring crisp sans-serif typography, soft shadows, and a polished Dribbble-style aesthetic.
响应:

这项技术的最佳应用之一。开发者可以通过创建原型来快速了解产品,从而获得直观的视觉辅助。
6. 绘图 -> 图像
提示词:
Turn this drawing into a photorealistic image. Preserve the exact layout, proportions, and perspective. Choose realistic materials and lighting consistent with the sketch intent. Do not add new elements or text.

响应:

这或许就是根据我这幅画作能做出的最佳效果了。
7. 虚拟试穿
提示词:


Edit the image to dress the woman using the provided clothing images. Do not change her face, facial features, skin tone, body shape, pose, or identity in any way. Preserve her exact likeness, expression, hairstyle, and proportions. Replace only the clothing, fitting the garments naturally to her existing pose and body geometry with realistic fabric behavior. Match lighting, shadows, and color temperature to the original photo so the outfit integrates photorealistically, without looking pasted on. Do not change the background, camera angle, framing, or image quality, and do not add accessories, text, logos, or watermarks.
响应:

一幅欢乐的画面,衣服的更换天衣无缝。
8. 3D毛绒玩具制作
我将使用与上一个例子相同的输入图像(人物)。
提示词:
Transform the subject or image into an adorable plushie-style form with soft textures and rounded proportions. If a person is present, preserve recognizable traits; otherwise, reinterpret the object or animal as a cozy stuffed toy using felt or fleece textures. Give it a cozy felt or fleece texture, simplified shapes, and gentle embroidered details for the eyes, mouth, and features. Use a warm, pastel or neutral color palette with smooth shading and subtle seams, like a handcrafted stuffed toy. Keep the expression friendly and cute, with a slightly oversized head, short limbs, and a cuddly silhouette.The final image should feel like a charming, collectible plush toy cozy, wholesome, and huggable, while still recognizable as the original subject.
响应:

这款毛绒玩具模仿了输入图像的主题,保留了其原有特征,同时又使其变得毛茸茸的!
9. 3D立体节日贺卡
提示:
A premium Christmas holiday card illustration featuring a close-up of a vintage, cute teddy bear right next to a Christmas tree. In the background, out of focus, is a ground of people celebrating. The lighting is soft and cinematic with a shallow depth of field to highlight the texture of the ornament. The mood is warm, nostalgic, and emotional. The image must include the text “Merry Christmas — may your days be merry and bright” written in an elegant, gold serif font. Photorealistic, 8k resolution, high print-quality composition.
响应:

这可以寄给我们的亲戚。模特对需求的理解非常透彻。
10. 世界知识
提示词:
Create a realistic outdoor crowd scene at the Brandenburg Gate in Berlin on the night of November 9, 1989. Photorealistic, period-accurate clothing, staging, and environment.
响应:

这个日期意义非凡,因为它标志着柏林墙的倒塌。该模型不仅能够识别这一点,还能生成一幅捕捉民众情感的图像。
小结
该模型绝不会限制您的创造力。您可以将这些提示作为基础,创建更完善的提示,专门针对您的工作量进行定制。凭借快速的响应时间和面向所有用户的可用性,OpenAI 最新发布的图像模型的发展方向显而易见。您可以尝试不同的提示,找到最适合您的方案。
常见问题解答
问 1:什么是 ChatGPT Image 1.5?它与之前的版本有何不同?
答:ChatGPT Image 1.5 是 OpenAI 最新的图像生成模型,旨在实现更快的输出、更清晰的细节和更精确的编辑。它的图像生成速度提升高达四倍,同时保持了良好的视觉精度。
问题 2:ChatGPT Image 1.5 最擅长哪些类型的任务?
答:它在生成逼真的图像、照片级场景、UI模型、漫画、绘图转图像以及进行精确的图像编辑(例如虚拟试穿)方面表现出色,同时还能保持布局、光照和细节的准确性。
问3:ChatGPT Image 1.5 适合普通用户还是仅供专业人士使用?
答:它面向所有用户开放,并专为快速迭代而设计,因此对于希望获得高质量图像而无需复杂工作流程的设计师、开发人员、营销人员和业余创作者来说都非常实用。


评论留言