OpenAI’s new AI image generator pushes the limits in detail and prompt fidelity

 In AI, AI ethics, AI image generators, Biz & IT, ChatGPT, chatgtp, dall-e, DALL-E 2, DALL-E 3, image synthesis, machine learning, OpenAI, Stable Diffusion, Tech

OpenAI’s new AI image generator pushes the limits in detail and prompt fidelity

Serving the Technologist for more than a decade. IT news, reviews, and analysis.
A series of images generated using OpenAI's DALL-E 3 image synthesis model.


On Wednesday, OpenAI announced DALL-E 3, the latest version of its AI image synthesis model that features full integration with ChatGPT. DALL-E 3 renders images by closely following complex descriptions and handling in-image text generation (such as labels and signs), which challenged earlier models. Currently in research preview, it will be available to ChatGPT Plus and Enterprise customers in early October.

Like its predecessor, DALLE-3 is a text-to-image generator that creates novel images based on written descriptions called prompts. Although OpenAI released no technical details about DALL-E 3, the AI model at the heart of previous versions of DALL-E was trained on millions of images created by human artists and photographers, some of them licensed from stock websites like Shutterstock. It’s likely DALL-E 3 follows this same formula, but with new training techniques and more computational training time.

Judging by the samples provided by OpenAI on its promotional blog, DALL-E 3 appears to be a radically more capable image synthesis model than anything else available in terms of following prompts. While OpenAI’s examples have been cherry-picked for their effectiveness, they appear to follow the prompt instructions faithfully and convincingly render objects with minimal deformations. Compared to DALL-E 2, OpenAI says that DALL-E 3 refines small details like hands more effectively, creating engaging images by default with “no hacks or prompt engineering required.”

Read 10 remaining paragraphs | Comments

With better response to details and text, DALL-E 3 hopes to make prompt engineering obsolete.

Recent Posts
Contact Us

We're not around right now. But you can send us an email and we'll get back to you, asap.

Not readable? Change text. captcha txt