OpenAI’s new AI image generator pushes the limits in detail and prompt fidelity

Voyager · 1 year ago

OpenAI’s new AI image generator pushes the limits in detail and prompt fidelity

@thehatfox@lemmy.world · 1 year ago

I wish more people realised this. It’s much harder to create very specific images with the current image generation tools than most people seem to think, which is creating an inaccurate view of the technology in the public eye.

The generator will create something inspired by the prompt it is given, but it can be very hard to make it match the output the prompt writer imagines when writing the prompt. There are various tools that can refine and narrow the generator’s output, to try and control things like posing, composition, style etc and to redraw details. But even then it’s often pot luck as to the output. The generated images aren’t necessarily bad, just not what was wanted.

I think the comparison to stock photo images is apt, current image generators are great for creating themed but somewhat generic images. The tools are going to continue to advance, and they are useful in for some applications already. But they are still a long way off from truly replacing human artistry.

@lloram239@feddit.de · edit-2 11 months ago

deleted by creator