Title : Text-to-Image Synthesis
link : Text-to-Image Synthesis
Text-to-Image Synthesis
Novel photo-real images generated by an adversarial network of computers based solely on a written prompt, without human intervention or photo cues. Low resolution version on top row iterated to higher res on bottom row. via Olivier Grisel on Twitter |
We've seen systems that can (links to previous posts):
• Re-render a photo in the style of any artist.
• Identify faces and objects, no matter the lighting, angle, or context.
• Render a photo in a painterly way that puts more detail in psychologically salient areas.
• Paint a generalized portrait that's typical of the style of Rembrandt.
• Generate captions for images describing at a higher level what's going on in a given photo.
• Analyze the abstract elements of a target image and then locate other abstractly similar images.
Despite these advances, most of us human picture-makers can still pride ourselves in our unique ability to create a photo-real image based purely on a written description.
Suppose, for example, you were asked to paint a picture of "a small bird with a pink breast and crown, and black primaries and secondaries." Could you do it? And could you render your picture so believably that someone else might mistake if for a real photo?
Computer generated images courtesy CreativeAI.net |
Computers are figuring this out, and they're starting to get good at it. Scientists are approaching the problem of text-to-image synthesis by means of a deep-learning technique called "generative adversarial networks" or GANs for short.
This GAN strategy pits two separate computer networks against each other. The goal of the Generator one is to create images that fit the text prompt, and the goal of the Discriminator is to distinguish synthetic images from real ones.
As the Generator tries to create images to fool the Discriminator, it gets harder, because the Discriminator keeps learning, too. Exactly what the computer "knows" about the structure of form or the aptness of illustrative problem-solving is hard to say because it wasn't taught by a human; it figured it out on its own, in its own way.
Related video: Image Synthesis From Text With Deep Learning
The resulting images are not an average of existing photos. Rather they're completely novel creations.
Furthermore, GAN image synthesizers can be used to create not only real-world images, but also completely original surreal images based on prompts such as: “an anthropomorphic cuckoo clock is taking a morning walk to the pastry market.”
How good are these synthetic illustrations?
So far the images are small (about 64 x 64 pixels) and for the most part, they still won't fool any humans. But watch out: you're just seeing just baby steps.
GANs currently do pretty well generating plausible pictures of birds and flowers, but they have limited success with complex scenes involving human figures, or generalized text prompts such as "a picture of a very clean living room."
They're a bit garbled and incoherent at the moment, but they will develop rapidly. In a few years, advanced A.I. image-creating tools that can illustrate any text prompt in any style will be available cheaply to art buyers everywhere.
• A scholarly PDF: Generative Adversarial Text to Image Synthesis
• Related scientific paper about texture synthesis: Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks
• More images at Creative AI
Thus Article Text-to-Image Synthesis
That's an article Text-to-Image Synthesis This time, hopefully can give benefits to all of you. well, see you in posting other articles.
You are now reading the article Text-to-Image Synthesis with the link address https://idealbodys.blogspot.com/2017/05/text-to-image-synthesis.html
0 Response to "Text-to-Image Synthesis"
Post a Comment