AI-Generated Art Still Needs a Human Touch

Dall-E 2, Midjourney, and Stable Diffusion produce impressive images on command, but perfecting them requires patient, skilled tending.

Originally published on Worth.

A quartet of dragons materializes before my eyes. They each have golden skin, intricate scales and spikes, and prodigious fangs. Some face right, some left. One pair has three horns on its head, another pair has two. One has a horn on its snout.

Generated by Marshall Smith, via Midjourney

And they were conjured by a spell—not in Elvish or Elder, but in plain English. I’ll share it with you: “Chinese dragon made from glossy reflective gold, with oversized details, ultra-realistic 3D render, rim lighting, warm light, cool shadows, soft ambient occlusion, digital painting, 8K HDR.”

With just those words, and about 10 seconds to think, an AI chatbot called Midjourney paints four digital images—each a unique interpretation of that description. Repeat the spell, and you’ll get four more variations. And again, and again.

A Fundamental Shift

What ChatGPT does for writing, Midjourney does for images. And it’s been doing it longer. In summer 2022, it burst into the graphics world along with several other so-called generative AI apps, including Dall-E 2 (by ChatGPT’s maker OpenAI) and the open-source (free to use) program Stable Diffusion.

“Dall-E 2 is certainly the first time people who are not following [this technology], were like, ‘Oh, wow, this is something,’” says Marshall Smith, a veteran video game designer who worked on pop culture sensations like Zynga’s FarmVille and Words with Friends.

That’s when these apps crossed the uncanny valley, going from creepily inept to appealing and even inspiring for creators. Detailed, vivid images that had required experienced designers with sophisticated software to realize could now emerge from mere words.

But is it art?

That’s not just a philosophical question. It’s a business and even legal consideration.

Impressive as Midjourney’s dragons might be to a casual viewer, none of them are ready to go straight into a video game. Getting there will take several rounds of dialogue with the AI—in fact, with several AIs—as well as pulling in traditional tools like Adobe Photoshop.

“It’s an iteration with some of these things,” says Smith. “So I think, ‘Oh, these are cool, but this is not at all what I want.’”

Generative AI makes it easier to talk to computers, but (so far) they still can’t read minds. Getting from a rough idea to a professional artwork with artificial intelligence requires a lot of human intelligence. Let’s walk through how that process could go with the golden dragon.

How We Created This Dragon

Step 1: Ideate

Smith’s current employer, Big Run Studios, has just developed a new mobile slot machine game called Blackout Slots. Though it’s been finished, Smith takes me through how he might create components for it from scratch using a host of generative AI tools and traditional apps.

We start with one that probably everyone knows: OpenAI’s ChatGPT. “List top 20 slot machine themes,” he types. Almost instantly, ChatGPT names and describes a score of options, including Egyptian, Fruit, Jungle Adventure, and Chinese Culture. For the final one, it says, “These games often have symbols like dragons, lanterns, and coins.”

Going with that, Smith asks the chatbot to brainstorm a hierarchy of symbols with different values in the game. They included “Lantern,” “Diamond-encrusted Lotus,” and, for the top “Jackpot” tier, “Golden Dragon.” He then instructs ChatGPT, “For each symbol, I need an image prompt. This is a literal visual description of the symbol image.” He also provides examples of terms that he knows will resonate with Midjourney, such as “ultra realistic 3D render,” “cool shadows,” “soft ambient occlusion,” and “digital painting.” ChatGPT cheerily creates a spreadsheet with prompts for eight symbols, including the golden dragon.
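The prompt-building step can be pictured as simple string assembly. What follows is only a minimal sketch in Python, not Smith's or ChatGPT's actual method: the helper name `build_image_prompt` is hypothetical, the style terms are the ones quoted above, and the lantern description is invented for illustration.

```python
# Hypothetical helper: joins a symbol description with shared style terms.
# The style terms are the ones quoted in the article; the function name and
# the lantern description are assumptions made for illustration only.
STYLE_TERMS = [
    "ultra-realistic 3D render",
    "cool shadows",
    "soft ambient occlusion",
    "digital painting",
]


def build_image_prompt(description, style_terms=STYLE_TERMS):
    """Return one comma-separated prompt string for a text-to-image tool."""
    parts = [description.strip()] + [term.strip() for term in style_terms]
    # Skip empty pieces so a blank description does not leave a stray comma.
    return ", ".join(part for part in parts if part)


symbols = {
    "Golden Dragon": "Chinese dragon made from glossy reflective gold",
    "Lantern": "red paper lantern",  # illustrative, not from the session
}

# One prompt per symbol, all sharing the same style vocabulary.
prompts = {name: build_image_prompt(desc) for name, desc in symbols.items()}

for name, prompt in prompts.items():
    print(f"{name}: {prompt}")
```

Keeping the style terms in one shared list is what gives a set of symbols a consistent look, which is presumably why Smith supplies them up front.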

Step 2: Iterate

Smith copies the image prompt from ChatGPT, pastes it into a messaging app called Discord—where Midjourney’s chatbot lives—and does not get a finished product. Instead, we see four low-resolution mockups to choose from.

Picking one, we can then remix it by clicking on a number of buttons below the image, for instance, specifying how much artistic freedom (stylization) Midjourney can take when interpreting our prompt. We can also modify the initial prompt text and re-run the whole process. The sky’s the limit here: 100-plus word prompts aren’t uncommon. But Smith is pretty happy with the latest iteration of the dragon, and simply removes the background scenery by adding the text “no background on white” to the prompt.

Generated by Marshall Smith, via Midjourney

Sometimes the dialogues are trickier, because words are open to interpretation—as Smith found while creating characters for a western-themed game. “I was talking about having a snow-capped mountain,” he says. “So, then the AI kind of grabbed onto the idea that it was going to be snowy. So, the character was now wearing a fur lining on his coat.” That’s not quite what Smith was thinking, so he had to finesse. “I had to split some ideas up a little bit [in the prompting]. So like, okay, well, the mountains are snowpack, but in the foreground, it’s a warm, sunny day in Montana,” he says.

This “prompt engineering” process has become an art form in itself, and it could have legal implications. In March, the U.S. Copyright Office issued guidance seeming to say that AI-generated output can’t be copyrighted because “users do not exercise ultimate creative control over how such systems interpret prompts and generate material.” But, quoting federal law on “compilation” artworks, it went on to say, “a human may select or arrange AI-generated material in a sufficiently creative way that ‘the resulting work as a whole constitutes an original work of authorship.’”

Would complex rounds of prompting qualify as a copyrightable compilation? “Whether prompts can receive copyright protection depends on the facts of the case, so it will likely be a case-by-case analysis instead of a general rule,” says Mehtab Khan, a resident fellow at Yale Law School who covers technology and intellectual property.

(According to Midjourney’s terms of service, users with a paid membership own the rights to their creations, but Midjourney also has the right to use and remix those creations.)

Step 3: De-pixelate

Once you’ve gotten the image as far as you think Midjourney can take it, the app can output a higher-resolution version. But it’s not that high-res, currently capped at 1024 by 1024 pixels. (As with all things in AI, that figure may change by the time you read this.)

That’s too low for Smith’s purposes, so he turns to another app, Photo AI from Topaz Labs.

(Left) Before de-pixelation, (Right) After de-pixelation, Courtesy of Marshall Smith

It ingests low-res images and reasons out what the missing details might be. Smith demonstrates this by dragging a slider across his original image to show how Photo AI refines it. Pixelated swathes of fur on the dragon’s head are transformed into rich, layered tufts of fine filaments. The app is not just smoothing out jagged lines, it’s creating entirely new features.

In the process, the dragon goes from a roughly one-megapixel pic to a more than 37-megapixel behemoth. This is a relatively quick step, but an essential one. Will this capability get incorporated into Midjourney or other apps? Very possible—maybe even by the time you read this. (It’s already offered by rival Stable Diffusion.)
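The jump in size is easy to check with a little arithmetic. The sketch below, in Python, assumes a 1024-by-1024 starting image (the cap mentioned above) and a 37-megapixel target. The article does not give the exact output dimensions, so the per-side scale factor is an inference, and the function names are made up for this example.

```python
import math


def megapixels(width, height):
    """Pixel count expressed in millions of pixels."""
    return width * height / 1_000_000


def linear_scale_for(target_mp, width, height):
    """Factor by which each side must grow to reach target_mp megapixels."""
    return math.sqrt(target_mp / megapixels(width, height))


start_mp = megapixels(1024, 1024)         # the "roughly one-megapixel" image
scale = linear_scale_for(37, 1024, 1024)  # growth needed per side

print(f"start: {start_mp:.2f} MP")
print(f"scale per side: {scale:.2f}x")
print(f"6144 x 6144 = {megapixels(6144, 6144):.1f} MP")
```

In other words, each side grows about sixfold, so roughly 35 of every 36 pixels in the final image are ones the upscaler had to infer.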

Step 4: Manually Create

An AI-generated and refined work would satisfy many creators and purposes. But for people with the technical skills, it’s still easier to do the final touches on their own than to cajole a machine to do it. And for some finer details, manual is still the only way.

So Smith moves his AI-created dragon into Adobe Photoshop, an app he’s been using for over two decades. Although AI is helping here, too.

To get the dragon ready to place on the digital slot machine, Smith first must cut it out from the background.

This has always been a core Photoshop capability, but getting it perfect required some manual tweaking.

There’s much less of that since Photoshop began incorporating generative AI in May. It’s now much better at recognizing the outline of an object—even the dragon’s intricate jumble of fur, scales, horns, and fangs. Cutting out the image is a one-click process for Smith (at least, sometimes, he says).

Photoshop is adding more-ambitious generative tools in the spirit of Midjourney, but these are still in the “beta” or experimental phase. To demonstrate, Smith tries adding a flame that emerges from the dragon’s mouth, typing in the prompt, “vibrant purple flame.” A cartoonish blaze appears, but the process also smooshes the dragon’s head and turns its eye purple.

But many of Photoshop’s traditional tools are still superior. Smith uses them to adjust color, for instance. “I don’t like my game art to have shadows that are black,” he says. “So it’s kind of cool to have something that has a purple shadow, but a yellow highlight.” Smith can also adjust contrast, lighting, and exposure. He can thicken parts of the image and do much, much more. “You definitely are doing the last steps in Photoshop,” he says.

Incorporating Photoshop further bolsters the case for copyright. “Information alone cannot be copyrighted but order, arrangement, presentation etc. may be creative enough to receive protection,” says Khan. “So, using Photoshop may help qualify a work for protection.”

Is There a Future For Artists?

Artificial intelligence still can’t completely replace a skilled artist for producing high-end work. But the tools keep improving. “They have been innovating like crazy,” says Smith about Midjourney, though that could apply to any of these apps. “They have new features all the time that have continued to drastically improve the product [with] higher resolution, higher fidelity.” And the level of sophistication generative AI has brought to two-dimensional imagery could someday—perhaps someday soon—come to 3D animation and even filmmaking.

It’s already replacing some mundane but lucrative jobs, such as creating seamless background textures, as in fabrics, wallpaper, or wrapping paper. They are essentially endless grids of repeating images, or tiles, and a lot of high-paid work goes into blending the boundaries between tiles to create a seamless look. Now apps like Midjourney can do it instantly.
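To see why the boundaries matter, here is a toy sketch in Python. It is not how Midjourney or any design tool works: the tiles are tiny grids of brightness values, and `seam_mismatch` is a crude, hypothetical measure of how sharply values jump where one copy of a tile meets the next.

```python
def tile_grid(tile, rows, cols):
    """Repeat a 2D tile (a list of rows of pixel values) rows x cols times."""
    return [row * cols for _ in range(rows) for row in tile]


def seam_mismatch(tile):
    """Average jump in value across the tile's wrap-around edges.

    When a tile repeats, its right edge sits next to its left edge and its
    bottom edge sits next to its top edge. Large jumps there show as seams.
    """
    height, width = len(tile), len(tile[0])
    horizontal = sum(abs(tile[y][width - 1] - tile[y][0]) for y in range(height))
    vertical = sum(abs(tile[height - 1][x] - tile[0][x]) for x in range(width))
    return (horizontal + vertical) / (height + width)


# A left-to-right gradient: bright right edge meets dark left edge.
gradient = [[0, 1, 2, 3] for _ in range(4)]
# A pattern that rises and falls, so its edges nearly match when repeated.
periodic = [[0, 1, 2, 1] for _ in range(4)]

print(seam_mismatch(gradient))
print(seam_mismatch(periodic))
print(len(tile_grid(periodic, 2, 3)), len(tile_grid(periodic, 2, 3)[0]))
```

The gradient tile scores higher because every repeat snaps from its brightest value back to its darkest. Smoothing out jumps like that is the blending work described above.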

There may also be less work for artists creating concept art. Instead, AI can generate oodles of mockups for designers to consider before commissioning an artist to create a high-end image. That, for instance, allows designers like Smith to add more features to games—even entire new characters—that they simply wouldn’t have time for in the past.

Whether in games, fabrics, or any other creations, the ultimate result of generative AI will be more artworks, but perhaps produced by fewer artists. And staying employed means staying on top of these fast-moving technologies so they boost a professional’s skills, rather than supersede them.
