A Guide to Generative AI in China: Tools, Use Cases, and Regulations
Created by Clark Boyd (and Baidu)

A Guide to Generative AI in China: Tools, Use Cases, and Regulations

As you'll know by now, generative artificial intelligence (AI) has the ability to generate new, original content based on a given prompt. We have all had our fun with DALL-E and ChatGPT, haven't we?

In this article, we will do something a little different. Intrigued by this great article in TechCrunch, I started using some of China's generative AI tools.

Below, I'll share what I've learned so far.

The quick summary:

  • Baidu and Tencent offer text-to-image tools that generate images based on a text prompt, just like DALL-E 2.
  • Tencent's tool, Different Dimension Me, turns any photo of a person into an anime character, but access has been blocked to foreigners due to a surge in traffic.
  • Baidu's tool, ERNIE-ViLG, is slightly less effective than DALL-E 2 in my scientific test.
  • People in China are using these tools to create content for social media and make money through ad revenues.
  • China has already regulated the use of generative AI, with the Cyberspace Administration of China banning the creation or distribution of "unhealthy or distorted" content and requiring companies to report any illegal use of AI to authorities.


First, let’s assess the main tools.

How can you try Chinese generative AI?

ERNIE-ViLG: This is a Baidu text-to-image demo and it’s the one I’ve spent most time trialling.

Different Dimension Me: A tool from Tencent that turns any photo of a person into an anime character.

Unfortunately, Tencent has blocked access to foreigners after a surge in traffic last week.

Taiyi: This text-to-image tool comes from the IDEA lab, led by the former head of Microsoft’s research lab in APAC.

How well does Chinese generative AI work?

Let’s try the Baidu tool with this prompt first:

纽约市一只吃冰淇淋的马耳他狗,数字艺术

Or, if you for some reason do not speak Chinese:

A maltese dog eating an ice cream in New York City, digital art

Here’s what Baidu gave me:

No alt text provided for this image
Baidu's attempt

They’re good, they’re cute, and the third one looks like he could be in a Wall Street boardroom.

The second one could even be a baddie in one of the Spiderman movies, which are famously set in Manhattan. He’s scheming to turn the sprinklers on any time a cat enters his garden.

However, they aren’t really eating the ice cream and the NYC references are oblique, at the very best.

Baidu Score: B-


Room for improvement, then.

Let’s try DALL-E 2 with the same prompt:

No alt text provided for this image
DALL-E has a go. Tremendous!

It nails it. Cute, eating the ice cream with aplomb, and it’s unmistakeably NYC.

DALL-E Score: A


But maybe my prompt is the problem? Perhaps my Chinese (or Google Translate’s Chinese…) is not as good as I thought?

Well, let’s try another one:

躲在小巷垃圾桶里的可爱怪物

(A cute monster hiding in a garbage can in an alley)

Here is the Baidu effort:

No alt text provided for this image
Getting better...

This is better, no? The monsters are cute, they’re definitely hiding, and that’s an alley if you ask me. They do lack a certain expressive elan in their faces, however.

Baidu Score: A-


And now DALL-E:

No alt text provided for this image
Varying quality

There is greater variation here, which I can appreciate. The fourth one is perhaps more maniacal than cute, but he’s a monster after all. I chose this one to challenge them.

The third one is “in” the bin in the sense that he has merged with it like some sort of nightmareish, “Binman” supervillain.

DALL-E Score: A-


How are people in China using generative AI?

On the whole, they are doing what everyone else is doing. They’re making cool and/or insane content and sharing it on social media.

I also stumbled upon a post by a Chinese blogger on Weixin (WeChat, basically) who was sharing ways to make money with generative AI. They posted some of the images that had generated the greatest interaction and, therefore, money.

No alt text provided for this image
Not to my taste

The user boasts of making hundreds of dollars per post by sharing carousels of AI-generated images like the one above. They broke down the ad revenues one can expect for a post, based on the number of views, likes, and comments, along with tips for writing prompts in the generative AI interfaces.

As a result, one can imagine Chinese social media might soon be flooded with these images.


Is China planning to regulate generative AI?

It already did.

In fact, way back in February 2022 (which is aeons ago in AI terms now) China released a series of restrictions for these then-upcoming technologies.

See, China’s AI regulation is centralised and proactive. The country has initiated a number of new departments to oversee the development and use of AI.

The Cyberspace Administration of China (CAC) already banned the publishing and distribution of "fake news" created by an AI, too. This doesn’t work (the onus is on publishers to stem the flow), but the intention is there.

The new legislation states that:

Media that is created without the subject's permission is also forbidden, as well as anything that goes against the national interest.

The “national interest”. Indeed. 🤨

It defines generative AI as:

"technology that uses deep learning, virtual reality, and other synthesis algorithms to generate text, images, audio, video, and virtual scenes."

That means generative AI tools are already censored in China. There is a lengthy list of keywords that one simply cannot use.

If China’s generative AI tools can only respond to certain prompts and their results are restricted, that must also dictate the training data they use.

I referenced Different Dimension Me earlier in this post. It takes an image of a person and turns it into an anime character. It was very popular in Latin America, but users there quickly found out that if they entered an image of a black person the tool returned highly offensive results. Of course, the tool was only trained using images of Asian people.

We see just how quickly these technologies become political tools of control.

How is the US approaching generative AI legislation?

Joe Biden’s government has just released its non-binding “Blueprint for an AI Bill of Rights”.

It has five general, well-meaning principles:

No alt text provided for this image

The Blueprint is a step in the right direction and it asks all the right questions, but it has no teeth. These are advisory principles that companies should consider, not mandatory rules.

Although the US has been slow to move on national legislation, it has imposed some significant sanctions internationally.

The chips are down in China

See, China will have to develop its own computer chips to power the next age of AI. The US has placed a ban on the export of high-end chips to China, as of September 2022.

Baidu does not believe this will hinder its progress in the immediate short term, as its technology does not use the latest chips. Its head of AI also says they already have enough Nvidia chips on hand to see them through.

In the medium- and long-term, Baidu expects to have access to Chinese chips that will match or better what the US develops. It says it already runs large language models using its own Kunlun chips, too.

In truth, no-one knows if China will indeed develop world-leading chips and it would hardly serve Baidu’s interests to be pessimistic. It has no choice, anyway.

What impact will this have on generative AI?

I’d expect the above to have a significant impact on how generative AI evolves in China and elsewhere.

Within China’s borders, it will be trained on restricted data and its use cases will be closely monitored.

The strange thing is that these very restrictions can be an aid to creativity. In the West, we have a general sense that this technology could revolutionise “everything”. In China, the enforced constraints can help narrow the focus to more specific, everyday uses for AI.

Nonetheless, it is impossible to ignore the conclusion that the centralised political culture in China will act as a handbrake on innovation. I have compared China with the US to provide a point of comparison, but the reality is that US researchers collaborate freely with peers from around the world.

This will naturally lead to improvements in the technologies developed by US companies, as will the free flow of data across international borders. The Western world also has an advantage in the development of advanced processing.

No government has figured out the best way to regulate new technologies. It is important for regulators to strike a balance between promoting progress and protecting citizens.

China is understandably cautious, but this approach may stymie innovation in a domain that could deliver huge benefits.

Although China will not want to be left behind, its early interventions in generative AI mean it is unlikely to take the lead in this field.

Anders Hjorth

Retail Media Consultant @ EPAM Systems | Digital Marketing & Retail Media Expert

1y

loved it - In this month of Fanjuary, I am 😉

Like
Reply

To view or add a comment, sign in

Insights from the community

Others also viewed

Explore topics