💯Alibaba's Qwen2.5 Releases 100 Models, ByteDance's New AI Music Generator, and China's New Draft Rule on AI Labeling
Weekly China AI News from September 9, 2024 to September 22, 2024
Hi, this is Tony! Welcome to this week’s issue of Recode China AI, a newsletter for China’s trending AI news and papers. Also wishing a happy Mid-Autumn Festival to all my readers and your family and friends! May the year ahead bring you abundance, good health, and happiness. 🥮🌝🥮
Three things to know
Alibaba Releases Latest Qwen2.5 Foundation Models with 100 Variants
What’s New: Last week Alibaba released its latest foundation models, Qwen2.5. This new lineup includes 100 models, featuring Qwen2.5 LLMs and specialized variants for coding and mathematics, trained in various sizes and precision. Alibaba said that it could be one of the largest open-source AI releases to date.
Although no eye-popping innovation was introduced, the models have been improved to follow different instructions, handle structured data, and generate long text. In benchmark language tests, Qwen2.5 72B often matches or surpasses Meta’s largest model, Llama 3.1 405B.
How it Works: Qwen2.5 models are dense, decoder-only language models trained on a 18 trillion token dataset. This gives them a significant advantage in terms of general knowledge, understanding, and versatility. Here’s how the models work across various functionalities:
Why it Matters: The release of Qwen2.5 is set to solidify Alibaba and Qwen’s strong position in China’s open AI ecosystem. According to Yuan Jinhui, founder of China’s open-source deep learning framework OneFlow, Qwen is one of the most widely adopted open-source models in China, largely due to its range of model sizes that cater to diverse demands. In contrast, Llama 3.1 has seen limited popularity in China because it lacks a fine-tuned version optimized for the Chinese language.
ByteDance’s New AI Model Generates Music from Lyrics and Edits Melodies
What’s New: Last week, ByteDance researchers introduced Seed-Music, an AI framework that allows users to generate and edit high-quality music. The system combines auto-regressive language modeling, commonly used in LLMs like GPT, with diffusion models, widely applied in image and video generation. It supports both vocal and instrumental music creation from inputs such as lyrics, style prompts, and audio references.
How It Works: Seed-Music integrates three core music representations: audio tokens, symbolic music tokens (or lead sheets), and vocoder latents, each designed for specific creative workflows. Here’s how they work:
Seed-Music enables applications like Lyrics2Song, which generates complete tracks from lyrics, and MusicEDiT, allowing precise adjustments to existing songs (demos below).
Recommended by LinkedIn
Why It Matters: Seed-Music’s unified framework will enable applications like Lyrics2Song, which generates complete tracks from lyrics, and MusicEDiT, allowing precise adjustments to existing songs.
China's New Draft Rule on Labeling AI-Generated Content
What’s New: On September 14, 2024, China’s Cyberspace Administration (CAC) released a draft regulation requiring clear labeling for AI-generated content. Public feedback is open until October 14. The regulation, called the AI-Generated Synthetic Content Labeling Measures 人工智能生成合成内容标识办法(征求意见稿), targets providers of AI-generated text, images, audio, and video to enhance transparency. This draft builds on laws such as the Cybersecurity Law and AI Service Management Provisions.
How It Works: The draft mandates two types of labels: explicit and implicit.
Platforms are required to verify that AI content includes the necessary labels before it can be shared. They must also notify users when AI-generated material is detected, even if explicit labels are absent.
Why It Matters: As AI technologies like deepfakes gain traction, governments are ramping up efforts to prevent the spread of misleading content. China’s new draft rule aims to increase transparency by marking AI-generated material, addressing a growing concern that fake AI content could blur the lines between reality and fabrication.
While some platforms like Xiaohongshu, Bilibili, Weibo, Douyin, and Kuaishou have implemented AI content declarations, the draft regulation introduces a unified standard for all. However, implementing these rules could be costly, especially for smaller firms. AI watermarking technology is still in its infancy and faces reliability issues. Tech giants such as Google have already acknowledged that even advanced solutions, like its SynthID, are vulnerable to attacks.
Weekly News Roundup
Trending Research
Head - custom content & int'l @ MIT
3moGreat newsletter, Tony Peng!
A.I. Writer, researcher and curator - full-time Newsletter publication manager.
3moWhat is China's answer to Suno AI?