Matthew Kane’s Post

View profile for Matthew Kane, graphic

Analytics VP | Thought Leader | Ad Agency Professional

The AI machines that we've become accustomed to via large language models from places like OpenAI and Anthropic require an incredible amount of data to train them. For the past decade or so, this data has been easily accessible - for free - on the open web. Not so much anymore, as content creators and publishers have become wise to companies whose raison d'etre is to eventually replace the humans making (or, at the very least, squeeze the human creativity out of) the very content they're using to train their models. This has led to the rise of synthetic data, but an overreliance on this method could lead to model collapse - just ask the Spanish Hapsburgs. We round out the week asking what the MTA actually did at the Grand Central 4/5/6 entrance all summer and find out where all of George Strait's exes live. This week's newsletter is live! If you like the (100% human-created) writing I put out, think about subscribing to get this in your inbox every Tuesday!

AI Is Running Out of Data

AI Is Running Out of Data

tdnbw.com

To view or add a comment, sign in

Explore topics