OpenAI's new AI Reinforcement Fine-Tuning could transform how scientists use its models


The second day of OpenAI's 12 Days of OpenAI turned to less spectacular, more enterprise-focused news than day one's general rollout of the OpenAI o1 model to ChatGPT.

Instead, OpenAI announced plans to release Reinforcement Fine-Tuning (RFT), a way for developers to customize its AI models for specific kinds of tasks, especially more complex ones. This release marks a clear shift toward enterprise applications compared to day one’s consumer-focused updates. You can think of RFT as a method for improving how AI models reason their way to a response. A developer supplies a dataset and an evaluation rubric, and OpenAI’s platform uses them to train a specialized model without requiring lots of expensive reinforcement from later experience.
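To make the "dataset plus evaluation rubric" idea a little more concrete, here is a minimal Python sketch of what a rubric might look like. It is purely illustrative: the function names, the scoring rule, and the sample data are all assumptions made for this example, not OpenAI's actual RFT interface or grading method.

```python
from typing import List, Tuple


def grade(ranked_answers: List[str], correct: str) -> float:
    """Hypothetical rubric: full credit if the correct answer is ranked
    first, less credit the lower it appears, and zero if it is missing."""
    for position, answer in enumerate(ranked_answers):
        if answer == correct:
            return 1.0 / (position + 1)
    return 0.0


def average_score(examples: List[Tuple[List[str], str]]) -> float:
    """Average the rubric score over (model_output, correct_answer) pairs."""
    if not examples:
        return 0.0
    return sum(grade(output, correct) for output, correct in examples) / len(examples)


# Made-up sample data: each pair is a model's ranked guesses and the right answer.
samples = [
    (["gene_a", "gene_b"], "gene_a"),  # correct answer ranked first
    (["gene_c", "gene_d"], "gene_d"),  # correct answer ranked second
    (["gene_e"], "gene_f"),            # correct answer missing
]
print(average_score(samples))
```

In a setup like this, the score gives the training process a signal about which lines of reasoning led to better answers, which is the general idea the announcement describes.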

RFT could be a boon for AI tools employed in law and science. In its live stream, OpenAI highlighted the CoCounsel AI assistant built with RFT by Thomson Reuters and how RFT helps researchers studying rare genetic diseases at Berkeley Lab. However, the business partnerships aren't going to make much difference in the short term for average users of ChatGPT or other OpenAI products.

Enterprise or consumer

If you're keener on the consumer side of things, don't give up just yet. While the enterprise tilt contrasts with day one, it's easy to imagine OpenAI wanting as broad a range of news as possible during the 12 days. There will almost certainly be plenty more consumer news to come, perhaps on alternating days or in some other pattern.

Still, at least the ending joke from OpenAI was a little funnier than yesterday's. The AI described how self-driving vehicles are popular in San Francisco, and Santa is keen to make a self-driving sleigh as part of the trend. The problem is that it keeps hitting trees. Why? He didn't pine-tune his models. Maybe the image ChatGPT made for TechRadar's Editor-at-Large Lance Ulanoff will sell the humor better.

ChatGPT visualizing an OpenAI joke told during Day 2 of 12 Days of OpenAI.

(Image credit: ChatGPT)

Eric Hal Schwartz
Contributor

Eric Hal Schwartz is a freelance writer for TechRadar with more than 15 years of experience covering the intersection of the world and technology. For the last five years, he served as head writer for Voicebot.ai and was on the leading edge of reporting on generative AI and large language models. He's since become an expert on the products of generative AI models, such as OpenAI’s ChatGPT, Anthropic’s Claude, Google Gemini, and every other synthetic media tool. His experience runs the gamut of media, including print, digital, broadcast, and live events. Now, he's continuing to tell the stories people want and need to hear about the rapidly evolving AI space and its impact on their lives. Eric is based in New York City.
