OpenAI's new AI Reinforcement Fine-Tuning could transform how scientists use its models

The second day of OpenAI's 12 Days of OpenAI brought less spectacular, more enterprise-focused news than day one's general rollout of the OpenAI o1 model to ChatGPT.

Instead, OpenAI announced plans to release Reinforcement Fine-Tuning (RFT), a way for developers to customize its AI models for specific kinds of tasks, especially more complex ones. The focus is squarely on enterprise applications rather than day one’s consumer-focused updates. You can think of RFT as a method for improving how AI models reason their way to a response. A developer supplies a dataset and an evaluation rubric, and OpenAI’s platform uses them to train a specialized model without requiring large amounts of expensive reinforcement later on.
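
To make the dataset-plus-rubric idea more concrete, here is a minimal Python sketch. It is an illustration only and not OpenAI's API: the names `Example`, `rubric_score`, and `average_score`, the placeholder data, and the ranked-answer scoring rule are all assumptions made for this example.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Example:
    """One hypothetical dataset item: a prompt plus its known correct answer."""
    prompt: str
    correct_answer: str


def rubric_score(ranked_answers: list[str], correct_answer: str) -> float:
    """Hypothetical rubric: full credit if the correct answer is ranked first,
    less credit the further down it appears, and zero if it is missing."""
    target = correct_answer.strip().lower()
    for position, answer in enumerate(ranked_answers):
        if answer.strip().lower() == target:
            return 1.0 / (position + 1)
    return 0.0


def average_score(dataset: list[Example], model_outputs: list[list[str]]) -> float:
    """Average the rubric score over the dataset (0.0 for an empty dataset)."""
    if not dataset:
        return 0.0
    total = sum(
        rubric_score(output, example.correct_answer)
        for example, output in zip(dataset, model_outputs)
    )
    return total / len(dataset)


# Placeholder data, invented purely for illustration.
dataset = [
    Example(prompt="Which option fits case 1?", correct_answer="option_a"),
    Example(prompt="Which option fits case 2?", correct_answer="option_b"),
]
model_outputs = [
    ["option_a", "option_c"],  # correct answer ranked first  -> 1.0
    ["option_c", "option_b"],  # correct answer ranked second -> 0.5
]
print(average_score(dataset, model_outputs))  # 0.75
```

In a reinforcement-style setup, scores like these would act as the signal that nudges a model toward higher-scoring answers. How OpenAI's platform handles that step internally is not something this sketch attempts to show.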

RFT could be a boon for AI tools employed in law and science. In its live stream, OpenAI highlighted the CoCounsel AI assistant built with RFT by Thomson Reuters and showed how RFT helps researchers studying rare genetic diseases at Berkeley Lab. However, these business partnerships aren't going to make much difference in the short term for average users of ChatGPT or other OpenAI products.

Enterprise or consumer

If you're more keen on the consumer side of things, don't give up just yet. While the enterprise tilt contrasts with day one, it's easy to imagine OpenAI wanting as broad a range of news as possible across the 12 days. There will almost certainly be plenty more consumer news to come, perhaps on alternating days or in some other pattern.

Still, at least the ending joke from OpenAI was a little funnier than yesterday's. The AI described how self-driving vehicles are popular in San Francisco, and Santa is keen to make a self-driving sleigh as part of the trend. The problem is that it keeps hitting trees. Why? He didn't pine-tune his models. Maybe the image ChatGPT made for TechRadar's Editor-at-Large Lance Ulanoff will sell the humor better.

ChatGPT visualizing an OpenAI joke told during Day 2 of 12 Days of OpenAI.

(Image credit: ChatGPT)

Eric Hal Schwartz
Contributor

Eric Hal Schwartz is a freelance writer for TechRadar with more than 15 years of experience covering the intersection of the world and technology. For the last five years, he served as head writer for Voicebot.ai and was on the leading edge of reporting on generative AI and large language models. He's since become an expert on the products of generative AI models, such as OpenAI’s ChatGPT, Anthropic’s Claude, Google Gemini, and every other synthetic media tool. His experience runs the gamut of media, including print, digital, broadcast, and live events. Now, he's continuing to tell the stories people want and need to hear about the rapidly evolving AI space and its impact on their lives. Eric is based in New York City.
