Tech

OpenAI’s new AI Reinforcement Fine-Tuning could transform how scientists use its models

Share
Share

The second day of OpenAI’s 12 Days of OpenAI shifted to less spectacular, more enterprise interests compared to the general rollout of the OpenAI o1 model to ChatGPT on day one.

Instead, OpenAI announced plans to release Reinforcement Fine-Tuning (RFT), a way to customize its AI models for developers who want to adapt OpenAI’s algorithms for specific kinds of tasks, especially more complex ones. This release marks a clear shift toward enterprise applications compared to day one’s consumer-focused updates. You can think of RFT as a method for improving how AI models work through their reasoning for responses. Using a dataset and evaluation rubric from a developer lets OpenAI’s platform train their specialized AI without lots of expensive reinforcement from later experiences.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
When the school bell rings, the bandwidth drops: How post-15:40 internet surges affect UK broadband quality
Tech

When the school bell rings, the bandwidth drops: How post-15:40 internet surges affect UK broadband quality

Half of parents work after school, causing a broadband battle with streaming-addicted...

You can put Google Gemini right on your smartphone home screen – here’s how
Tech

You can put Google Gemini right on your smartphone home screen – here’s how

Google has launched Gemini home screen widgets for Android and iOS devices...

You can now fact check anybody’s post in WhatsApp – here’s how
Tech

You can now fact check anybody’s post in WhatsApp – here’s how

Perplexity AI’s new WhatsApp integration offers instant fact-checking without leaving the app...