Reinforcement Fine-Tuning in Production

Get expert answers on RFT, GRPO, and reward design.

This webinars is hosted by AI experts behind the course “Reinforcement Fine-Tuning LLMs with GRPO” for Deeplearning.AI!

Reinforcement Fine-Tuning (RFT) is no longer experimental. It’s powering real-world GenAI systems with tighter feedback loops and better reasoning.

But how do you actually make it work in production?

Watch Let’s Talk Tokens, a 1-hour AMA with the engineers behind the definitive RFT course from DeepLearning.AI.

What we’ll cover:

When RFT outperforms supervised fine-tuning (and when it doesn’t)
How GRPO enables scalable reward-based training
Reward design strategies that avoid mode collapse & reward hacking
Best practices for evaluating RFT-trained models in production

If you’re fine-tuning models or planning to, this session is built for you.

No fluff. Just real answers from real engineers who’ve done it.

AI Model Fine-Tuning LLM

Watch Now:

How many models in production do you have today? *

Our content sponsor, Rubrik, would like to contact you in the future by email or phone to provide you information and news about Rubrik products, services and events. Check this box if you are happy to receive these communications. You can change your mind at any time to stop receiving such emails and/or calls. See the Rubrik Privacy Policy for more information.

Reinforcement Fine-Tuning in Production

Watch Now:

Stay up to date with us

Useful Links

Contact

Subscribe for more insights