Pushing the capabilities of Gemma 3 via distillation and RL fine-tuning | Learn programming by watching videos

Pushing the capabilities of Gemma 3 via distillation and RL fine-tuning

youtube

Pushing the capabilities of Gemma 3 via distillation and RL fine-tuning

Specialized capabilities (e.g. math abilities, coding, multilinguality, tool use...) are key areas of improvement in post-training. In this talk we explore a novel strategy involving large-scale distillation and RL finetuning to push specialized capabilities in LMs while still improving their generality. Subscribe to Google for Developers → Speakers: Johan Ferret Products Mentioned: Gemma

2025/04/02 youtube

最近投稿されたプログラミング学習動画

Flutter Firebase Cloud Messaging (FCM) | Send & Handle Push Notifications (2025)

Flutter Firebase Cloud Messaging (FCM) | Send & Handle Push Notificati

### **Flutter Firebase Cloud Messaging (...

2025/04/04

Flutter Firebase Firestore Tutorial | How to Update & Delete Data From Firestore 🔥(2025)

Flutter Firebase Firestore Tutorial | How to Update & Delete Data From

Welcome to Part 2 of our Flutter Firebas...

2025/04/03

Let’s rewind it back to Google IO ‘24

Let’s rewind it back to Google IO ‘24

Get ready for #GoogleIO May 20-21, where...

2025/04/03

Architecting for Multi-Cloud: AWS and Beyond with PwC

Architecting for Multi-Cloud: AWS and Beyond with PwC

This webinar explores the fundamentals o...

2025/04/03

Salesforce Trailhead Tutorial | Salesforce Trailhead Explained - how to Use It | Edureka

Salesforce Trailhead Tutorial | Salesforce Trailhead Explained - how t

🔥 Salesforce Training Course: Admin & Ap...

2025/04/03

Flutter Firebase Firestore Tutorial | How to Add & Read Data From Firestore 🔥(2025)

Flutter Firebase Firestore Tutorial | How to Add & Read Data From Fire

Welcome to Part 1 of our Flutter Firebas...

2025/04/03

Can you beat our time solving the green world? #GoogleIO

Can you beat our time solving the green world? #GoogleIO

Think you’ve mastered the #GoogleIO puzz...

2025/04/03

Flutter Firebase Authentication | Flutter Firebase Phone Number OTP Authentication (2025)

Flutter Firebase Authentication | Flutter Firebase Phone Number OTP Au

Flutter Firebase Authentication || Phone...

2025/04/02

Pushing the capabilities of Gemma 3 via distillation and RL fine-tuning

Pushing the capabilities of Gemma 3 via distillation and RL fine-tunin

Specialized capabilities (e.g. math abil...

2025/04/02

Welcome to the Gemmaverse

Welcome to the Gemmaverse

The Gemma family of open models keeps ev...

2025/04/02

Gemma on mobile and web. Best and worst practices

Gemma on mobile and web. Best and worst practices

Come learn new methods and best practice...

2025/04/02

ShieldGemma 2 – Developing safe and responsible AI for images

ShieldGemma 2 – Developing safe and responsible AI for images

Responsible AI development in multimodal...

2025/04/02

Agentic AI vs Generative AI | Agentic AI vs Generative AI Explained | AI Model Types | Edureka

Agentic AI vs Generative AI | Agentic AI vs Generative AI Explained |

🔥Generative AI Course: Masters Program: ...

2025/04/02

Flutter Firebase Authentication | Email & Password Auth | Flutter & Firebase (2025)

Flutter Firebase Authentication | Email & Password Auth | Flutter & Fi

### **Flutter Firebase Authentication | ...

2025/04/02

Modern Observability and Event Driven Architectures - Martin Thwaites & Ian Cooper - NDC London 2025

Modern Observability and Event Driven Architectures - Martin Thwaites

This talk was recorded at NDC London in ...

2025/04/02

Advanced Cloud Native Development with .NET Aspire - Scott Hunter & Maddy Montaquila

Advanced Cloud Native Development with .NET Aspire - Scott Hunter & Ma

This talk was recorded at NDC London in ...

2025/04/02