Fine-tune Google Gemma with Unsloth and Distilled DPO on Your Computer

March 19, 2024

Following Hugging Face’s Zephyr recipe

Finding good training hyperparameters for new LLMs is always difficult and time-consuming. With Zephyr Gemma 7B, Hugging Face seems to have found a good recipe for fine-tuning Gemma. They combined distilled supervised fine-tuning with DPO, similar to what they did for…
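For readers new to DPO, here is a minimal sketch of the per-example DPO loss in plain Python. It is only an illustration of the objective, not the code used for Zephyr Gemma 7B or by Unsloth. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions chosen for this example. The inputs are summed log-probabilities of the chosen and rejected responses under the policy being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss from sequence log-probabilities.

    beta=0.1 is an illustrative default, not a value taken from the article.
    """
    # How much more (or less) likely each response is under the policy
    # than under the frozen reference model.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin: positive when the policy favors the chosen
    # response more strongly than the reference model does.
    margin = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(margin)) written as softplus(-margin) for numerical stability.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


# Policy identical to the reference: the margin is 0, so the loss is log(2).
print(dpo_loss(-1.0, -1.0, -1.0, -1.0))

# Policy raised the chosen response and lowered the rejected one: lower loss.
print(dpo_loss(-1.0, -3.0, -2.0, -2.0))
```

In practice, libraries such as TRL compute these log-probabilities from the model and average the loss over a batch; the sketch above only shows the arithmetic for a single preference pair.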

computer
Distilled
DPO
Finetune
Gemma
Google
Unsloth
