High quality-tune Google Gemma with Unsloth and Distilled DPO on Your Computer

-

Following Hugging Face’s Zephyr recipe

Generated with DALL-E

Finding good training hyperparameters for brand spanking new LLMs is all the time difficult and time-consuming. With Zephyr Gemma 7B, Hugging Face seems to have found a great recipe for fine-tuning Gemma. They used a mixture of distilled supervised fine-tuning and DPO just like what they did for…

ASK ANA

What are your thoughts on this topic?
Let us know in the comments below.

0 0 votes
Article Rating
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments

Share this article

Recent posts

0
Would love your thoughts, please comment.x
()
x