Home Artificial Intelligence High quality-tune Google Gemma with Unsloth and Distilled DPO on Your Computer

High quality-tune Google Gemma with Unsloth and Distilled DPO on Your Computer

0
High quality-tune Google Gemma with Unsloth and Distilled DPO on Your Computer

Following Hugging Face’s Zephyr recipe

Generated with DALL-E

Finding good training hyperparameters for brand spanking new LLMs is all the time difficult and time-consuming. With Zephyr Gemma 7B, Hugging Face seems to have found a great recipe for fine-tuning Gemma. They used a mixture of distilled supervised fine-tuning and DPO just like what they did for…

LEAVE A REPLY

Please enter your comment!
Please enter your name here