The capability and performance of smaller, open large language models have advanced significantly in recent years. The field has progressed from early GPT-2 models to more compact, accurate, and effective LLM frameworks that make use of considerably more tokens than the “compute-optimal” number recommended by the Chinchilla scaling […] The post Zephyr: Direct Distillation of LLM Alignment appeared first on Unite.AI.