Blog
Case studies
Company
Careers
Framework
Arena
Blog
GRPO and evolutionary HPO
Combining GRPO with evolutionary hyperparameter optimization to squeeze the most out of small LLMs