Conversation

Replying to
1) We've added support for Ray Tune, and worked with to create programmatic reports from your hparam sweeps 📊 2) We've added PPO normalization among other regularization methods, improves convergence and stability. 2/5
1
9
3) We've refactored the loss functions to allow for alternative backends, like Megatron NeMo (WIP) and T5X (planned) 👨‍💻 4) PPO-Hydra support for OPT and NeoX 🦑😎 (Merging later today) 3/4
1
4