NeMo-RL
Scalable RL post-training for language models.
Open-source reinforcement-learning framework from NVIDIA NeMo for post-training language models.

