The ultimate guide to RL environments: building and scaling them in the LLM era • Building and scaling RL environments for LLM training • Running • 155
unsloth/NVIDIA-Nemotron-3-Nano-Omni-30B-A3B-Reasoning Text Generation • 33B • Updated 16 days ago • 1.71k • 14
DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking Image-Text-to-Text • 40B • Updated 14 days ago • 4.96k • 36
AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-NVFP4 Text Generation • 19B • Updated 13 days ago • 34.8k • 53