Discussion about this post

User's avatar
Neural Foundry's avatar

FP8 GRPO running on consumer RTX cards is a gamechanger for anyone doing RL work at home. Getting that down to 5GB of VRAM for Qwen3 means you can actualy experiment without needing datacenter hardware. The 60% VRAM savings make this feasible for way more people to test ideas localy.

Thibaut's avatar

Thank you for everything that you bring to the community! Do you have plans to support Nvidia Jetson Thor devices on top of DGX Spark? It would be greatly appreciated 🙏🏻

3 more comments...

No posts

Ready for more?