Trending
Content tagged with "reinforcement-learning"
Hacker News
Top stories from the Hacker News community• Updated less than a minute ago
Top posts from tech subreddits• Updated 9 minutes ago
[D] GSPO: Qwen3’s sequence-level RLHF method vs. GRPO - stability & scaling analysis
Researchers instructed AIs to make money, so the AIs just colluded to rig the markets
Hugging Face Trending
Popular models from Hugging Face• Updated 28 minutes ago
GitHub Trending
Popular repositories from GitHub• Updated 42 minutes ago
qlib
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)