Trending
Content tagged with "reinforcement-learning"
Hacker News
Top stories from the Hacker News community• Updated 13 minutes ago
InfoQ
Latest articles from InfoQ• Updated 4 minutes ago
OpenAI at QCon AI NYC: Fine Tuning the Enterprise
At QCon AI NYC 2025, Will Hang from OpenAI unveiled Agent RFT—a cutting-edge reinforcement fine-tuning approach for tool-using agents. By optimizing prompts and tasks before model adjustments, Hang showcased effective strategies to enhance decision-making and efficiency, emphasizing a balanced grading system. The session revealed a future where smarter agents reduce latency and improve outcomes. By Andrew Hoblitzell
Top posts from tech subreddits• Updated 4 minutes ago
A new autonomous fighter jet just broke cover. It's powered by the same AI brain that flew an F-16 through a dogfight.
[P] SDLArch-RL: Multi-Console Gaming Environment for Reinforcement Learning Research
Hugging Face Trending
Popular models from Hugging Face• Updated 40 minutes ago
GitHub Trending
Popular repositories from GitHub• Updated about 1 hour ago
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
qlib
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)