At QCon AI NYC 2025, Will Hang from OpenAI unveiled Agent RFT—a cutting-edge reinforcement fine-tuning approach for tool-using agents. By optimizing prompts and tasks before model adjustments, Hang showcased effective strategies to enhance decision-making and efficiency, emphasizing a balanced grading system. The session revealed a future where smarter agents reduce latency and improve outcomes. By Andrew Hoblitzell

infoq.com

Andrew Hoblitzell

1 day ago

QCon Software Development Conference ai reinforcement-learning model-serving mlops model-tuning

Top posts from tech subreddits• Updated 6 minutes ago

[Week 4] Making Your Agent Smarter: 3 Designs That Beat Common Limits

reddit.com

Useful-Bad8331

3 months ago

r/AI_Agents ai reinforcement-learning mlops

[D] RL interviews at frontier labs, any tips?

reddit.com

bci-hacker

3 months ago

r/MachineLearning interviews ai ai-ethics interview-preparation reinforcement-learning career-advice interview-tips interview-prep ai-research tech-interviews research

[R] New "Illusion" Paper Just Dropped For Long Horizon Agents

reddit.com

viciousA3gis

3 months ago

r/MachineLearning ai reinforcement-learning deep-learning mlops research

Interesting take on how to use Agents to unlock alternate gameplay styles

reddit.com

Silkutz

3 months ago

r/AI_Agents ai reinforcement-learning game-engines mlops nlp generative-ai

[P] I Trained an AI to play Donkey Kong Country Stop and Go Station

youtube.com

AgeOfEmpires4AOE4

3 months ago

r/MachineLearning ai reinforcement-learning generative-ai

Anyone else feel like the hardest part of agents is just getting them to do stuff reliably?

reddit.com

rafaelchuck

4 months ago

r/AI_Agents ai agent reinforcement-learning mlops

[P] Training environment for PS2 game RL

reddit.com

AgeOfEmpires4AOE4

4 months ago

r/MachineLearning emulation gaming game-ai reinforcement-learning game-development training game-engines

AI researcher Andrej Karpathy says he's "bearish on reinforcement learning" for LLM training

the-decoder.com

rtbot2

4 months ago

r/realtech ai llm reinforcement-learning ai-research

[P] Training environment for RL of PS2 and other OpenGL games

reddit.com

AgeOfEmpires4AOE4

4 months ago

r/MachineLearning reinforcement-learning game-development computer-vision game-engines open-source openai

1 2417

Hugging Face Trending

Popular models from Hugging Face• Updated 24 minutes ago

RL ReSearch

DR-Tulu-8B

724

reinforcement-learning ai-research deep-learning

Xiaomi MiMo

MiMo-7B-RL

245

5,976

ai reinforcement-learning hardware

GitHub Trending

Popular repositories from GitHub• Updated 39 minutes ago

DLR-RM

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python

12,326

2,016

reinforcement-learning pytorch python deep-learning ai mlops

vwxyzjn

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python

8,485

922

reinforcement-learning python deep-learning ai

facebookresearch

habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python

2,660

596

python nlp ai deep-learning robotics reinforcement-learning

microsoft

qlib

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.

Python

33,321

5,132

ai data-science reinforcement-learning mlops quantitative-finance quantitative-investment quantitative-investing deep-learning quantitative-research model-serving