Trending

Content tagged with "reinforcement-learning"

reinforcement-learning

Hacker News

Top stories from the Hacker News community• Updated 14 minutes ago

InfoQ

Latest articles from InfoQ• Updated 5 minutes ago

InfoQ

OpenAI at QCon AI NYC: Fine Tuning the Enterprise

At QCon AI NYC 2025, Will Hang from OpenAI unveiled Agent RFT—a cutting-edge reinforcement fine-tuning approach for tool-using agents. By optimizing prompts and tasks before model adjustments, Hang showcased effective strategies to enhance decision-making and efficiency, emphasizing a balanced grading system. The session revealed a future where smarter agents reduce latency and improve outcomes. By Andrew Hoblitzell

infoq.com

Reddit

Top posts from tech subreddits• Updated about 2 hours ago

Hugging Face Trending

Popular models from Hugging Face• Updated 41 minutes ago

GitHub Trending

Popular repositories from GitHub• Updated about 1 hour ago

No repositories found

Try removing the tag filter or searching for different content.