Trending

Content tagged with "reinforcement-learning"

Hacker News

Top stories from the Hacker News community • Updated 7 minutes ago

OpenAI Misled You on RLHF

aerial-toothpaste-34a.notion.site

Deep Agents

blog.langchain.com

Reddit

Top posts from tech subreddits • Updated 1 minute ago

Hugging Face Trending

Popular models from Hugging Face • Updated 19 minutes ago

GitHub Trending

Popular repositories from GitHub • Updated 33 minutes ago

qlib

Qlib is an AI-oriented quantitative investment platform that aims to use AI to empower quant research, from exploring ideas to deploying them in production. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and now integrates with https://github.com/microsoft/RD-Agent to automate the R&D process.

mujoco

Multi-Joint dynamics with Contact. A general-purpose physics simulator.

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

cleanrl

High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)