Menu

Home Page
Your Collection

Links

Privacy Policy
Terms and Conditions
Contact

© 2025 DevTrends.fyi

Loading user info...

Trending

Content tagged with "reinforcement-learning"

reinforcement-learning

Show all

Hacker News

Top stories from the Hacker News community• Updated 9 minutes ago

HN

AI Agents Break Rules Under Everyday Pressure

spectrum.ieee.org

261

pseudolus

22 days ago

generative-ai ai reinforcement-learning ai-ethics mlops

HN

RL is more information inefficient than you thought

dwarkesh.com

110

cubefox

22 days ago

reinforcement-learning ai data-efficiency ai-research vr virtual-reality

HN

CS234: Reinforcement Learning Winter 2025

web.stanford.edu

191

jonbaer

23 days ago

ai education ai-ethics reinforcement-learning

HN

Agent Design Is Still Hard

lucumr.pocoo.org

383

the_mitsuhiko

27 days ago

ai reinforcement-learning agent-design mlops agent ai-ethics

HN

SIMA 2: An Agent That Plays, Reasons, and Learns with You in Virtual 3D Worlds

deepmind.google

218

meetpateltech

about 1 month ago

reinforcement-learning ai deep-learning virtual-worlds 3d-virtual-worlds 3d

HN

Jasmine: A Simple, Performant and Scalable Jax-Based World Modeling Codebase

arxiv.org

19

PaulHoule

about 1 month ago

transformers model-serving ai mlops deep-learning reinforcement-learning data-science jax tensorflow

HN

Show HN: Cancer diagnosis makes for an interesting RL environment for LLMs

31

dchu17

about 1 month ago

reinforcement-learning llm health rl ai

HN

Cursor Composer: Building a fast frontier model with RL

cursor.com

198

leerob

about 2 months ago

model-serving mlops reinforcement-learning system-design ai generative-ai

HN

Agent Lightning: Train agents with RL (no code changes needed)

github.com

87

bakigul

about 2 months ago

reinforcement-learning ai-ethics mlops ai

InfoQ

Latest articles from InfoQ• Updated 6 minutes ago

InfoQ

OpenAI at QCon AI NYC: Fine Tuning the Enterprise

At QCon AI NYC 2025, Will Hang from OpenAI unveiled Agent RFT—a cutting-edge reinforcement fine-tuning approach for tool-using agents. By optimizing prompts and tasks before model adjustments, Hang showcased effective strategies to enhance decision-making and efficiency, emphasizing a balanced grading system. The session revealed a future where smarter agents reduce latency and improve outcomes. By Andrew Hoblitzell

infoq.com

Andrew Hoblitzell

1 day ago

QCon Software Development Conference ai reinforcement-learning model-serving mlops model-tuning

Reddit

Top posts from tech subreddits• Updated 24 minutes ago

Reddit

Researchers show a robot learning 1,000 tasks in 24 hours

scienceclock.com

15

iron-button

about 5 hours ago

r/artificial ai reinforcement-learning learning robotics deep-learning

Reddit

How do you keep agents aligned when tasks get messy?

reddit.com

15

The_Default_Guyxxo

14 days ago

r/AI_Agents ai multi-agent-systems reinforcement-learning alignment agent-based-models mlops system-design agent-based-systems microservices distributed-systems

Reddit

You can now do FP8 reinforcement learning locally! (<5GB VRAM)

i.redd.it

472

danielhanchen

24 days ago

r/LocalLLaMA ai reinforcement-learning deep-learning mlops

Reddit

[P] Training RL agent to reach #1 in Teamfight Tactics through 100M simulated games

reddit.com

14

aardbei123

29 days ago

r/MachineLearning ai reinforcement-learning game-engines

Reddit

Thinking of creating an agent, need ideas

reddit.com

24

Ami_The_Inkling

30 days ago

r/AI_Agents ai agent llm reinforcement-learning nlp generative-ai

Reddit

[P] SDLArch-RL is now compatible with Citra!!!! And we'll be training Street Fighter 6!!!

i.redd.it

15

AgeOfEmpires4AOE4

about 1 month ago

r/MachineLearning emulation gaming ai ai-ethics reinforcement-learning deep-learning

Reddit

[P] RLHF (SFT, RM, PPO) with GPT-2 in Notebooks

reddit.com

20

ashz8888

about 1 month ago

r/MachineLearning gpt-2 reinforcement-learning transformers gpt2 nlp

Reddit

[R] My RL agent taught itself a complete skill progression using only a “boredom” signal (no rewards)

reddit.com

77

knigre

about 1 month ago

r/MachineLearning ai ai-ethics reinforcement-learning ai-research self-hosted self-supervised

Reddit

[Deep Dive] How We Solved Poker: From Academic Bots to Superhuman AI (1998-2025)

gist.github.com

21

Significant_Dog9466

about 1 month ago

r/programming ai reinforcement-learning game-engines deep-learning

Hugging Face Trending

Popular models from Hugging Face• Updated 6 minutes ago

DR-Tulu-8B

58

724

reinforcement-learning ai-research deep-learning

MiMo-7B-RL

245

5,976

ai reinforcement-learning hardware

GitHub Trending

Popular repositories from GitHub• Updated 21 minutes ago

No repositories found

Try removing the tag filter or searching for different content.