© 2025 DevTrends.fyi

Trending

Content tagged with "reinforcement-learning"

Hacker News

Top stories from the Hacker News community • Updated 14 minutes ago

HN

AI Agents Break Rules Under Everyday Pressure

spectrum.ieee.org

261

pseudolus

22 days ago

generative-ai ai reinforcement-learning ai-ethics mlops

HN

RL is more information inefficient than you thought

dwarkesh.com

110

cubefox

22 days ago

reinforcement-learning ai data-efficiency ai-research vr virtual-reality

HN

CS234: Reinforcement Learning Winter 2025

web.stanford.edu

191

jonbaer

23 days ago

ai education ai-ethics reinforcement-learning

HN

Agent Design Is Still Hard

lucumr.pocoo.org

383

the_mitsuhiko

27 days ago

ai reinforcement-learning agent-design mlops agent ai-ethics

HN

SIMA 2: An Agent That Plays, Reasons, and Learns with You in Virtual 3D Worlds

deepmind.google

218

meetpateltech

about 1 month ago

reinforcement-learning ai deep-learning virtual-worlds 3d-virtual-worlds 3d

HN

Jasmine: A Simple, Performant and Scalable Jax-Based World Modeling Codebase

arxiv.org

19

PaulHoule

about 1 month ago

transformers model-serving ai mlops deep-learning reinforcement-learning data-science jax tensorflow

HN

Show HN: Cancer diagnosis makes for an interesting RL environment for LLMs

31

dchu17

about 1 month ago

reinforcement-learning llm health rl ai

HN

Cursor Composer: Building a fast frontier model with RL

cursor.com

198

leerob

about 2 months ago

model-serving mlops reinforcement-learning system-design ai generative-ai

HN

Agent Lightning: Train agents with RL (no code changes needed)

github.com

87

bakigul

about 2 months ago

reinforcement-learning ai-ethics mlops ai

InfoQ

Latest articles from InfoQ • Updated 5 minutes ago

InfoQ

OpenAI at QCon AI NYC: Fine Tuning the Enterprise

At QCon AI NYC 2025, Will Hang from OpenAI presented Agent RFT, a reinforcement fine-tuning approach for tool-using agents. Hang recommended optimizing prompts and tasks before adjusting the model, and emphasized a balanced grading system as a way to improve decision-making and efficiency. The session pointed toward smarter agents that reduce latency and improve outcomes.

infoq.com

Andrew Hoblitzell

1 day ago

QCon Software Development Conference ai reinforcement-learning model-serving mlops model-tuning

Reddit

Top posts from tech subreddits • Updated about 2 hours ago

Reddit

Researchers show a robot learning 1,000 tasks in 24 hours

scienceclock.com

15

iron-button

about 6 hours ago

r/artificial ai reinforcement-learning learning robotics deep-learning

Reddit

How do you keep agents aligned when tasks get messy?

reddit.com

15

The_Default_Guyxxo

14 days ago

r/AI_Agents ai multi-agent-systems reinforcement-learning alignment agent-based-models mlops system-design agent-based-systems microservices distributed-systems

Reddit

You can now do FP8 reinforcement learning locally! (<5GB VRAM)

i.redd.it

472

danielhanchen

24 days ago

r/LocalLLaMA ai reinforcement-learning deep-learning mlops

Reddit

[P] Training RL agent to reach #1 in Teamfight Tactics through 100M simulated games

reddit.com

14

aardbei123

29 days ago

r/MachineLearning ai reinforcement-learning game-engines

Reddit

Thinking of creating an agent, need ideas

reddit.com

24

Ami_The_Inkling

30 days ago

r/AI_Agents ai agent llm reinforcement-learning nlp generative-ai

Reddit

[P] SDLArch-RL is now compatible with Citra!!!! And we'll be training Street Fighter 6!!!

i.redd.it

15

AgeOfEmpires4AOE4

about 1 month ago

r/MachineLearning emulation gaming ai ai-ethics reinforcement-learning deep-learning

Reddit

[P] RLHF (SFT, RM, PPO) with GPT-2 in Notebooks

reddit.com

20

ashz8888

about 1 month ago

r/MachineLearning gpt-2 reinforcement-learning transformers gpt2 nlp

Reddit

[R] My RL agent taught itself a complete skill progression using only a “boredom” signal (no rewards)

reddit.com

77

knigre

about 1 month ago

r/MachineLearning ai ai-ethics reinforcement-learning ai-research self-hosted self-supervised

Reddit

[Deep Dive] How We Solved Poker: From Academic Bots to Superhuman AI (1998-2025)

gist.github.com

21

Significant_Dog9466

about 1 month ago

r/programming ai reinforcement-learning game-engines deep-learning

Hugging Face Trending

Popular models from Hugging Face • Updated 41 minutes ago

DR-Tulu-8B

58

724

reinforcement-learning ai-research deep-learning

MiMo-7B-RL

245

5,976

ai reinforcement-learning hardware
