Training Reinforcement Learning

News

21h

SJTU and ByteDance Join Forces to Launch RhymeRL: 2.6x Improvement in Reinforcement Learning Training Speed!

This similarity primarily arises from mainstream RL algorithms such as PPO/GRPO, which use gradient clipping mechanisms to ensure training stability. This mechanism smooths the model's evolutionary ...

20h

Conquering the 'Slowest Link' in Reinforcement Learning! Shanghai Jiao Tong University and ByteDance Join Forces, RL Training Speed Soars by 2.6 Times

How can we conquer this last stronghold of AI infrastructure? Now, the research team from Shanghai Jiao Tong University and ...

Microsoft’s new AI framework trains powerful reasoning models with a fraction of the cost

The rStar2-Agent framework boosts a 14B model to outperform a 671B giant, offering a path to state-of-the-art AI without ...

VentureBeat 6y

OpenAI launches reinforcement learning training to prepare for artificial general intelligence

OpenAI today announced the launch of Spinning Up, a program designed to ...

EurekAlert! 5d

Reinforcement learning is making a buzz in space

A U.S. Naval Research Laboratory (NRL) research team successfully conducted the first reinforcement learning (RL) control of ...

10d

CoreWeave to Acquire OpenPipe, Leader in Reinforcement Learning

CoreWeave, Inc. (NASDAQ: CRWV), the AI Hyperscaler™, today announced a definitive agreement to acquire OpenPipe Inc, a ...

10d on MSN

CoreWeave acquires agent-training startup OpenPipe

CoreWeave hopes the YC-backed startup will help it expand up the stack and cash in on enterprises developing AI agents.
