Reinforcement Learning Using Python

Open-Source Coding Model Ornith-1.0 Writes Its Own Training Scaffold in Reinforcement Learning

DeepReinforce today released Ornith-1.0, a family of open-source coding models built around a mechanism most RL-trained agents avoid: the model itself writes the training harness that guides its own ...

GitHub

ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning

To fully reproduce our experiments, please refer to ReproduceExps.md. To download our training data and reproduce the plots in the paper, please refer to ...

IEEE

Warfarin Dose Management Using Offline Deep Reinforcement Learning

Abstract: Warfarin is a commonly prescribed anticoagulant with a narrow therapeutic window, which requires frequent and specialized monitoring. This work aims to develop standardized optimal warfarin ...

GitHub

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

DR Tulu-8B is the first open Deep Research (DR) model trained for long-form DR tasks. DR Tulu-8B matches OpenAI DR on long-form DR benchmarks. Feburary 9, 2026: 🔥 We released a free interactive demo ...

IEEE

LiDAR-Based Autonomous Exploration Method of Mobile Robot Using Deep Reinforcement Learning in Unknown Environments

Abstract: The autonomous exploration holds significant application value in tasks, such as mine exploration and environmental modeling, and personnel search and rescue, effectively boosting task ...

26d

NVIDIA Unveils Vera, the CPU for Agents

NVIDIA launches high-performance, energy-efficient NVIDIA Vera CPUs to drive diverse workloads across industries, including agentic ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results