Reinforcement Learning Example Code

16 open source projects transforming AI and machine learning

From fine-tuning open source models to building agentic frameworks on top of them, the open source world is ripe with ...

WinBuzzer

AI Coding: Microsoft’s 7B X-Coder Outperforms 14B Rivals on Synthetic Data

Microsoft and Tsinghua University have developed a 7B-parameter AI coding model that outperforms 14B rivals using only ...

MemRL outperforms RAG on complex agent benchmarks without fine-tuning

MemRL separates stable reasoning from dynamic memory, giving AI agents continual learning abilities without model fine-tuning ...

4d on MSN

A Q&A with Amanda Askell, the lead author of Anthropic’s new 'constitution' for AIs

The Anthropic philosopher explains how and why her company updated its guide for shaping the conduct and character of its ...

Analytics India Magazine

Complex Reinforcement Learning Tasks Can Cost Up to $20,000 Each: EpochAI Report

Among those interviewed, one RL environment founder said, “I’ve seen $200 to $2,000 mostly. $20k per task would be rare but ...

14d

Global AI Use Case Report Highlights Emerging Opportunities Across Industries

“Exploring How Generative AI, Edge AI, and Quantum Machine Learning Are Revolutionizing Healthcare, Finance, Logistics, and Media With Real World Solutions and Expert Insights” Boston, Jan. 12, 2026 ...

FintechNews CH

Top Identity Fraud Trends in 2026

In 2025, online fraud continued to proliferate, driven by identity fraud, advances in artificial intelligence (AI), and increasingly sophisticated attacks targeting finance and e-commerce.

GitHub

AI Code Generation Prompts Examples (Python)

The purpose of this repository is to provide a few sample prompts used in order to create a simple Python GUI for the Linux desktop project. I created this repository and wrote these prompts on March ...

19d

Nous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code moment

NousCoder-14B, an open-source AI coding model trained in four days on Nvidia B200 GPUs, publishing its full reinforcement-learning stack ...

IEEE

Toward Energy-Efficient Spike-Based Deep Reinforcement Learning With Temporal Coding

Abstract: Deep reinforcement learning (DRL) facilitates efficient interaction with complex environments by enabling continuous optimization strategies and providing agents with autonomous learning ...

GitHub

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

We are excited to release the CapRL 2.0 series: CapRL-Qwen3VL-2B and CapRL-Qwen3VL-4B. These models feature fewer parameters while delivering even more powerful captioning performance. Notably, ...

IEEE

Generalizable Offline Multiobjective Reinforcement Learning via Preference-Conditioned Diffuser

Abstract: Multiobjective reinforcement learning (MORL) addresses sequential decision-making problems with multiple objectives by learning policies optimized for diverse preferences. While traditional ...
