PGCodeLLM

trl Public

[Downstream Fork DO NOT EDIT MAIN] Train transformer language models with reinforcement learning.

Python

OpenRLHF Public

[Fork] An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python

pytest-json-report Public

🗒️ A pytest plugin to report test results as JSON

Python

critic-rl Public

Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

Python

rllm Public

Democratizing Reinforcement Learning for LLMs

Python

LLaMA-Factory Public

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python

Provide feedback