PGCodeLLM
Popular repositories
- trl Public Forked from huggingface/trl
[Downstream Fork DO NOT EDIT MAIN] Train transformer language models with reinforcement learning.
Python
- OpenRLHF Public
[Fork] An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python
- pytest-json-report Public Forked from numirias/pytest-json-report
🗒️ A pytest plugin to report test results as JSON
Python
- critic-rl Public Forked from HKUNLP/critic-rl
Code for Paper: Teaching Language Models to Critique via Reinforcement Learning
Python
- LLaMA-Factory Public Forked from hiyouga/LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python
Repositories
- harbor Public Forked from harbor-framework/harbor
Harbor is a framework for running agent evaluations and creating and using RL environments.
- FeatBench Public Forked from TsinghuaISE/FeatBench
Official implementation of our paper "FeatBench: Evaluating Coding Agents on Feature Implementation for Vibe Coding".
- LiveCodeBenchFork Public Forked from LiveCodeBench/LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
- OpenSandbox Public Forked from alibaba/OpenSandbox
OpenSandbox is a general-purpose sandbox platform for AI applications, offering multi-language SDKs, unified sandbox APIs, and Docker/Kubernetes runtimes for scenarios like Coding Agents, GUI Agents, Agent Evaluation, AI Code Execution, and RL Training.
- FEA-Bench Public Forked from microsoft/FEA-Bench
[ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation
- LLaMA-Factory Public Forked from hiyouga/LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
People
This organization has no public members. You must be a member to see who’s a part of this organization.