A junior in the Department of Computer Science and Technology at Tsinghua University.
-
Tsinghua University
- Beijing
- https://racktic.github.io/
- https://scholar.google.com/citations?user=AvbV0HcAAAAJ&hl=en&oi=ao
Highlights
- Pro
Pinned Loading
-
RLHF-V/RLAIF-V
RLHF-V/RLAIF-V Public[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
-
hiyouga/EasyR1
hiyouga/EasyR1 PublicEasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
-
verl-project/verl
verl-project/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

