Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add LFM2 to SFT notebook examples
#4455 opened Nov 5, 2025 by sergiopaniego Loading…
5 tasks
fix: fix a little bug in GRPOTrainer
#4452 opened Nov 5, 2025 by SolarWindRider Loading…
Add kernels to Docker images
#4445 opened Nov 3, 2025 by ishitab02 Loading…
2 of 5 tasks
added 10 papers (+trainer cross-links) for #4407
#4441 opened Nov 3, 2025 by SSusantAchary Loading…
4 tasks done
docs: Expand training customization examples
#4427 opened Nov 2, 2025 by behroozazarkhalili Loading…
4 tasks done
Replace flash attention2 with kernels-community/flash-attn2
#4426 opened Nov 2, 2025 by tamoghnokandar Loading…
4 of 5 tasks
Gold refactor
#4373 opened Oct 29, 2025 by qgallouedec Draft
5 tasks
[OpenENV] Openenv rollout_func signature proposal
#4344 opened Oct 27, 2025 by kashif Loading…
5 tasks
wip - env
#4320 opened Oct 22, 2025 by qgallouedec Loading…
5 tasks
refactor: simplify parameter freezing in modeling_base.py
#4305 opened Oct 20, 2025 by Ki-Seki Loading…
2 of 5 tasks
[SFT] Log mean token accuracy from Liger kernel
#4302 opened Oct 18, 2025 by kashif Loading…
5 tasks
Tool call
#4300 opened Oct 18, 2025 by qgallouedec Draft
5 tasks
ProTip! Exclude everything labeled bug with -label:bug.