Remove padding_value from CPO and use pad_token_id by albertvillanova · Pull Request #4962 · huggingface/trl

albertvillanova · 2026-02-04T08:40:54Z

Remove padding_value from CPO and use pad_token_id.

This PR removes the padding_value parameter from the CPOConfig class and updates the trainer logic to always use the tokenizer's pad_token_id for padding. This simplifies configuration and ensures consistent padding behavior throughout the codebase.

Follow-up to:

HuggingFaceDocBuilderDev · 2026-02-04T08:43:43Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2026-02-04T20:12:50Z

trl/experimental/cpo/cpo_trainer.py

        self.max_length = max_length
        self.generate_during_eval = args.generate_during_eval
-        self.padding_value = args.padding_value if args.padding_value is not None else processing_class.pad_token_id
+        self.pad_token_id = processing_class.pad_token_id


the issue is that, if the tokenizer doesn't have a padding token (which we should allow), it will fail. We should align with stable trainers:

trl/trl/trainer/grpo_trainer.py

Lines 305 to 309 in 963d046

if tokenizer.pad_token is None:

tokenizer.pad_token = tokenizer.eos_token

self.pad_token = tokenizer.pad_token

self.pad_token_id = tokenizer.pad_token_id

Remove padding_value from CPO and use pad_token_id

5432a21

qgallouedec reviewed Feb 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove padding_value from CPO and use pad_token_id#4962

Remove padding_value from CPO and use pad_token_id#4962
albertvillanova wants to merge 1 commit intohuggingface:mainfrom
albertvillanova:fu-4846

albertvillanova commented Feb 4, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Feb 4, 2026

Uh oh!

qgallouedec Feb 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	if tokenizer.pad_token is None:
	tokenizer.pad_token = tokenizer.eos_token

	self.pad_token = tokenizer.pad_token
	self.pad_token_id = tokenizer.pad_token_id

Conversation

albertvillanova commented Feb 4, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Feb 4, 2026

Uh oh!

qgallouedec Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants