
Use PytorchModelHubMixin to save models as safetensors#125

Merged
alexander-soare merged 12 commits into main from alexander-soare/use_hub_mixin on May 1, 2024
Conversation

@alexander-soare
Contributor

@alexander-soare alexander-soare commented May 1, 2024

What does this PR do?

Primary changes:

  • Make all policies subclass PytorchModelHubMixin as well as nn.Module.
  • Remove save and load from all policies.
  • Update logger.py to use save_pretrained instead of save, and to separately save the Hydra config as a yaml file.
  • Update eval.py to use from_pretrained (this consolidates the two user paths into one, at the expense of some conditional branching in the code).
  • Update examples to conform.
  • Update README section on uploading policies.
  • Remove "pretrained_model_path" from yaml config.
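For readers unfamiliar with the mixin pattern behind these changes: subclassing the hub mixin is what gives every policy `save_pretrained`/`from_pretrained`. The toy pure-Python stand-in below sketches only the pattern; the real `huggingface_hub` `PyTorchModelHubMixin` additionally serializes weights as safetensors and supports pushing to the Hub (`ToyHubMixin` and `ToyPolicy` are illustrative names, not lerobot code):

```python
import json
import os
import tempfile


class ToyHubMixin:
    """Toy stand-in for huggingface_hub's PyTorchModelHubMixin: adds
    save_pretrained/from_pretrained to any class holding a dict config
    and dict weights (plain floats here instead of tensors)."""

    def save_pretrained(self, save_directory):
        os.makedirs(save_directory, exist_ok=True)
        with open(os.path.join(save_directory, "config.json"), "w") as f:
            json.dump(self.config, f)
        with open(os.path.join(save_directory, "weights.json"), "w") as f:
            json.dump(self.weights, f)

    @classmethod
    def from_pretrained(cls, save_directory):
        with open(os.path.join(save_directory, "config.json")) as f:
            config = json.load(f)
        obj = cls(config)
        with open(os.path.join(save_directory, "weights.json")) as f:
            obj.weights = json.load(f)
        return obj


class ToyPolicy(ToyHubMixin):
    def __init__(self, config):
        self.config = config
        self.weights = {"w": 0.0}


with tempfile.TemporaryDirectory() as d:
    policy = ToyPolicy({"dim_model": 512})
    policy.weights["w"] = 1.5
    policy.save_pretrained(d)
    restored = ToyPolicy.from_pretrained(d)
```

This is why the PR can delete the hand-rolled `save`/`load` methods: the mixin supplies a single, consistent serialization path for every policy class.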

Side changes:

  • Add get_policy_and_config_classes to avoid repetition of the conditional branching across policy names.
  • Change ACTConfig.d_model to ACTConfig.dim_model for consistency with ACTConfig.dim_feedforward.
  • Change model saving directory from "models" to "checkpoints" to avoid confusion between "policy" and "model" and be more explicit.
  • Add functionality to eval.py to evaluate a policy that is not pretrained.

Not for this PR:

  • TDMPC.
  • Script for uploading training checkpoint to the hub.
  • Model cards.

How was it tested?

  • CI

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR. Try to avoid tagging more than 3 people.

@alexander-soare alexander-soare marked this pull request as draft May 1, 2024 12:13
@alexander-soare alexander-soare marked this pull request as ready for review May 1, 2024 12:23
@alexander-soare alexander-soare marked this pull request as draft May 1, 2024 13:14
@alexander-soare alexander-soare marked this pull request as ready for review May 1, 2024 14:27
Contributor

@Cadene Cadene left a comment


Left some comments. Nothing blocking merging. Nice work!!!

Comment thread Makefile
Comment thread README.md
Comment thread README.md
Comment thread examples/2_evaluate_pretrained_policy.py
Comment thread examples/2_evaluate_pretrained_policy.py
Comment thread lerobot/common/policies/factory.py
Comment thread lerobot/common/policies/normalize.py
Comment thread lerobot/common/policies/normalize.py
Comment thread lerobot/scripts/eval.py
Comment thread lerobot/scripts/train.py
@alexander-soare alexander-soare removed the request for review from qgallouedec May 1, 2024 15:16
@alexander-soare alexander-soare merged commit a489109 into main May 1, 2024
@alexander-soare alexander-soare deleted the alexander-soare/use_hub_mixin branch May 1, 2024 15:17
Contributor

@Wauplin Wauplin left a comment


Hey there! Sorry I only spotted this PR now :) Nice work using PytorchModelHubMixin to integrate with the Hub!

I left a comment below as I think it can be even more tightly integrated for better documentation/discoverability on the Hub. In general, never hesitate to ping me (@Wauplin) in any PR related to the Hub. I maintain huggingface_hub and I'd be happy to help if you have questions about how other HF libraries do things (and how to stay consistent with them) 🤗

Comment thread lerobot/common/policies/act/modeling_act.py
menhguin pushed a commit to menhguin/lerobot that referenced this pull request Feb 9, 2025
Kalcy-U referenced this pull request in Kalcy-U/lerobot May 13, 2025
Co-authored-by: Remi <re.cadene@gmail.com>
ZoreAnuj pushed a commit to luckyrobots/lerobot that referenced this pull request Jul 29, 2025
hara2323 added a commit to matsuolab-llmcompe2025-team-suzuki/lerobot that referenced this pull request Apr 18, 2026
For 32-dim sparse action layouts where only ~10 dims carry real signal
(airoa HSR with pi05 baseline multi-robot layout), the other 22 zero-padded
dims dilute the flow matching loss and cause active dims (especially gripper)
to under-learn.

Add optional config field `action_loss_exclude_dims: list[int] | None`:
- When None (default): no change, backward compatible
- When specified: mask listed dims from flow loss sum AND adjust denominator
  to use only included dims, so active dim signal is preserved
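A minimal sketch of the masking arithmetic described above, assuming a flattened list of per-dim loss terms (the function and variable names are illustrative, not the actual lerobot code):

```python
def masked_flow_loss(per_dim_errors, exclude_dims):
    """Mean of per-dim loss terms with excluded dims removed from BOTH the
    sum and the denominator, so active-dim signal is not diluted."""
    excluded = set(exclude_dims)
    included = [e for i, e in enumerate(per_dim_errors) if i not in excluded]
    return sum(included) / len(included)


# 4-dim toy layout: dims 2 and 3 are zero-padded and carry no signal.
errors = [0.4, 0.2, 0.0, 0.0]
masked = masked_flow_loss(errors, exclude_dims=[2, 3])  # ~0.3
naive = sum(errors) / len(errors)                       # ~0.15, diluted
```

Adjusting the denominator is the key point: dropping the excluded terms from the sum alone would still average over all 32 dims and keep the active-dim gradient small.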

Related: icra_2026_ramen Issue huggingface#125 (Run 61)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
hara2323 added a commit to matsuolab-llmcompe2025-team-suzuki/lerobot that referenced this pull request Apr 19, 2026
… loss

Two optional loss-shape modifications for PI0.5 flow matching, motivated
by the gripper-underlearning issue observed in icra_2026_ramen Run 60/61
(Issue huggingface#125 on the icra side).

1. flow_loss_type: "mse" (default) | "smooth_l1"
   Swaps F.mse_loss for F.smooth_l1_loss (Huber) with configurable beta.
   Empirical precedent: Kim, Finn et al. 2025 (arxiv 2502.19645) found
   L1/Huber regression beat classification heads on OpenVLA LIBERO
   fine-tuning — directly contradicting the DAFD (gripper=CE) approach
   tried in Run 60 which hurt accuracy.

2. use_min_snr_weighting + min_snr_gamma
   Min-SNR-γ (Hang et al. 2023, arxiv 2303.09556) v-prediction variant
   applied to flow matching via the diffusion↔FM weighting equivalence
   (diffusionflow.github.io). Per-sample weight:
       w(t) = min(SNR(t), γ) / (SNR(t) + 1)
       SNR(t) = ((1-t)/t)^2   (linear FM schedule)
   Peaks at SNR=γ (≈ t=0.31 with γ=5), downweighting both very-low and
   very-high noise timesteps. Default γ=5.0 from original paper.
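The per-sample weight above can be written directly; a standalone sketch under the linear FM schedule stated in the commit message:

```python
def min_snr_weight(t, gamma=5.0):
    """Min-SNR-gamma weight for a linear flow-matching schedule:
    SNR(t) = ((1 - t) / t)^2,  w(t) = min(SNR, gamma) / (SNR + 1)."""
    snr = ((1.0 - t) / t) ** 2
    return min(snr, gamma) / (snr + 1.0)


w_mid = min_snr_weight(0.5)    # SNR = 1, so w = min(1, 5) / 2 = 0.5
w_peak = min_snr_weight(0.31)  # near the peak, approx gamma / (gamma + 1)
w_low = min_snr_weight(0.05)   # SNR = 361, clipped: w = 5 / 362, small
```

Note how the clip at gamma is what downweights the high-SNR end: without it, w(t) would approach 1 as SNR grows.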

Both defaults preserve current behavior (mse, no weighting).

Logging:
- flow_loss_raw: pre-weighting loss (for cross-run comparison)
- flow_loss: final (post-weighting) loss
- min_snr_weight_mean: diagnostic for the Min-SNR weight batch average

Tests (tests/policies/pi0_pi05/test_pi05_loss_variants.py):
- Config validation for new fields + rejection of invalid values
- Min-SNR formula numerical correctness (scalar + vectorized)
- SmoothL1 produces smaller loss than MSE on outlier inputs

Related: icra_2026_ramen Issue huggingface#125 (Run 62)

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
hara2323 added a commit to matsuolab-llmcompe2025-team-suzuki/lerobot that referenced this pull request Apr 19, 2026
…guard

Follow-up on self-review: dropping the SmoothL1 variant from this PR
because the cited evidence (Kim-Finn OFT, arxiv 2502.19645) does not
actually ablate L1 vs MSE — it compares L1 vs discrete tokens vs
diffusion head in a single-step regression setting, not flow matching
velocity. The direct empirical case for swapping MSE → SmoothL1 in FM
VLA is therefore weaker than claimed in the original PR body.

Also: SmoothL1 with beta=1.0 behaves as MSE for |error| < 1, and FM
velocity residuals on normalized actions are typically below 1, so the
change would be near no-op most of the time.
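The near-no-op claim can be checked numerically. A sketch of the standard elementwise SmoothL1 (Huber) definition, as used by PyTorch's F.smooth_l1_loss:

```python
def smooth_l1(x, beta=1.0):
    """Elementwise SmoothL1: quadratic for |x| < beta, linear beyond."""
    ax = abs(x)
    if ax < beta:
        return 0.5 * ax * ax / beta
    return ax - 0.5 * beta


# Normalized-action flow residuals are typically well below beta = 1.0,
# so the loss stays in the quadratic (MSE-like) regime:
small = smooth_l1(0.3)  # ~0.045, i.e. 0.5 * 0.3^2
large = smooth_l1(2.0)  # 1.5, linear region: 2.0 - 0.5
```

Inside the quadratic region the gradient is proportional to the residual, exactly as for MSE (up to the 0.5/beta scale), which is why the swap would rarely change optimization in practice.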

Keeping Min-SNR only makes this Run 62 change a single-variable
intervention, which preserves causal attribution vs Run 61 (mask only).

Additions:
- Terminal-SNR guard: follow original Hang et al. repo
  (guided_diffusion/gaussian_diffusion.py:895) and set weight=1.0 when
  SNR==0. With the eps=1e-6 t-clamp this does not fire in practice,
  but it guards the edge if a caller passes t=1.0 exactly.
- Test for the guard behavior.
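The guard described above can be sketched as a small piecewise case on top of the weight formula (illustrative code, not the actual lerobot implementation):

```python
def min_snr_weight_guarded(t, gamma=5.0):
    """Min-SNR weight with the terminal-SNR guard: at t == 1.0 the linear
    FM schedule gives SNR = 0, where the weight is pinned to 1.0
    (mirroring the original Hang et al. repo behavior)."""
    snr = 0.0 if t >= 1.0 else ((1.0 - t) / t) ** 2
    if snr == 0.0:  # terminal-SNR guard
        return 1.0
    return min(snr, gamma) / (snr + 1.0)


w_terminal = min_snr_weight_guarded(1.0)  # 1.0: guard fires
w_mid = min_snr_weight_guarded(0.5)       # 0.5: same as unguarded formula
```

Without the guard, SNR = 0 would give w = 0/1 = 0 rather than a division error, so the guard changes the terminal weight from 0 to 1 rather than preventing a crash; it only matters if a caller bypasses the eps t-clamp.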

Related: icra_2026_ramen Issue huggingface#125

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
hara2323 added a commit to matsuolab-llmcompe2025-team-suzuki/lerobot that referenced this pull request Apr 19, 2026
The lerobot wandb wrapper emits a warning and drops any loss_dict value
whose type is not a scalar, so tensors / lists of shape (action_dim,)
are silently ignored. Previously "loss_per_dim", "loss_per_dim_weighted",
and "smoothness_per_dim" were all dropped, making per-dim analysis
impossible without modifying the wrapper.

Expand each tensor into individual scalar entries:
  loss_per_dim           -> loss_per_dim_00, loss_per_dim_01, ..., loss_per_dim_31
  loss_per_dim_weighted  -> loss_per_dim_weighted_{NN}
  smoothness_per_dim     -> smoothness_per_dim_{NN}
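The expansion above can be sketched in plain Python, with a list standing in for the per-dim tensor (names are illustrative, not the wrapper's actual code):

```python
def expand_per_dim(loss_dict):
    """Replace list/tuple values with one zero-padded scalar entry per dim,
    so a scalars-only logger (e.g. the wandb wrapper) keeps them instead of
    silently dropping the whole key."""
    out = {}
    for key, value in loss_dict.items():
        if isinstance(value, (list, tuple)):
            for i, v in enumerate(value):
                out[f"{key}_{i:02d}"] = v  # loss_per_dim -> loss_per_dim_00, ...
        else:
            out[key] = value
    return out


expanded = expand_per_dim({"loss": 0.5, "loss_per_dim": [0.1, 0.2]})
# {'loss': 0.5, 'loss_per_dim_00': 0.1, 'loss_per_dim_01': 0.2}
```

The zero-padding keeps the keys lexically sorted in the wandb UI, so dim 06 (gripper in the sparse layout) is easy to pick out of 32 entries.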

This is required for icra_2026_ramen Issue huggingface#125 Run 62 monitoring where
gripper (dim 6 in 32-dim sparse layout) per-dim loss is the go/no-go
signal at 5-10k steps. Applies to Run 63+ as well.

Related: icra_2026_ramen Issue huggingface#125

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>