Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
1
3
38
Michał Wiliński
MWilinski
Follow
elcapitano2k24's profile picture
arjunpushpik's profile picture
alxtrtw's profile picture
17 followers
·
26 following
https://michal-wilinski.com
inverse_hessian
JanekDev
AI & ML interests
Machine Learning, Reinforcement Learning
Recent Activity
updated
a collection
6 days ago
irl-alignment-5.1-expert
updated
a dataset
6 days ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
published
a dataset
6 days ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
View all activity
Organizations
MWilinski
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a collection
6 days ago
irl-alignment-5.1-expert
Collection
4 items
•
Updated
6 days ago
updated
a dataset
6 days ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
Viewer
•
Updated
6 days ago
•
1k
•
5
published
a dataset
6 days ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
Viewer
•
Updated
6 days ago
•
1k
•
5
updated
a collection
6 days ago
irl-alignment-5.1-expert
Collection
4 items
•
Updated
6 days ago
updated
a dataset
6 days ago
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer
•
Updated
6 days ago
•
1k
•
3
published
a dataset
6 days ago
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer
•
Updated
6 days ago
•
1k
•
3
updated
a dataset
7 days ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer
•
Updated
7 days ago
•
1k
•
3
published
a dataset
7 days ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer
•
Updated
7 days ago
•
1k
•
3
updated
a dataset
7 days ago
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer
•
Updated
7 days ago
•
1k
•
4
published
a dataset
7 days ago
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer
•
Updated
7 days ago
•
1k
•
4
liked
a model
3 months ago
PleIAs/Baguettotron
Text Generation
•
0.3B
•
Updated
Dec 14, 2025
•
1.82k
•
219
updated
4 datasets
3 months ago
MWilinski/hh-rlhf-harmless-base
Viewer
•
Updated
Nov 5, 2025
•
44.8k
•
34
MWilinski/hh-rlhf-helpful-base
Viewer
•
Updated
Nov 5, 2025
•
46.2k
•
43
MWilinski/hh-rlhf-helpful-online
Viewer
•
Updated
Nov 5, 2025
•
23.1k
•
31
MWilinski/hh-rlhf-helpful-rejection-sampled
Viewer
•
Updated
Nov 5, 2025
•
55.2k
•
21
updated
a collection
3 months ago
hh-rlhf-TRL
Collection
4 items
•
Updated
Nov 5, 2025
Load more