Pinned Loading
-
HandRefiner
HandRefiner Public[ACM MM 2024] Offical Code for "HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting"
-
noisy_dinov2
noisy_dinov2 Public[NeurIPS 2025] "Ditch the Denoiser: Emergence of Noise Robustness in Self-Supervised Learning from Data Curriculum": Improve Noise-robustness of Joint-embedding SSL Models, e.g., DINOv2
-
huginn-latent-cot
huginn-latent-cot Public[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer
-
prompt-augmentation-GRPO
prompt-augmentation-GRPO PublicWe propose prompt augmentation, a data augmentation technique for RL post-training on mathematical reasoning that substantially extends training duration, stabilizes training in low-entropy regimes…
Python 1
-
coconut
coconut PublicForked from facebookresearch/coconut
Training Large Language Model to Reason in a Continuous Latent Space
Python
-
rl-reasoning-optimizer
rl-reasoning-optimizer PublicFully open reproduction of DeepSeek-R1
Python 1
If the problem persists, check the GitHub status page or contact support.


