wenquanlu

Wenquan Lu wenquanlu

Achievements

HandRefiner HandRefiner Public

[ACM MM 2024] Offical Code for "HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting"

Python 805 37
noisy_dinov2 noisy_dinov2 Public

[NeurIPS 2025] "Ditch the Denoiser: Emergence of Noise Robustness in Self-Supervised Learning from Data Curriculum": Improve Noise-robustness of Joint-embedding SSL Models, e.g., DINOv2

Python 9 2
huginn-latent-cot huginn-latent-cot Public

[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer

Python 17 3
prompt-augmentation-GRPO prompt-augmentation-GRPO Public

We propose prompt augmentation, a data augmentation technique for RL post-training on mathematical reasoning that substantially extends training duration, stabilizes training in low-entropy regimes…

Python 1
coconut coconut Public

Forked from facebookresearch/coconut

Training Large Language Model to Reason in a Continuous Latent Space

Python
rl-reasoning-optimizer rl-reasoning-optimizer Public

Fully open reproduction of DeepSeek-R1

Python 1