Huggingface Projects

company

https://huggingface.co/

huggingface

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

sergiopaniego updated a dataset about 2 hours ago

huggingface-projects/Deep-RL-Course-Certification

pcuenq updated a dataset about 17 hours ago

huggingface-projects/drlc-leaderboard-data

akhaliq submitted a paper 2 days ago

Visual Personalization Turing Test

View all activity

sergiopaniego

updated a dataset about 2 hours ago

huggingface-projects/Deep-RL-Course-Certification

Viewer • Updated about 2 hours ago • 1.66k • 456 • 16

AdinaY

posted an update about 3 hours ago

Post

AI for science is moving fast🚀

Intern-S1-Pro 🔬 a MoE multimodal scientific reasoning model from Shanghai AI Lab

internlm/Intern-S1-Pro

✨ 1T total / 22B active
✨ Apache 2.0
✨ SoTA scientific reasoning performance
✨ FoPE enables scalable modeling of long physical time series (10⁰–10⁶)

pcuenq

updated a dataset about 17 hours ago

huggingface-projects/drlc-leaderboard-data

Viewer • Updated about 12 hours ago • 48.6k • 1.49k • 2

AdinaY

posted an update 1 day ago

Post

268

✨ China’s open source AI ecosystem has entered a new phase

https://huggingface.co/blog/huggingface/one-year-since-the-deepseek-moment-blog-3

One year after the “DeepSeek Moment,” open source has become the default. Models, research, infrastructure, and deployment are increasingly shared to support large-scale, system-level integration.

This final blog examines how leading Chinese AI organizations are evolving ,and what this implies for the future of open source.

AdinaY

posted an update 1 day ago

Post

206

GLM just entered the OCR field🔥

zai-org/GLM-OCR

✨ 0.9B
✨ MIT licensed
✨ Multimodal GLM-V architecture
✨ #1 on OmniDocBench v1.5 (94.62)

akhaliq

submitted 2 papers to Daily Papers 2 days ago

Visual Personalization Turing Test

Paper • 2601.22680 • Published 6 days ago • 2

Causal World Modeling for Robot Control

Paper • 2601.21998 • Published 6 days ago • 27

AdinaY

posted an update 2 days ago

Post

1454

Step 3.5 Flash 🔥 new foundation model from StepFun ai

https://huggingface.co/collections/stepfun-ai/step-35-flash

✨ Sparse MoE：196B/11B active
✨ Supports up to 256K context
✨ Multi-token prediction for fast decoding (100–300 tok/s)
✨ Runs locally on consumer hardware

AdinaY

posted an update 6 days ago

Post

1035

What a week 🤯

Following DeepSeek, Kimi, Qwen, Baidu, and Ant Group, Unitree Robotics
has now released a VLA model on the hub too!

unitreerobotics/UnifoLM-VLA-Base

sergiopaniego

posted an update 6 days ago

Post

302

Meet the Post-Training Toolkit (PTT), which easily integrates with TRL via a single callback, by Aditya Challapally ( @microsoft ):

🔍 Detects training issues early
🛠 Lets you intervene safely
📊 Keeps long training runs stable, auditable & efficient

Microsoft blog: https://devblogs.microsoft.com/engineering-at-microsoft/diagnosing-instability-in-production-scale-agent-rl/

Integration guide: https://huggingface.co/docs/trl/main/en/ptt_integration

Code: https://github.com/microsoft/post-training-toolkit

victor

posted an update 6 days ago

Post

346

Interesting article: use Claude Code to help open models write CUDA kernels (for eg) by turning CC traces into Skills. They made a library out of it 👀

https://huggingface.co/blog/upskill

AdinaY

posted an update 7 days ago

Post

244

LongCat-Flash-Lite🔥 a non-thinking MoE model released by Meituan LongCat team.

meituan-longcat/LongCat-Flash-Lite

✨ Total 68.5B / 3B active - MIT license
✨ 256k context
✨ Faster inference with N-gram embeddings

sergiopaniego

posted an update 7 days ago

Post

2470

New TRL + OpenEnv example! 💥

Fine tune an LLM for playing Sudoku using an RL env via OpenEnv

Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook.

Enjoy!

Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb

Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py

1 reply

AdinaY

posted an update 7 days ago

Post

232

Ant Group is going big on robotics 🤖

They just dropped their first VLA and depth perception foundation model on huggingface.

✨ LingBot-VLA :
- Trained on 20k hours of real-world robot data
- 9 robot embodiments
- Clear no-saturation scaling laws
- Apache 2.0

Model: https://huggingface.co/collections/robbyant/lingbot-vla
Paper:
A Pragmatic VLA Foundation Model (2601.18692)

✨ LingBot-Depth:
- Metric-accurate 3D from noisy, incomplete depth
- Masked Depth Modeling (self-supervised)
- RGB–depth alignment, works with <5% sparse depth
- Apache 2.0

Model: https://huggingface.co/collections/robbyant/lingbot-depth
Paper:
Masked Depth Modeling for Spatial Perception (2601.17895)

AdinaY

posted an update 8 days ago

Post

288

Blog 2 is live 🔥 After the DeepSeek R1 moment, what came next wasn’t just more models.

https://huggingface.co/blog/huggingface/one-year-since-the-deepseek-moment-blog-2

In this second post, we dive into the architectural and hardware choices shaping China’s open AI ecosystem.

2 replies

AdinaY

submitted a paper to Daily Papers 9 days ago

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Paper • 2601.18137 • Published 10 days ago • 25

AdinaY

posted an update 9 days ago

Post

1281

Big day in open source AI!!

✨ DeepSeek released OCR2 💥
deepseek-ai/DeepSeek-OCR-2

✨ Kimi K2.5 just landed 🔥
moonshotai/Kimi-K2.5

With the Chinese Spring Festival 3 weeks away,

what’s coming next?👀

AdinaY

posted an update 9 days ago

Post

867

Kimi K2.5 from Moonshot AI is more than just another large model🤯

https://huggingface.co/collections/moonshotai/kimi-k25

✨ Native multimodality : image + video + language + agents 💥
✨1T MoE / 32B active
✨ 256K context
✨ Modified MIT license
✨ Agent Swarm execution
✨ Open weights + open infra mindset

sergiopaniego

posted an update 9 days ago

Post

2088

Date idea: read the entire Transformers v5.0.0 release notes

Officially stable now: https://github.com/huggingface/transformers/releases/tag/v5.0.0

1 reply

akhaliq

submitted a paper to Daily Papers 13 days ago

Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Paper • 2601.14253 • Published 15 days ago • 10

AI & ML interests

Recent Activity

Team members 20

huggingface-projects's activity