Beijing Academy of Artificial Intelligence

non-profit

https://www.baai.ac.cn/english.html

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

xxuan01 updated a dataset about 17 hours ago

BAAI/ToucHD-Sim

xxuan01 updated a dataset about 17 hours ago

BAAI/ToucHD-Force

xxuan01 updated a dataset about 17 hours ago

BAAI/ToucHD-Mani

View all activity

Papers

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation

View all Papers

Sri-Vigneshwar-DJ

posted an update about 7 hours ago

Post

Just released a new dataset designed for training reasoning models on Meta (Facebook/Instagram) advertising fatigue detection!

What is it? A GRPO (Group Relative Policy Optimization) training dataset with 200+ carefully crafted scenarios covering:

🔍 Fatigue Signal Detection: CTR drops, CPM spikes, frequency analysis
🩺 Performance Diagnosis: Root cause analysis frameworks
📋 Strategy: Creative refresh cadence, testing frameworks
📊 Analysis: ROI calculations, metric interpretation
Why GRPO? GRPO training helps models learn structured reasoning. Each response follows the <thinking> and <answer> format.

Check it out here: Sri-Vigneshwar-DJ/meta-fatigue-grpo-dataset

xxuan01

updated 3 datasets about 17 hours ago

xxuan01

published 2 datasets 2 days ago

BAAI/ToucHD-Force

Updated about 14 hours ago • 76 • 2

BAAI/ToucHD-Mani

Updated about 17 hours ago • 22 • 2

xxuan01

updated a collection 2 days ago

ToucHD

Collection

Tactile Hierarchical Dynamic Dataset • 3 items • Updated about 16 hours ago • 4

xxuan01

published a dataset 2 days ago

BAAI/ToucHD-Sim

Updated about 9 hours ago • 10 • 2

ZacLiu

authored a paper 7 days ago

Towards Automated Kernel Generation in the Era of LLMs

Paper • 2601.15727 • Published 14 days ago • 16

ZmoreZoe

updated a Space 9 days ago

FlagEval-Robo

🐢

Compare and evaluate language models side-by-side

Sri-Vigneshwar-DJ

posted an update 10 days ago

Post

190

🏙️ Hugging Face Community Post
Title: 🧬 Experimenting with "Dynamic Chaos" in Tamil SLMs

Hi everyone! I just published a new experimental study on Small Language Model (SLM) resilience.

I took the Qwen2.5-0.5B model and put it through a "Chaos Phase" to see how much weight data a tiny model can lose before its understanding of classical Tamil grammar breaks.

Key highlights of the study:

Target Data: Fine-tuned on the Thirukkural (1,330 couplets + modern explanations).
The Chaos Step: Applied 20% random weight pruning but implemented "Layer Protection" for the Token Embeddings and LM Head to keep the characters readable.
Compression: 4-bit (Q4_K_M) quantization for extreme efficiency.
Result: A surrealist classical Tamil model that is ultra-light (~300MB) and ultra-fast!

Check out the model and the experiment logic here: Sri-Vigneshwar-DJ/qwen-tamil-chaos-v1