Post
54
Just released a new dataset designed for training reasoning models on Meta (Facebook/Instagram) advertising fatigue detection!
What is it? A GRPO (Group Relative Policy Optimization) training dataset with 200+ carefully crafted scenarios covering:
π Fatigue Signal Detection: CTR drops, CPM spikes, frequency analysis
π©Ί Performance Diagnosis: Root cause analysis frameworks
π Strategy: Creative refresh cadence, testing frameworks
π Analysis: ROI calculations, metric interpretation
Why GRPO? GRPO training helps models learn structured reasoning. Each response follows the <thinking> and <answer> format.
Check it out here: Sri-Vigneshwar-DJ/meta-fatigue-grpo-dataset
What is it? A GRPO (Group Relative Policy Optimization) training dataset with 200+ carefully crafted scenarios covering:
π Fatigue Signal Detection: CTR drops, CPM spikes, frequency analysis
π©Ί Performance Diagnosis: Root cause analysis frameworks
π Strategy: Creative refresh cadence, testing frameworks
π Analysis: ROI calculations, metric interpretation
Why GRPO? GRPO training helps models learn structured reasoning. Each response follows the <thinking> and <answer> format.
Check it out here: Sri-Vigneshwar-DJ/meta-fatigue-grpo-dataset