11 12 16

Aksel Joonas Reedi

akseljoonas

AI & ML interests

None yet

Recent Activity

updated a Space about 6 hours ago

smolagents/ml-agent

updated a dataset 6 days ago

akseljoonas/hh-rlhf-conversational

updated a model 6 days ago

akseljoonas/browsergym-grpo-functiongemma-270m-it

View all activity

Organizations

Articles 2

Article

753

SmolLM3: smol, multilingual, long-context reasoner

Article

CodeAgents + Structure: A Better Way to Execute Actions

View all Articles

Collections 3

View 3 collections

spaces 6

Qwen3 Reversed (DPO)

🤖

Qwen3-4B DPO demo on ZeroGPU

Plan and generate experimental validation methods for AI projects

models 64

akseljoonas/browsergym-grpo-functiongemma-270m-it

Text Generation • 0.3B • Updated 6 days ago • 9

akseljoonas/browsergym-dapo-qwen2.5-1.5b

Updated 8 days ago

akseljoonas/finance-sentiment-classifier

Text Classification • 0.1B • Updated 14 days ago • 71

akseljoonas/qwen3-4b-dpo-hh-rlhf-reversed

Text Generation • 4B • Updated 21 days ago • 96

akseljoonas/Qwen3-4B-DPO

Text Generation • 4B • Updated 21 days ago • 90

akseljoonas/qwen3-4b-instruct-2507-dpo-hh-rlhf-reversed

Updated 21 days ago

akseljoonas/Qwen3-1.7B-DPO-hh-rlhf

Text Generation • 2B • Updated 22 days ago • 157

akseljoonas/qwen3-1.7b-s1k-lr1e-4

Text Generation • 2B • Updated 28 days ago • 17

akseljoonas/qwen3-1.7b-s1k-lr5e-5

Text Generation • 2B • Updated 28 days ago • 18

akseljoonas/qwen3-1.7b-s1k-lr1e-5

Text Generation • 2B • Updated 28 days ago • 12

View 64 models

datasets 22

akseljoonas/hh-rlhf-conversational

Viewer • Updated 6 days ago • 169k • 44

akseljoonas/hh-rlhf-dpo-format

Viewer • Updated 22 days ago • 169k • 16

akseljoonas/ToolMind

Updated 28 days ago • 18

akseljoonas/s1k-qwen3-4b-completions

Viewer • Updated 28 days ago • 5 • 12

akseljoonas/benchmark-test2

Viewer • Updated Dec 18, 2025 • 154 • 6

akseljoonas/benchmark-tasks

Viewer • Updated Dec 10, 2025 • 253 • 7

akseljoonas/hf-agent-leaderboard

Preview • Updated Nov 28, 2025

akseljoonas/benchmark-test

Viewer • Updated Nov 12, 2025 • 69 • 4

akseljoonas/hf-agent-benchmark

Viewer • Updated Oct 30, 2025 • 29 • 24

akseljoonas/hf-agent-rubrics

Viewer • Updated Oct 30, 2025 • 30 • 6

View 22 datasets

Aksel Joonas Reedi

AI & ML interests

Recent Activity

Organizations

Articles 2

SmolLM3: smol, multilingual, long-context reasoner

CodeAgents + Structure: A Better Way to Execute Actions

Collections 3

spaces 6 Sort: Recently updated

Qwen3 Reversed (DPO)

Qwen3 Dpo Tracking

Qwen3-4B-DPO Chat

Trackio

Qwen3-4B Chat

Experimental Evaluation

models 64 Sort: Recently updated

datasets 22 Sort: Recently updated

spaces 6

models 64

datasets 22