Replies: 1 comment
hey @kuatroka, are you asking about the promptfoo UI or the deepeval UI?
-
Hi, I want to evaluate different coding agents with different models, and to create, annotate, and evaluate the output manually, without any actual model runs or API calls.
I want to give codex / claude code / droid / opencode, etc. (any permutation of model and agent) the same prompt, run them, then manually evaluate and annotate whether they did what I asked, plus annotate some subjective outcome (pretty UI or bad UI, etc.), and afterwards view reporting results across my experiments with different agent/model permutations and prompt versions.
Basically, I want an option to create a custom model name (not a full working setup, no working API), for example "codex/gpt-5-codex-mini", "codex/gpt-5-codex-high", or "cc/opus-4-5", give them "prompt20-v-1" and "prompt20-v-2", and, without running them through promptfoo, annotate the result and even skip the output completely or fill it in manually.
Is there a way to do this now, without creating any new features?
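To make it concrete, here is roughly the kind of config I have in mind. This is only a sketch, not something I've gotten working: I'm assuming promptfoo's YAML config format, and the `echo` provider plus `label` field are my guess at how a no-API "model" could be faked, so the exact field names may be wrong.

```yaml
# promptfooconfig.yaml -- sketch only; no API is ever called.
prompts:
  - "prompt20-v-1"   # stand-in; the real prompt text would go here
  - "prompt20-v-2"

providers:
  # echo just returns the prompt unchanged; I would paste each agent's
  # real output in by hand (or leave it blank) and grade it manually.
  - id: echo
    label: codex/gpt-5-codex-mini
  - id: echo
    label: codex/gpt-5-codex-high
  - id: echo
    label: cc/opus-4-5
```

The `label` would then show up in the results view as if it were a distinct model, which is all I really need for the manual-annotation workflow.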
Thanks