Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: EntityProcess/agentv
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: main
Choose a base ref
...
head repository: EntityProcess/agentv
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: research/tbench-integration
Choose a head ref
Checking mergeability… Don’t worry, you can still create the pull request.
  • 1 commit
  • 1 file changed
  • 2 contributors

Commits on Feb 8, 2026

  1. docs: add Terminal-Bench integration research report

    Analyze terminal-bench (tbench.ai) features for potential AgentV integration.
    Key opportunities: failure mode taxonomy, pass@k metrics, checkpoint/resume,
    adapter pattern for external benchmarks.
    
    Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
    christso and claude committed Feb 8, 2026
    Configuration menu
    Copy the full SHA
    ab54d1f View commit details
    Browse the repository at this point in the history
Loading