Replies: 3 comments 4 replies
-
|
Can you share the model config? If you're getting bad results, it might be due to a missing thinking config.
-
|
I want to chime in and say I agree. What I want from Coder is the infra around the harness so that we can easily run these tools fully on our cloud. That is the thing Coder provides that nobody else does. Maybe Mux is great, but I think locking the harness tightly to the infra means you now have to compete at being state of the art for every single use case and benchmark. I would much rather Coder focus on how to schedule, manage, and coordinate autonomous agent execution and the human workflows around it than on trying to replace something lots of other people are working on.
-
|
👋🏼 I really appreciate the kind words about Coder and your feedback on our direction! This helps make the Coder project better, and we've circulated the thread internally a lot :) I wanted to share some of our rationale for the Tasks-to-Agents migration and what it means for Coder.
The TL;DR is that we will never force users to use our harness (Agents), and we are actively working to make Coder work better with any harness (improved lifecycle, support for Cursor self-hosted workers, etc.). We just didn't believe Tasks was the answer, and we also believe that we can do something unique with Agents.
We originally built Tasks because we didn't want to build or compete with another harness; we really enjoyed Claude Code, Codex, opencode, etc. and were optimistic that over time they would become easier to integrate with. Because these harnesses lacked clear APIs at the time, we built https://github.com/coder/agentapi as a stopgap translation layer, which basically scraped stdout (and wrote to stdin) in order to provide a UI and API for communicating with Agents. Needless to say for anybody who has used it, the UX and architecture of Tasks/AgentAPI was pretty bad and we were never able to fully iron out the bugs. The harnesses would (rightfully) release new features, and those features would either be unsupported or break the Tasks integration altogether.
It felt very similar to the early days of Coder v1, when we only had hardcoded support for Kubernetes and had to change our product any time a user needed access to a new or different Kubernetes feature or any other type of compute (Windows, EC2, etc.). After a couple of years of pain, we wrote Coder v2 (the Coder we know today), which uses Terraform to let users define their own infra requirements.
While there are cleaner protocols than AgentAPI (like ACP), we still decided that being a "harness wrapping another harness" (which is effectively what I consider Tasks to be) means giving our users little control over which harness features they can use. For example, a new Claude Code feature (dynamic workflows) may not be available in our API (or in ACP), which limits the benefits of running the harness in Tasks versus standalone. Even if the harnesses were different, because only a subset of features is supported, they'd often appear functionally the same. It took us a few projects/experiments to land on this conclusion, including Tasks, Mux, and Blink, as well as the feedback we've gotten from our users and enterprise customers. Most of the feedback we got on Tasks was "this is great, but the UI/lifecycle is buggy" or "API support and integrations is key for us," and the feedback we got on Mux was "the UI/UX is smooth, but we need it deeply integrated into Coder," which is where Agents came from. We also learned a lot about what it takes to build a harness, which helped us develop the belief that very little "magic" belongs in the harness layer. Therefore, our strategy is threefold:
The third one is new for us, but it is in progress and (frankly) the core lifecycle of Workspaces, regardless of the harness, does need to be revisited with agentic workflows in mind. I believe workspaces need to start up faster, be aware of the agentic processes running in them, spin down when not in use, and preserve/replicate context on demand, all regardless of the harness being used. I'm personally optimistic we can find an "SSH" or "Terraform" of harness<>compute interactions so Coder can support any harness just like we support any IDE or any compute provider, but even without that we're working towards integrating with the harnesses that do support remote/self-hosted runners.
-
Hey Coder Team!
I'm very excited to see active development on Coder. I personally think Coder can shape the future of engineering workflows now more than ever before.
I wanted to write a discussion piece to understand your reasoning for introducing your own top-level layer of AI development (Coder Agents) and to probe for some alternatives.
I think the Agents interface is fantastic and very exciting. It's the most ergonomic way yet to fire off AI tasks within workspaces.
However, I'm of the opinion that the product, even when equipped with a SOTA model, will vastly underperform Claude Code (or Codex).
Take the recently announced dynamic workflows from Anthropic as an example of something that will never be possible with Coder Agents as it is designed at the moment.
We rolled this out to some of our engineers today (with Opus 4.8), and we have all noticed that the quality of the generated code is extremely subpar and not mergeable compared with one-shot Claude Code prompts.
I write this discussion piece to urge you to consider finding some way to wrap/pass-through to the popular coding harnesses. If you do this, we would be very interested in becoming an enterprise customer.
Thanks for your consideration, and apologies for the poor formatting and half-baked thoughts here. I wanted to get the message across, but didn't want to spend a long time composing a stronger message.
Happy to talk offline if you are interested.