It should be possible to run multiple instances of an A2A server agent. We need to investigate how load balancing and session affinity work in this case.
One possibility is to define a property in application.properties that takes the public URL from an environment variable, and then have the agent card reference this property, so that every instance advertises the same externally reachable URL.
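To make the environment-variable idea concrete, here is a minimal sketch in plain Java. It assumes a hypothetical environment variable named A2A_PUBLIC_URL and a hypothetical helper class AgentPublicUrl; neither is part of the A2A SDK or Quarkus. In a Quarkus application the same effect could probably be had without any code by writing a property expression such as `a2a.public-url=${A2A_PUBLIC_URL:http://localhost:8080}` in application.properties (the property name is again an assumption) and injecting it where the agent card is produced.

```java
import java.net.URI;
import java.util.Map;

// Hypothetical helper, not part of the A2A SDK or Quarkus.
final class AgentPublicUrl {

    // Assumed variable name; use whatever the deployment actually sets.
    static final String ENV_VAR = "A2A_PUBLIC_URL";

    private AgentPublicUrl() {
    }

    // Returns the public URL from the given environment, or the fallback
    // when the variable is unset or blank. Rejects values that are not
    // absolute URLs so a misconfigured instance fails fast at startup.
    static String resolve(Map<String, String> env, String fallback) {
        String raw = env.get(ENV_VAR);
        String candidate = (raw == null || raw.isBlank()) ? fallback : raw.trim();

        // URI.create throws IllegalArgumentException on malformed input.
        URI uri = URI.create(candidate);
        if (uri.getScheme() == null || uri.getHost() == null) {
            throw new IllegalArgumentException("Public URL must be absolute: " + candidate);
        }
        return candidate;
    }
}
```

The value returned by `AgentPublicUrl.resolve(System.getenv(), "http://localhost:8080")` would then be the URL placed in the agent card, so all instances behind the load balancer advertise the same address rather than their own host and port.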
We could also check whether SmallRye Stork would be helpful here.