59,968 questions
0
votes
0
answers
13
views
Why do read requests fail during collection load and alias switch after bulk insert in Milvus?
Read requests fail or experience latency spikes when loading a bulk-inserted collection and switching alias in Milvus
I’m running a Milvus cluster (v2.5.x) on Kubernetes (EKS) using milvus-operator, ...
-4
votes
0
answers
90
views
Traefik Proxy in Kubernetes, can you have unique health checks per service? [closed]
Traefik Proxy (v3.6) deployed with native Kubernetes on non-EKS AWS provider, how can I define a unique health-check per back-end service that will be used to verify back-end service availability?
I ...
-2
votes
0
answers
54
views
API Deployment Failing in Uptime Monitoring App [closed]
I'm building a basic uptime monitoring app (frontend + backend). The application works locally, but I'm running into errors while deploying the API.
I've tried multiple approaches and fixes suggested ...
0
votes
0
answers
50
views
Monitoring for misconfigured Istio VirtualService [closed]
We have an Istio VirtualService with a Destination that referred to a service without a port, which is allowed when that service only has one port:
route:
- destination:
host:...
0
votes
0
answers
53
views
Google tag gateway in GKE Gateway API
I'm implementing server-side tagging for Google Tag Manager through my GKE Gateway API.
I was able to configure the server-side tag manager on the same domain, and GET https://www.example.com/...
-1
votes
0
answers
53
views
Guidance needed for setting up self-managed ClickHouse on AWS EKS [closed]
I’m working on deploying a self-managed ClickHouse setup on AWS EKS and need some help locating the right resources. Specifically, I’m looking for:
A build or deployment guide for running ClickHouse ...
0
votes
0
answers
42
views
ArgoCD applicationset - path and .argocd-source.yaml issues
I have a test setup where I have a base folder structure. This is working and I create the application for staging with the values and the configmap from the staging folder.
However, I also need to be ...
1
vote
2
answers
68
views
How to handle database schema migrations in a Kubernetes rolling update without downtime?
I am designing a CI/CD pipeline for a Python FastAPI application using Kubernetes (EKS) and PostgreSQL.
We are using a Rolling Update strategy for deployment. However, I have an architectural ...
Advice
3
votes
0
replies
84
views
How can I control the shutdown order of Spring-managed beans that I don’t create myself?
I have a Spring Boot application running in Kubernetes. I’m trying to implement a graceful shutdown flow using a readiness probe:
App receives SIGTERM.
App should start returning 503 from /health so ...
1
vote
1
answer
61
views
How to add a tty on pod?
I can exec a bash shell using this command
kubectl exec --stdin --tty ftp1-7686766766-8v5s2 -- /bin/bash
bash-4.2#
but I want to know why kubectl attach doesn't work
kubectl attach -it ftp1-7686766766-...
Advice
1
vote
2
replies
63
views
Turning off buffering in k8s nginx ingress greatly increases client side latency and streaming in downstream
I have the following architecture:
Browser App (React) -> Nginx K8S Ingress -> Streaming Service A (Kotlin, POD) -> Streaming Service B (Java,POD)
From the browser I upload a json array of ...
1
vote
0
answers
61
views
Evaluate Kubernetes CEL Condition locally using Golang
I am creating a Kubernetes controller that needs to modify resources when specific conditions are met. I would like to define my match conditions using CEL expressions in a configuration file similar ...
Best practices
0
votes
0
replies
40
views
Resource propagation on multi-cloud environment using Karmada
As far as I know, resources created on karmada-api-server propagate to member clusters (based on propagation policies).
But I need to set up multi-cluster synchronization.
e.g. When a ...
1
vote
0
answers
48
views
Microsoft.ML C#: GPU not found in K8s/Docker container
I have created a .NET app that uses Microsoft.ML.OnnxRuntime.Gpu for inference. Now I'm trying to integrate it with Azure Kubernetes.
We have made the setup with Tesla T4 GPU and we confirmed it's ...
2
votes
0
answers
51
views
Socket with SignalR and kubernetes
I encountered a problem when deploying a socket application using signalr. I connect with the longpolling method and it still works, but the websocket gives an error: Error: Failed to start the ...
0
votes
0
answers
43
views
Issue with Custom Engine Agent for Copilot – Service Not Responding After Deployment to Kubernetes
I'm trying to create a Custom Engine Agent for Copilot. I have set up a bot, an app registration, and a manifest file. When I run the service locally using DevTunnel, everything works fine—the bot ...
Advice
0
votes
0
replies
38
views
Does an EC2 instance automatically run my Dockerfile if I copy it onto the server, or do I need to reboot it first?
If I scp my Dockerfile into an EC2 instance, does it automatically become a container, or do I need to reboot the instance first? Also, is there a preferred AZ for this, or will us-east-2b do? I need ...
Advice
0
votes
0
replies
31
views
How do Java containers react when the container runtime updates a container's cgroup settings?
Kubernetes v1.33+ supports in-place pod resize. I am trying to understand how the latest JDK versions react to an in-place pod resize, i.e., when the container runtime updates the container's cgroup settings and ...
0
votes
1
answer
47
views
ActiveProcessorCount is showing "-1" even after specifying CPU limits in Container
I have my application running on an AL2023 node in an EKS cluster.
Below is the snippet of the Container resources in the Pod definition
resources:
# memory ~ heap space +10%
# CPU ~ no more than 1/8 of ...
0
votes
0
answers
42
views
java.net.UnknownHostException in a Helm Project
What are the steps I should take to solve the error:
17-11-2025 14:00:00.002 [org.test.ingest.io] ERROR o.s.s.s.TaskUtils$LoggingErrorHandler.handleError - Unexpected error occurred in scheduled task
...
1
vote
0
answers
48
views
appsettings.json not read after deploying to Minikube (values become empty, DB connection fails)
I'm running an ASP.NET Core application.
Everything works correctly when using Docker Compose — all values from appsettings.json load normally, and my services connect to Postgres and RabbitMQ.
But ...
Advice
2
votes
0
replies
57
views
ArgoCD ApplicationSet and Workflow to create ephemeral environments from GitHub branches
How would you rate this GitOps workflow idea with ArgoCD + ApplicationSet + PreSync hooks?
In my organization we already use Argo CD for production and staging deployments. We're considering giving ...
2
votes
0
answers
75
views
MongoDB 8.0: StaleDbVersion triggers shard-local majority write after step-up; fails with 2 data + 1 arbiter when one data pod is down [closed]
Environment
Deployment: Kubernetes (RKE2), Bitnami MongoDB Sharded Helm chart
MongoDB: 8.0.8
mongosh: 2.5.0
Topology:
Config servers: 3 (CSRS)
Mongos: 3
Shards: 3 shards total; each shard is a replica ...
-3
votes
1
answer
67
views
Terraform issue when I add a new value to a ConfigMap
I’m trying to add a new record to my ConfigMap. It’s the third time I’ve done this operation – I made the previous edits in dev and stage – but in prod, when I add the record, Terraform tries to ...
0
votes
1
answer
62
views
Failed to bind properties under 'server.address' with helm
I want to deploy my app on Kubernetes with Helm.
I have an Ingest Service running with Spring Boot that is made of two parts:
Fiprocessor
Io
Both of these parts use the same deployment file and the same ...
1
vote
0
answers
50
views
TLS Handshake disappearing when modifying it with eBPF (TC egress)
I have two k8s pods that communicate using TLS. I am loading an eBPF TC program on the egress of the sender pod. This program adds 28 bytes to the optional space of the TCP headers after TCP options. If I ...
1
vote
1
answer
97
views
YQ: load a text file as array and use it for an operation
I have a yaml file with an arbitrary amount of documents, and I'm trying to replace all missing namespaces for namespaceable resources with an arbitrary input one.
Getting the non-namespaceable ...
Advice
0
votes
0
replies
36
views
Why is container.cpu.usage metric from kubeletstats receiver bigger than k8s.pod.cpu.usage metric from the same receiver
I have deployed opentelemetry-collector-contrib 0.139.0 to a Kubernetes cluster as a DaemonSet. The Kubernetes cluster is deployed on AWS EKS with a Node group of two EC2 Nodes. Kubernetes version is ...
0
votes
0
answers
31
views
Error in V1Binding when using a custom scheduler
I encountered the error
V1_binding.py", line 156, in target
raise ValueError("Invalid value for `target`, must not be `None`")
ValueError: Invalid value for `target`, must not be `...
Tooling
0
votes
0
replies
24
views
OpenResty Kubernetes frontend
User-->Load balancer-Frontend VPC-Openresty Nginx --->Router-->Priv LB -->Kubernetes
I want to ask about OpenResty in Kubernetes. I am using it now as a reverse proxy running as tasks in AWS and ...
1
vote
0
answers
163
views
Java 21+ container support: active processor count
In containers running Java 21+, I would like to have the active processor count reflect the actual available hardware cores, regardless of configured CPU shares.
CSR JDK-8281571 "Do not use CPU ...
5
votes
1
answer
3k
views
Docker Hub officially removed openjdk:8-jre from their repository; what is the best alternative for it? [duplicate]
While building the image for one of our applications, I observed that I am unable to pull openjdk:8-jre from the official docker.io
error: [ERROR]: [#3 ERROR: docker.io/library/openjdk:8-jre: not found]
I ...
0
votes
0
answers
64
views
KubernetesPodOperator - [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1006)
I’m running Apache Airflow inside a Docker container and trying to use the KubernetesPodOperator to run a simple “hello world” pod in an external Kubernetes cluster (not the same one where Airflow ...
0
votes
1
answer
70
views
How to translate caddy to ingress nginx controller
I have this Caddy config and I want to migrate it to the ingress-nginx controller
@restrictAccess {
path /path1/loc1/*
path /path2/loc3/*
}
route @restrictAccess {
...
0
votes
1
answer
92
views
How to overwrite an env value with Helm
I want to deploy an app on Kubernetes with Helm. This app is composed of multiple parts, 2 of them are a Spring backend and a Mongo database.
I want to deploy them in 2 pods and have them talk with ...
0
votes
1
answer
116
views
How can I support multiple OIDC authentication providers?
I have built a Blazor standalone webassembly in .NET 9 for a new application that we are building in our company and have successfully integrated it into our Okta system for authentication of the user....
0
votes
0
answers
71
views
Tilt Port Forwarding to specific pod in Kubernetes Resource
We use Tilt to manage our local development Kube clusters. I've integrated the langfuse helm chart in the Tiltfile, which functions properly, however I'm having trouble forwarding a port to the web ...
0
votes
1
answer
57
views
Helm subchart uses baseline values.yaml instead of merged values.yaml + values-dev.yaml when deployed via parent chart
Here is my helm chart structure:
app/
├── Chart.yaml
├── values.yaml
├── values-dev.yaml
└── templates/
app-test/
├── Chart.yaml
├── values.yaml
├── values-dev.yaml
└── charts/
...
0
votes
0
answers
54
views
Why does KEDA create a second Selenium Node for the second Chrome session instead of using available slots on the first node
I'm running Selenium Grid in Minikube using the official Docker Selenium Helm chart, with KEDA enabled for autoscaling. My goal is to scale Chrome nodes based on session demand, where each node can ...
0
votes
0
answers
41
views
ArgoCD database patching
We have a namespace consisting of 3 applications:
Database patcher
API
Web site
Currently we are using the ArgoCD app-of-apps pattern to roll this out. Each application above is therefore its own ...
1
vote
1
answer
85
views
Handling cleanup for tasks which might be OOMKilled
I have some Python code running in k8s which in some cases is OOMKilled, leaving me no time to clean up, which causes bad behavior.
I've tried multiple approaches but nothing seems quite ...
0
votes
1
answer
84
views
Helm doesn't remove initContainers from Deployment
I've created a Deployment template using Helm (v3.14.3) with support for setting initContainers. Recently I realized that one of the initContainers removed from values.yaml is still present in the cluster. I tried ...
0
votes
0
answers
55
views
K8s Leader election with Micronaut
I'm trying to follow this doc from Micronaut to implement leader election as a lock on the scheduler, so that I can make sure the scheduler runs on only one pod. (I know we have ShedLock, but ...
0
votes
0
answers
56
views
Spatial join without Apache Sedona
Currently I'm working with a specific version of Apache Spark (3.1.1) that cannot be upgraded. Because of that, I can't use Apache Sedona, and version 1.3.1 is too slow. My problem is the following code that ...
0
votes
1
answer
70
views
My Django API “next” link uses a stale hostname only when receiving requests from GKE services (like Cloud Run)
What are the details of your problem?
I have an application in Django that is deployed using GKE. It uses an ingress to deploy it...
Those manifests.yml that are used for deploying applications on ...
0
votes
1
answer
110
views
Layers of timeout in an Istio+k8s managed cluster
I have a cluster of microservices. UI calls API1 (assuming it goes through ingress gateway, correct me if I am wrong), API1 calls API2 via RestTemplate.
The API2 process is bulky and takes roughly 1.5 ...
0
votes
1
answer
104
views
Flux with SOPS doesn't decrypt stringData [closed]
I'm trying to use fluxcd on my kubernetes cluster, which is working perfectly fine.
Now I wanted to add prometheus/grafana stack and use sops to decrypt the basic_auth part in the scrapingConfig from ...
0
votes
1
answer
61
views
Render webpage that sets X-Frame-Options and redirects to OAuth flow in IFrame [closed]
I have a requirement where I need to render a web app that uses oauth to login in an iframe. The iframe setup is for tutorials, where the tutorial content is in one column, and the web app is in ...
0
votes
0
answers
74
views
Python Socket.IO with FastAPI in Kubernetes 499 and 426 error
I am trying to set up a Socket.IO server, but client connections are failing with either a 499 or a 426 error.
I have tried port forwarding from the pod to my local machine and running a curl command, which also ...
-3
votes
1
answer
157
views
How to update the pods from a StatefulSet after changing its image? [closed]
I have an EKS cluster with a StatefulSet. I had to update the image of this StatefulSet, so I ran:
$ kubectl set image statefulsets liferay-default liferay-default=liferay/dxp:2025.q3.1
I expected it ...