Newest 'latency' Questions

Best practices

0 votes

1 replies

62 views

What are safe techniques to reduce latency in a backend application without compromising correctness?

I am working on a backend application and trying to reduce response latency, but I want to avoid optimizations that could introduce correctness issues. By correctness, I mean preserving accurate ...

Muhammad Ibtisam

31

asked Mar 23 at 17:35

1 vote

0 answers

86 views

Latency of warp add reduction instruction

The CUDA Programming Guide describe a warp instruction named __reduce_add_sync. What is the latency of the function, specifically in the Ampere architecture? Related sources: This table within the ...

Gal Avineri

576

asked Mar 10 at 20:36

0 votes

0 answers

23 views

Why does the reduce latency metric look incorrect for binding pull requests in Milvus?

Reduce latency metric looks incorrect for binding pull requests in Milvus While monitoring Milvus performance metrics, I noticed something unexpected related to binding pull requests. Specifically, ...

Veli Kıyak

41

asked Dec 19, 2025 at 5:58

3 votes

2 answers

329 views

What is the difference between 'baseLatency' and 'outputLatency' when calling AudioBufferSourceNode.start()?

I'm working on a Javascript browser application that requires very precise timing of audio playback via the AudioBufferSourceNode.start() method. I'm unsure on how I should treat 'baseLatency' and '...

Izzo

4,958

asked Dec 4, 2025 at 0:30

0 votes

0 answers

70 views

Lenovo M10+ gen 3 on screen keyboard issue

I have one of these tablets, just over 2.5 years old now. It came with Android 12 and was fine, then an auto-update forced it to 13, and it broke the on screen keyboard. What happens in practice is ...

nmw01223

1,767

asked Sep 14, 2025 at 8:53

2 votes

0 answers

254 views

Flutter SoLoud: How to reduce playback latency when syncing audio with UI (metronome-style)?

I want to play a short .wav sound every second and update a counter on the screen at the same time. I'm using the flutter_soloud package and based my code on their metronome example from GitHub. ...

Pavel Vlasák

21

asked Jun 23, 2025 at 20:23

1 vote

0 answers

90 views

Why doesn't PDF caching speed up/match raw text input latency on Gemini API?

Passing the text from OCR on a PDF is much faster than using the cache. Why? Passing raw text, Nth call takes ~2 seconds. response = client.models.generate_content( model=MODEL_NAME, contents=f&...

joshlevy89

279

asked Jun 21, 2025 at 6:00

1 vote

1 answer

198 views

How to execute some code just before cloudflare worker goes cold?

I am caching some global variables in memory across multiple requests in a cloudflare worker. I want to flush those variables into a persistent storage like KV cache or redis just before the worker is ...

Om Srivastava

13

asked May 24, 2025 at 18:11

2 votes

1 answer

306 views

Warm up HuggingFace Transformers models efficiently to reduce first-token latency in production

In production deployment of Hugging Face LLMs, the first inference call often has very high latency ("cold start"), even on a machine where the model is already loaded into memory. ...

Swati

244

asked Apr 28, 2025 at 6:49

-1 votes

1 answer

268 views

nodejs event loop lag 300ms and huge request latency [closed]

I have a next.js website that under the load 80 requests per seconds (for one pod) shows huge latency 20-30 seconds to handle a simple request like API call. I checked event loop lag and under the ...

Galina

556

asked Apr 24, 2025 at 7:00

0 votes

1 answer

208 views

Why does my CPU's efficiency core have a lower core-to-core latency?

I used https://github.com/nviennot/core-to-core-latency to measure my CPU's (Intel(R) Core(TM) Ultra 7 268V) core-to-core latency and these are my results: ~/Developer/core-to-core-latency main ❯ ...

weineng

364

asked Apr 19, 2025 at 3:11

0 votes

1 answer

98 views

How to profile/monitor a KDB tickerplant to trace causes of a slow tickerplant?

I'm trying to use KDB as a low-latency pub/sub message broker that persists all messages in a queryable format. However, I'm noticing the latency from when the tickerplant receives a message (i.e. ...

mchen

10.4k

asked Apr 15, 2025 at 14:51

1 vote

1 answer

115 views

What could cause strange delays while sending delays from a python aiohttp server?

We have a service with this architecture: HTTPS requests come into an A10 load balancer that does L4 load balancing Behind it are 2 backend servers with Apache running that terminate the TLS ...

moritz

12.7k

asked Mar 13, 2025 at 14:52

0 votes

1 answer

660 views

How to Reduce OpenAI Azure Response Time for Structured Output Using GPT-4o Mini (Fine-Tuned Model)?

I am using an Azure OpenAI GPT-4o Mini fine-tuned model to generate structured responses (e.g., JSON format). However, the response time is higher than expected, and I am looking for ways to optimize ...

BitByBitBuilder

1

asked Mar 3, 2025 at 10:15

1 vote

1 answer

66 views

Is latency available within traceroute in scapy

from scapy.all import traceroute def perform_traceroute(target): result, _ = traceroute(target, maxttl=50, verbose=True) print(result) for hop in result: query_packet = hop[0]...

Wayne McNicol

11

asked Feb 20, 2025 at 10:40

0 votes

0 answers

39 views

What is the most "empty" Linux system call to benchmark against? [duplicate]

I want to benchmark some performance aspects of a Linux device driver (a loadable module). Specifically, how fast certain code paths are when they are invoked from userspace via system calls. In ...

Grigory Rechistov

2,457

asked Dec 9, 2024 at 14:41

0 votes

1 answer

289 views

Durable Functions + Azure Storage Account -> Multiple retry to Attempting to retrieve authentication token for resource 'https://storage.azure.com/'

Anyone else having a problem to let durable functions runtime get connection to Azure Storage Account? When you are running Local environment on your VS Code. For me it is stuck on that and after many ...

Voi Se

1

asked Nov 19, 2024 at 19:33

0 votes

1 answer

125 views

Jmeter Response time is not matching with Web browser - 'Waiting for Server Response' time

We have facing a typical issue in Jmeter response time. We have an application, where we are searching for record with a keyword. Manually when we do the same in browser, we get it around 2-3 secs ...

Korupol Satish Reddy

1

asked Nov 13, 2024 at 6:44

1 vote

0 answers

119 views

How to interpert hwlat trace results on an ARM Cortex-A7?

I’m working with an STM32MP157D-DK1, trying to use the hwlat tracer for the board's arm Cortex-A7 CPU, to check for typical hardware latency. The following attempt was made to use the hwlat tracer on ...

user656857

71

asked Oct 16, 2024 at 16:28

1 vote

1 answer

106 views

serial latency on raspberry AMA0

I wish to make software PLC on raspberry pi. I did reserve one core just for PLC task and I got latency under 100 micro seconds. But the PLC need RS485 communication at 1 Megabit. It works, but the ...

eSlavko

460

asked Oct 3, 2024 at 6:16

1 vote

0 answers

131 views

How to measure latency between midi instrument and web midi api?

When using the web midi api for receiving midi messages from a midi instrument (via usb), how can we measure the latency time, that is: the time it took from playing a note with the instrument to ...

vuelicious

73

asked Sep 17, 2024 at 17:55

0 votes

1 answer

182 views

Is it possible to get less camera latency using OpenCV?

I'm learning python and i'm working with OpenCV. My goal is : When I execute my python script : it starts recording a 10 seconds sequence with my USB webcam which is a Logitech C920 and the video is ...

leuji241

1

asked Sep 2, 2024 at 20:31

1 vote

0 answers

129 views

Redis Latency Issues: High Response Time for Repeated Requests on WSL with phpredis

I am experiencing unexpected latency issues with Redis in my Laravel application. Here are the details: Setup: Redis Client: I am using the phpredis extension. Environment: Redis is running on Windows ...

جهاد الادهمي

11

asked Aug 16, 2024 at 6:21

-1 votes

1 answer

110 views

Configure latency parameters for CPU cache hit/miss latency in Gem5

I'm configuring the hit latency for a CPU model in Gem5. While I notice there are many latency parameters(eg. tag_latency, data_latency, frontend_latency, forward_latency, response_latency) which are ...

Bowen Tian

1

asked Aug 13, 2024 at 2:09

2 votes

1 answer

759 views

Latency issue - How to perfectly sync audio beeps (using Web Audio API) and visualization (using canvas + requestAnimationFrame)?

I use an audioContext = new AudioContext() of the Web Audio API for scheduling beep sounds for every beat (relative to a user-defined bpm and time signature) - let's call this stack of beeps "...

vuelicious

73

asked Aug 11, 2024 at 17:02

2 votes

1 answer

50 views

Can't Increase Throughput for Standalone Instance

I have been working with Milvus v2.2.9 in standalone mode using Docker Compose, running without any CPU limits. My current setup involves creating an index with 1 million embeddings using the IVF_SQ8 ...

Rashad Tockey

193

asked Aug 1, 2024 at 17:31

1 vote

1 answer

2k views

How to make Gstreamer RTSP -> UDP -> RTSP low latency pipeline?

I need to forward an rtsp stream from an axis camera through an udp port. I have this pipeline on the send side: gst-launch-1.0 rtspsrc location='rtsp://XXXX:[email protected]/axis-media/media.amp' ...

Marco Carandente

333

asked Jul 16, 2024 at 13:01

0 votes

0 answers

98 views

How to get info about Bluetooth output device latency in Python

I've been making a Python media player, but the audio and video de-synced when using a Bluetooth speaker, and I've been wondering how to get the Bluetooth latency using Python. I am using a MacOS ...

Super Gladiator

31

asked Jul 15, 2024 at 17:23

1 vote

0 answers

165 views

How to calculate ping between server and client?

I have a socket server in nodejs, a unity client and I try calculate the latency between them like below Server: socket.on("readable", () => { let data: Buffer = socket.read(); if(...

TIng

11

asked Jul 12, 2024 at 6:00

0 votes

1 answer

626 views

How many streams will be used by a single Bi-directional streaming gRPC call?

We are noticing performance issues when trying to use the bi-directional stream API call to stream data at 2000-4000 updates/second. We enabled debug logging and see that streamId is same for all the ...

Ramachandran Rajagopalan

3

asked Jul 11, 2024 at 16:32

0 votes

1 answer

208 views

Spanner JDBC Driver option to avoid latency

We are using the google spanner jdbc driver(google-cloud-spanner-jdbc) to enable connectivity from our application to cloud spanner. This was used instead of the google client libraries to reuse the ...

Raj Kumar

1

asked Jul 4, 2024 at 3:09

0 votes

0 answers

80 views

Questions about ParallelGC and Full GC behavior in long-running systems

I have a few concerns about ParallelGC and its behavior in long-running systems: It seems that only Full GC can reclaim the old generation, implying that the old generation can only grow until a Full ...

WtMonster

23

asked Jun 22, 2024 at 10:04

1 vote

0 answers

259 views

Azure Service Bus: Time Spent in Queue is Large

We have a .NET process where the process is subscribing to the Topic. One of the common observations is that the Time Spent in Queue is large. Azure Service Bus is running with 2 Messaging Units. The ...

GeekzSG

973

asked Jun 20, 2024 at 18:14

1 vote

0 answers

115 views

Why is there an increased response times when fronting a Lambda with Cloudfront?

I have a simple Lambda function that is fronted by CloudFront. This Lambda also has a function URL. Invoking the Lambda using the Function URL is way faster than invoking it via CloudFront. Based on ...

Mridang Agarwalla

45.4k

asked May 29, 2024 at 7:51

1 vote

0 answers

110 views

Unexpected Increase in Latency with More Readers in Multi-threaded Circular Queue Implementation in C++

I have implemented a circular queue using an array in C++. The queue uses two atomic variables, front and rear, to manage enqueue and dequeue operations. When an element is enqueued, rear is ...

Revanth Thota

11

asked May 20, 2024 at 7:47

0 votes

0 answers

82 views

How much latency does the operating system add transferring video over USB to the host application?

I am transferring raw uncompressed video from an image sensor to a C++ OpenCV application running on Linux. The signal chain is this: MIPI output Image sensor -> MIPI to USB 3.0 chip -> USB 3.0 ...

Rocketmagnet

5,960

asked May 13, 2024 at 10:18

0 votes

1 answer

78 views

AVR-GCC Unexpected interrupt lattency

I have some code where interrupt jitter is deal killer. There are just one interrupt source and that need to be as precise as possible. For ATMEGA the max interrupt response is the length of current ...

eSlavko

460

asked May 11, 2024 at 6:54

0 votes

0 answers

308 views

How do I reduce latency for Flutter Geolocator on iOS?

I've built an app using Flutter that allows users to track how far they have travelled from a starting point, a bit like any map-based or sports tracking app. With the app, a high accuracy is ...

James

729

asked May 11, 2024 at 5:42

0 votes

0 answers

151 views

Latency on the thermal camera while using multi-threading for multiple cameras

I am trying to do some real-time image processing by openCV with videos from web-camera and thermal-camera. I am using multi-threadings methods: one threading for one camera. The simplified code is ...

user24312540

1

asked Apr 13, 2024 at 22:19

0 votes

0 answers

84 views

Not able to understand where the extra time is getting added when a API is completed

I am experiencing following weird behaviour - From client(browser) a API call is made to server to fetch user details. The time required by this API to complete is around 650 ms. But in browser it is ...

4zh4r_s4l4ti

1,584

asked Mar 20, 2024 at 0:50

0 votes

1 answer

393 views

How to measure the latency of globally load balanced tagging server deployments?

I have several globally distributed tagging server deployments that are deployed in GCP Cloud Run. The Cloud Run deployments are reachable through serverless network endpoint groups. The traffic is ...

CPI

71

asked Mar 15, 2024 at 10:24

0 votes

1 answer

43 views

Understand different servers (v50, v95) in CloudKit Latency Telemetry

I'm migrating to CloudKit from Firebase, the database isn't in production yet. Trying to improve the asset downloading speed I opened Telemetry, and under Latency i found this data: 12/31/2023, 8:00:...

kelin

12.2k

asked Jan 3, 2024 at 11:30

1 vote

0 answers

186 views

DPDK Error in Transmitting/Receiving packets

I have a application running with 2 cores on each port. If I use lid type as RX_TYPE on both the ports and run a application it works fine by transmitting and receiving udp packets. But if I use lid ...

Das

11

asked Dec 18, 2023 at 13:53

1 vote

1 answer

284 views

PromQL query giving multiple data points within a week

I want to calculate the average p95 for /my-api over the last week, say Mon - Sun, but when I am plotting below query it's giving me multiple data points even within a week. I expect that this would ...

tusharRawat

633

asked Dec 8, 2023 at 15:18

-1 votes

1 answer

53 views

Optimizing Connection Speed in a Firebase and MySQL Authentication Setup

Suppose you have a web application where users need to log in to access personalized data stored in a MySQL database. The authentication process involves Firebase, and every time a user makes a ...

Seb St Johnston

35

asked Dec 5, 2023 at 22:17

0 votes

1 answer

808 views

Firebird.conf optimisation, slow response from both server and client

I'm experiencing high latency (about 10s for each process) when using sql account software where my database is from a server. Server spec: Processor: Xeon E5-2630 v3 Ram: 8GB OS: Window Server 2019 ...

Zhi Zheng

11

asked Dec 3, 2023 at 16:32

0 votes

0 answers

137 views

How to time a ListenableFuture

I want to track how long an async process takes. My general approach for synchronous processes is to record the current time, run the process, and finally calculate and record the difference. I use an ...

kane

6,117

asked Nov 27, 2023 at 18:36

2 votes

1 answer

265 views

Logging latency of a query in JPA

I have a CRUD repository in springboot java app. I am trying to measure the latency of the query form APP perspective and log it in the logger. What is a good elegant way to achieve this please ? I am ...

Chinmay

364

asked Nov 20, 2023 at 22:36

0 votes

0 answers

704 views

Reponse time for GPT models via openAI API vs internet version

I am making a small time test of the openai API, from my local internet connection and laptop, but I get times that are much larger than expected. With the following code: import openai import time ...

Thomas

331

asked Nov 13, 2023 at 17:41

0 votes

0 answers

36 views

rdtsc delta to nanosecond conversion [duplicate]

Recently, I have been trying to run some performance anlaysis on my program. I want to measure the latency of some functions in cpu ticks and later convert the delta to nanosecond. (I intentionally am ...

Hedgehog

115

asked Nov 5, 2023 at 8:43

Collectives™ on Stack Overflow