1,240 questions
Best practices
0
votes
1
replies
62
views
What are safe techniques to reduce latency in a backend application without compromising correctness?
I am working on a backend application and trying to reduce response latency, but I want to avoid optimizations that could introduce correctness issues.
By correctness, I mean preserving accurate ...
1
vote
0
answers
86
views
Latency of warp add reduction instruction
The CUDA Programming Guide describe a warp instruction named __reduce_add_sync.
What is the latency of the function, specifically in the Ampere architecture?
Related sources:
This table within the ...
0
votes
0
answers
23
views
Why does the reduce latency metric look incorrect for binding pull requests in Milvus?
Reduce latency metric looks incorrect for binding pull requests in Milvus
While monitoring Milvus performance metrics, I noticed something unexpected related to binding pull requests.
Specifically, ...
3
votes
2
answers
329
views
What is the difference between 'baseLatency' and 'outputLatency' when calling AudioBufferSourceNode.start()?
I'm working on a Javascript browser application that requires very precise timing of audio playback via the AudioBufferSourceNode.start() method. I'm unsure on how I should treat 'baseLatency' and '...
0
votes
0
answers
70
views
Lenovo M10+ gen 3 on screen keyboard issue
I have one of these tablets, just over 2.5 years old now. It came with Android 12 and was fine, then an auto-update forced it to 13, and it broke the on screen keyboard.
What happens in practice is ...
2
votes
0
answers
254
views
Flutter SoLoud: How to reduce playback latency when syncing audio with UI (metronome-style)?
I want to play a short .wav sound every second and update a counter on the screen at the same time.
I'm using the flutter_soloud package and based my code on their metronome example from GitHub.
...
1
vote
0
answers
90
views
Why doesn't PDF caching speed up/match raw text input latency on Gemini API?
Passing the text from OCR on a PDF is much faster than using the cache. Why?
Passing raw text, Nth call takes ~2 seconds.
response = client.models.generate_content(
model=MODEL_NAME,
contents=f&...
1
vote
1
answer
198
views
How to execute some code just before cloudflare worker goes cold?
I am caching some global variables in memory across multiple requests in a cloudflare worker. I want to flush those variables into a persistent storage like KV cache or redis just before the worker is ...
2
votes
1
answer
306
views
Warm up HuggingFace Transformers models efficiently to reduce first-token latency in production
In production deployment of Hugging Face LLMs, the first inference call often has very high latency ("cold start"), even on a machine where the model is already loaded into memory.
...
-1
votes
1
answer
268
views
nodejs event loop lag 300ms and huge request latency [closed]
I have a next.js website that under the load 80 requests per seconds (for one pod) shows huge latency 20-30 seconds to handle a simple request like API call.
I checked event loop lag and under the ...
0
votes
1
answer
208
views
Why does my CPU's efficiency core have a lower core-to-core latency?
I used https://github.com/nviennot/core-to-core-latency to measure my CPU's (Intel(R) Core(TM) Ultra 7 268V) core-to-core latency and these are my results:
~/Developer/core-to-core-latency main
❯ ...
0
votes
1
answer
98
views
How to profile/monitor a KDB tickerplant to trace causes of a slow tickerplant?
I'm trying to use KDB as a low-latency pub/sub message broker that persists all messages in a queryable format.
However, I'm noticing the latency from when the tickerplant receives a message (i.e. ...
1
vote
1
answer
115
views
What could cause strange delays while sending delays from a python aiohttp server?
We have a service with this architecture:
HTTPS requests come into an A10 load balancer that does L4 load balancing
Behind it are 2 backend servers with Apache running that terminate the TLS ...
0
votes
1
answer
660
views
How to Reduce OpenAI Azure Response Time for Structured Output Using GPT-4o Mini (Fine-Tuned Model)?
I am using an Azure OpenAI GPT-4o Mini fine-tuned model to generate structured responses (e.g., JSON format). However, the response time is higher than expected, and I am looking for ways to optimize ...
1
vote
1
answer
66
views
Is latency available within traceroute in scapy
from scapy.all import traceroute
def perform_traceroute(target):
result, _ = traceroute(target, maxttl=50, verbose=True)
print(result)
for hop in result:
query_packet = hop[0]...
0
votes
0
answers
39
views
What is the most "empty" Linux system call to benchmark against? [duplicate]
I want to benchmark some performance aspects of a Linux device driver (a loadable module). Specifically, how fast certain code paths are when they are invoked from userspace via system calls.
In ...
0
votes
1
answer
289
views
Durable Functions + Azure Storage Account -> Multiple retry to Attempting to retrieve authentication token for resource 'https://storage.azure.com/'
Anyone else having a problem to let durable functions runtime get connection to Azure Storage Account? When you are running Local environment on your VS Code.
For me it is stuck on that and after many ...
0
votes
1
answer
125
views
Jmeter Response time is not matching with Web browser - 'Waiting for Server Response' time
We have facing a typical issue in Jmeter response time.
We have an application, where we are searching for record with a keyword.
Manually when we do the same in browser, we get it around 2-3 secs ...
1
vote
0
answers
119
views
How to interpert hwlat trace results on an ARM Cortex-A7?
I’m working with an STM32MP157D-DK1, trying to use the hwlat tracer for the board's arm Cortex-A7 CPU, to check for typical hardware latency.
The following attempt was made to use the hwlat tracer on ...
1
vote
1
answer
106
views
serial latency on raspberry AMA0
I wish to make software PLC on raspberry pi. I did reserve one core just for PLC task and I got latency under 100 micro seconds.
But the PLC need RS485 communication at 1 Megabit. It works, but the ...
1
vote
0
answers
131
views
How to measure latency between midi instrument and web midi api?
When using the web midi api for receiving midi messages from a midi instrument (via usb), how can we measure the latency time, that is: the time it took from playing a note with the instrument to ...
0
votes
1
answer
182
views
Is it possible to get less camera latency using OpenCV?
I'm learning python and i'm working with OpenCV.
My goal is :
When I execute my python script : it starts recording a 10 seconds sequence with my USB webcam which is a Logitech C920 and the video is ...
1
vote
0
answers
129
views
Redis Latency Issues: High Response Time for Repeated Requests on WSL with phpredis
I am experiencing unexpected latency issues with Redis in my Laravel application. Here are the details:
Setup:
Redis Client: I am using the phpredis extension.
Environment: Redis is running on Windows ...
-1
votes
1
answer
110
views
Configure latency parameters for CPU cache hit/miss latency in Gem5
I'm configuring the hit latency for a CPU model in Gem5. While I notice there are many latency parameters(eg. tag_latency, data_latency, frontend_latency, forward_latency, response_latency) which are ...
2
votes
1
answer
759
views
Latency issue - How to perfectly sync audio beeps (using Web Audio API) and visualization (using canvas + requestAnimationFrame)?
I use an audioContext = new AudioContext() of the Web Audio API for scheduling beep sounds for every beat (relative to a user-defined bpm and time signature) - let's call this stack of beeps "...
2
votes
1
answer
50
views
Can't Increase Throughput for Standalone Instance
I have been working with Milvus v2.2.9 in standalone mode using Docker Compose, running without any CPU limits. My current setup involves creating an index with 1 million embeddings using the IVF_SQ8 ...
1
vote
1
answer
2k
views
How to make Gstreamer RTSP -> UDP -> RTSP low latency pipeline?
I need to forward an rtsp stream from an axis camera through an udp port.
I have this pipeline on the send side:
gst-launch-1.0 rtspsrc location='rtsp://XXXX:[email protected]/axis-media/media.amp' ...
0
votes
0
answers
98
views
How to get info about Bluetooth output device latency in Python
I've been making a Python media player, but the audio and video de-synced when using a Bluetooth speaker, and I've been wondering how to get the Bluetooth latency using Python.
I am using a MacOS ...
1
vote
0
answers
165
views
How to calculate ping between server and client?
I have a socket server in nodejs, a unity client and I try calculate the latency between them like below
Server:
socket.on("readable", () => {
let data: Buffer = socket.read();
if(...
0
votes
1
answer
626
views
How many streams will be used by a single Bi-directional streaming gRPC call?
We are noticing performance issues when trying to use the bi-directional stream API call to stream data at 2000-4000 updates/second. We enabled debug logging and see that streamId is same for all the ...
0
votes
1
answer
208
views
Spanner JDBC Driver option to avoid latency
We are using the google spanner jdbc driver(google-cloud-spanner-jdbc) to enable connectivity from our application to cloud spanner. This was used instead of the google client libraries to reuse the ...
0
votes
0
answers
80
views
Questions about ParallelGC and Full GC behavior in long-running systems
I have a few concerns about ParallelGC and its behavior in long-running systems:
It seems that only Full GC can reclaim the old generation, implying that the old generation can only grow until a Full ...
1
vote
0
answers
259
views
Azure Service Bus: Time Spent in Queue is Large
We have a .NET process where the process is subscribing to the Topic. One of the common observations is that the Time Spent in Queue is large.
Azure Service Bus is running with 2 Messaging Units. The ...
1
vote
0
answers
115
views
Why is there an increased response times when fronting a Lambda with Cloudfront?
I have a simple Lambda function that is fronted by CloudFront. This Lambda also has a function URL.
Invoking the Lambda using the Function URL is way faster than invoking it via CloudFront. Based on ...
1
vote
0
answers
110
views
Unexpected Increase in Latency with More Readers in Multi-threaded Circular Queue Implementation in C++
I have implemented a circular queue using an array in C++. The queue uses two atomic variables, front and rear, to manage enqueue and dequeue operations. When an element is enqueued, rear is ...
0
votes
0
answers
82
views
How much latency does the operating system add transferring video over USB to the host application?
I am transferring raw uncompressed video from an image sensor to a C++ OpenCV application running on Linux. The signal chain is this:
MIPI output Image sensor -> MIPI to USB 3.0 chip -> USB 3.0 ...
0
votes
1
answer
78
views
AVR-GCC Unexpected interrupt lattency
I have some code where interrupt jitter is deal killer. There are just one interrupt source and that need to be as precise as possible. For ATMEGA the max interrupt response is the length of current ...
0
votes
0
answers
308
views
How do I reduce latency for Flutter Geolocator on iOS?
I've built an app using Flutter that allows users to track how far they have travelled from a starting point, a bit like any map-based or sports tracking app. With the app, a high accuracy is ...
0
votes
0
answers
151
views
Latency on the thermal camera while using multi-threading for multiple cameras
I am trying to do some real-time image processing by openCV with videos from web-camera and thermal-camera. I am using multi-threadings methods: one threading for one camera. The simplified code is ...
0
votes
0
answers
84
views
Not able to understand where the extra time is getting added when a API is completed
I am experiencing following weird behaviour -
From client(browser) a API call is made to server to fetch user details. The time required by this API to complete is around 650 ms. But in browser it is ...
0
votes
1
answer
393
views
How to measure the latency of globally load balanced tagging server deployments?
I have several globally distributed tagging server deployments that are deployed in GCP Cloud Run.
The Cloud Run deployments are reachable through serverless network endpoint groups.
The traffic is ...
0
votes
1
answer
43
views
Understand different servers (v50, v95) in CloudKit Latency Telemetry
I'm migrating to CloudKit from Firebase, the database isn't in production yet. Trying to improve the asset downloading speed I opened Telemetry, and under Latency i found this data:
12/31/2023, 8:00:...
1
vote
0
answers
186
views
DPDK Error in Transmitting/Receiving packets
I have a application running with 2 cores on each port. If I use lid type as RX_TYPE on both the ports and run a application it works fine by transmitting and receiving udp packets. But if I use lid ...
1
vote
1
answer
284
views
PromQL query giving multiple data points within a week
I want to calculate the average p95 for /my-api over the last week, say Mon - Sun, but when I am plotting below query it's giving me multiple data points even within a week.
I expect that this would ...
-1
votes
1
answer
53
views
Optimizing Connection Speed in a Firebase and MySQL Authentication Setup
Suppose you have a web application where users need to log in to access personalized data stored in a MySQL database. The authentication process involves Firebase, and every time a user makes a ...
0
votes
1
answer
808
views
Firebird.conf optimisation, slow response from both server and client
I'm experiencing high latency (about 10s for each process) when using sql account software where my database is from a server.
Server spec:
Processor: Xeon E5-2630 v3
Ram: 8GB
OS: Window Server 2019
...
0
votes
0
answers
137
views
How to time a ListenableFuture
I want to track how long an async process takes.
My general approach for synchronous processes is to record the current time, run the process, and finally calculate and record the difference. I use an ...
2
votes
1
answer
265
views
Logging latency of a query in JPA
I have a CRUD repository in springboot java app. I am trying to measure the latency of the query form APP perspective and log it in the logger. What is a good elegant way to achieve this please ?
I am ...
0
votes
0
answers
704
views
Reponse time for GPT models via openAI API vs internet version
I am making a small time test of the openai API, from my local internet connection and laptop, but I get times that are much larger than expected. With the following code:
import openai
import time
...
0
votes
0
answers
36
views
rdtsc delta to nanosecond conversion [duplicate]
Recently, I have been trying to run some performance anlaysis on my program.
I want to measure the latency of some functions in cpu ticks and later convert the delta to nanosecond.
(I intentionally am ...