Skip to main content
Filter by
Sorted by
Tagged with
0 votes
0 answers
54 views

I'm trying to run the following script, but after a few seconds (approximately 4) the job ends without creating the output_file with the data and without creating outout and error slurm files. #!/bin/...
andreg's user avatar
  • 11
Advice
0 votes
6 replies
131 views

I was trying to read data (chars) from a large text file (~250MB) in 1KB chunks and was very surprised that reading that file using either FileReader or BufferedReader takes exactly the same time, ...
sebkaminski16's user avatar
1 vote
1 answer
201 views

I have a bounded queue with small size that definitely fit in int. So I want to use atomic<int> instead of atomic<size_t> for indexing/counter, since int is smaller it should be faster. ...
Huy Le's user avatar
  • 1,999
1 vote
0 answers
113 views

Using go version go1.25.3 darwin/arm64. The below implementation is a simplified version of the actual implementation. type WaitObject struct{ c chan struct{} } func StartNewTestObject(d time....
Ahmad Sameh's user avatar
0 votes
1 answer
75 views

I was testing various expressions of a sixth order polynomial to find the fastest possible throughput. I have stumbled upon a simple polynomial expression length 6 that provokes poor code generation ...
Martin Brown's user avatar
  • 3,666
1 vote
1 answer
91 views

Today, I researched Base64 encoding versus other methods and whether to use it in a JSON API, considering the 33-37% size overhead that Base64 introduces and all sorts of related topics. To ...
tfn's user avatar
  • 87
2 votes
0 answers
128 views

I have spent some time trying to speed up code that uses Horner's method for evaluating modest length polynomials (N < 32). I have a solution using loop unrolling that works very well at -O2 or ...
Martin Brown's user avatar
  • 3,666
3 votes
1 answer
152 views

I'm trying to benchmark the performance of a library I've written that can parse large JSON files into both an object model and a JsonDocument. So far as I can tell I'm doing everything right, but I'...
Ari Roth's user avatar
  • 5,582
7 votes
1 answer
342 views

I want to use RDTSC in Rust to benchmark how many ticks my function takes. There's a built-in std::arch::x86_64::_rdtsc, alas it always translates into: rdtsc shl rdx, 32 or rax, rdx ...
Daniil Tutubalin's user avatar
4 votes
2 answers
223 views

I've run some benchmarks on Math.Log, System.Numerics.Vector.Log, System.Runtime.Intrinsics.Vector128.Log, Vector256.Log and Vector512.Log and the results were pretty surprising to me. I was expecting ...
user31260114's user avatar
1 vote
1 answer
60 views

I'm looking for some code or some benchmark to roughly asses the pause times or cpu load caused by some GC in order to get some rough estimate how efficient it is. I just want to see whether some GC ...
OlliP's user avatar
  • 1,617
3 votes
1 answer
202 views

I wish to benchmark various hashmaps for the <K,V> pair <u8, BoxedFnMut> where BoxedFnMut. type BoxedFnMut = Box<dyn FnMut() + Send + 'static>; To do this, I am using divan(0.1.21) ...
Naitik Mundra's user avatar
4 votes
1 answer
203 views

I have a really simple benchmark to measure and compare performance of Dictionary<string, int> and ConcurrentDictionary<string, int>: [MemoryDiagnoser] public class ...
Pupkin's user avatar
  • 1,223
0 votes
0 answers
165 views

I'm trying to run cargo bench on my new MacBook (Apple Silicon, macOS [Sequioa version 15.5]), but I get this error: cargo(31826) MallocStackLogging: can't turn off malloc stack logging because it was ...
ajita asthana's user avatar
2 votes
0 answers
59 views

This question is somewhat specific to Rust's Criterion, but I have kept it general so that anybody with knowledge about benchmarking can help. In my Rust codebase, I have a struct Model that is very ...
aleferna's user avatar
  • 141
27 votes
5 answers
9k views

First of all we create a random binary file with 100.000.000 bytes. I used Python for this: import random import os def Main(): length = 100000000 randomArray = random.randbytes(length) ...
Vasilis Kontopoulos's user avatar
1 vote
1 answer
101 views

I am writing a benchmarking framework for compiler-like programs. For benchmarking, I use a docker container (for reproducibility). However, i still measure quite a bit of noise (up to 5%!). My ...
Frobeniusnorm's user avatar
2 votes
1 answer
197 views

This Python 3.12.7 script with NumPy 2.2.4 and Numba 0.61.2: import numpy as np, timeit as ti, numba as nb def f0(a): p0 = a[:-2] p1 = a[1:-1] p2 = a[2:] return (p0 < p1) & (p1 > p2) ...
Paul Jurczak's user avatar
  • 8,650
0 votes
1 answer
93 views

I'm learning PostgreSQL Clustering abilities and I would like to compare performance of the same query with table not clustered and with table clustered. I tried to generate 25 million user events and ...
rela589n's user avatar
  • 1,224
2 votes
0 answers
125 views

I'm currently benchmarking PySpark vs the growing alternative Polars. Basically I'm writing various queries (aggregations, filtering, sorting etc.) and measure the execution time, RAM and CPU. I ...
Ernest P W's user avatar
1 vote
2 answers
157 views

My benchmark attempts to compare AoS vs SoA for 1 000 000 items. The result for 1 000 000 items: | Method | Mean | Error | StdDev | |---------------------- |---------:|-------...
user3821908's user avatar
0 votes
0 answers
52 views

In the following FastAPI app: from fastapi import FastAPI from sqlalchemy import create_engine, text from sqlalchemy.orm import Session engine = create_engine("postgresql+psycopg://postgres:...
Dante's user avatar
  • 880
0 votes
0 answers
63 views

I'm benchmarking some functions' memory in R and python, and had a great time getting results with the bench package in R that tracks all allocations by each call, allowing me to get the total and ...
Qile0317's user avatar
1 vote
0 answers
116 views

I am writing a RISC-V assembly program whose goal is to assess the performance of main memory, in read access only for now. I have thought about a simple benchmark code, that would load multiple ...
SFV's user avatar
  • 11
-1 votes
1 answer
78 views

I am trying to run a benchmark on some family of algorithms. I have multiple algorithms, each of them with one hyperparameter, and I want to test them with multiple data sizes. Each run takes ~60 ...
David Davó's user avatar
0 votes
2 answers
75 views

I'm using profvis to profile my functions in R, but I want to extract specific timings for subfunctions. For example if I run a = profvis({ dat <- data.frame( x = rnorm(5e4), y = ...
user19904's user avatar
  • 182
0 votes
1 answer
126 views

I'm starting to study algorithms and their efficiency. I started by selection sort. For the sake of interest, I wanted to compare the running time of the same algorithm implementation in python and c++...
Gwinkamp's user avatar
0 votes
1 answer
177 views

I need help regarding dragonfly db, particularly benchmarking. So here is the story, I tried benchmarking dragonfly as a cache to replace redis. I got the expected result when testing single node; it ...
amzshow's user avatar
  • 58
1 vote
0 answers
81 views

I would like to do some (micro)benchmarking in Swift. I have been using package-benchmark for this. It comes with a blackHole helper function that forces the compiler to assume that a variable is read ...
loonatick's user avatar
  • 1,197
7 votes
1 answer
177 views

I wanted a reliable benchmark which has a lot of cache misses, and the obvious go-to seemed to be a pointer chasing benchmark. I ended up trying google/multichase but the results don't seem to be what ...
Box Box Box Box's user avatar
0 votes
0 answers
19 views

We have designed benchmarks to test execution times of some algorithms coded within a library that we are working on. Those algorithms are mono-threaded. So the algorithms can use at most 100% of the ...
Arnaud's user avatar
  • 223
-2 votes
1 answer
101 views

I have two C++ implementations that count pairs ((x, y)) satisfying ((x + y) % 7 == 0). Method 1 skips unnecessary iterations using y += 6, while Method 2 checks every y. I performed benchmarks on ...
Chirag Jain's user avatar
7 votes
1 answer
176 views

I have just been trying to benchmark some PRNG code for generating hex characters - I have an input of random values buf, which I want to turn into a series of hex characters, in-place. Code #define ...
Anon's user avatar
  • 381
6 votes
0 answers
97 views

There is a binary serialization called BorshSerialize. And in its Rust implementation, when alloc is enabled, it allocates a Vector with initial capacity of 1024 before each serialization. I thought ...
Ahmet Yazıcı's user avatar
4 votes
0 answers
166 views

I have an important question. I performed a benchmark using the following tool versions: # JMH version: 1.37 # VM version: JDK 22, OpenJDK 64-Bit Server VM, 22+36-FR # VM invoker: /home/jack/.sdkman/...
Jack5000's user avatar
1 vote
0 answers
126 views

I am writing C code for the Raspberry Pi 4 (ARM Cortex-A72), which relies on precise timing in periods of less than 1μs. To get precise timing, I use the following algorithm: clock_gettime(...
Pygmalion's user avatar
  • 921
0 votes
1 answer
76 views

I want to perform a benchmarking Test (BPFM, IOR, FIO & Sysbench) on a Ubuntu VM. The benchmark should use the available amount of cores in steps of 2^2 (So 2, 4, 8, 16, ... up to the available ...
JulianW's user avatar
0 votes
0 answers
37 views

I'm trying to use WandB to store and view benchmarking results. I've got a snipped of code that looks basically like this for model in models_to_eval: wandb.init(project="benchmarking", ...
Sam Russell's user avatar
0 votes
1 answer
98 views

A function which is only being used with debug assertions is deactivated by: #[cfg(debug_assertions)] fn some_debug_support_fn() {} But this makes cargo bench fail to compile, as it is missing the ...
Dávid Tóth's user avatar
  • 3,315
2 votes
1 answer
101 views

I have a microbenchmark which I'm using to generate memory traffic. I've profiled the application and it seems to constantly hit in L1 cache. I have a Core i5-7260U. I want to understand the actual ...
jkang's user avatar
  • 579
0 votes
1 answer
110 views

I came across the question Wrapping real numbers which asks for help on implementing a "wrap around" functionality for double values within a given range, stating the following constraint: ...
null's user avatar
  • 5,825
-6 votes
2 answers
110 views

I wrote two empty functions in rust with the hope of using each to test the use of retain on vectors or the use of filter on iterators, after writing the empty function for each case, on running the ...
Brian Obot's user avatar
1 vote
0 answers
33 views

I'm trying to setup Airspeed Velocity to use with my Python project. In other setups, such as github workflows the only build commands needed are: "python -m pip install --upgrade pip", ...
Attack68's user avatar
  • 4,821
0 votes
0 answers
92 views

I am benchmarking the performance of iterative and recursive binary search algorithms in Java, specifically measuring both execution time and memory usage for different dataset sizes. However, I am ...
Ersin Karaduman's user avatar
1 vote
1 answer
121 views

I have pretty basic benchmark comparing performance of mutex vs atomic: const ( numCalls = 1000 ) var ( wg sync.WaitGroup ) func BenchmarkCounter(b *testing.B) { var ...
vtm11's user avatar
  • 409
4 votes
0 answers
367 views

Overview We have been using the the nats bench CLI tool for benchmarking the performance of NATS Jetstream over different variables. In our use-case, we require 100K ephemeral parallel consumers ...
Anish Gupta's user avatar
16 votes
3 answers
1k views

The goal of this research is to explore the performance differences between JIT (just-in-time compilation) and AOT (ahead-of-time compilation) strategies and to understand their respective advantages ...
Joas Coder's user avatar
0 votes
0 answers
37 views

I want to benchmark some performance aspects of a Linux device driver (a loadable module). Specifically, how fast certain code paths are when they are invoked from userspace via system calls. In ...
Grigory Rechistov's user avatar
5 votes
2 answers
211 views

So I recently ran a benchmark where I compared the performance of nested streams in 3 cases: Parallel outer stream and sequential inner stream Parallel outer and inner streams (using parallelStream) -...
Andorrax's user avatar
  • 123
1 vote
0 answers
130 views

I am trying to run the Spec '06 benchmarks on Gem5. All of the benchmarks I've tried seem to start up normally and then output the following and stall indefinitely: src/sim/mem_state.cc:448: info: ...
Martin Chapman's user avatar

1
2 3 4 5
73