1 vote
1 answer
128 views

using System.Diagnostics; const int TASKS = 100; var mainSw = Stopwatch.StartNew(); var tasks = Enumerable.Range(0, TASKS).Select(i => Task.Run(async () => { await Task.Delay(...
yuri's user avatar
  • 29
Best practices
0 votes
5 replies
105 views

I need to process a list of objects (not the same as shown in the sample), which I thought could be greatly improved by running it in a parallel.foreach loop. However, the result is not what I expected. ...
Jlong101's user avatar
Advice
1 vote
0 replies
44 views

I need to trigger a dynamic number of sub-workflows in parallel (around 100) and wait for ALL of them to complete before continuing the main workflow. I’ve implemented a solution but I’m wondering if ...
Michal's user avatar
  • 121
-4 votes
0 answers
42 views

I’ve been reading about Python’s Global Interpreter Lock (GIL), and I’m a bit confused about how it actually works behind the scenes. From what I understand, the GIL allows only one thread to execute ...
Yash Gupta's user avatar
Advice
2 votes
2 replies
59 views

I am working on a global PDE problem that is solved using a standard domain-decomposition strategy (e.g., Scotch, METIS). This part of the computation is well balanced across all MPI processes. ...
hrx71's user avatar
  • 1
Tooling
1 vote
3 replies
72 views

The idea of Persistent-Memory gawk is fabulous because it improves the performance, size, and clarity of many scripts on static and reference data. However, I have a significant problem in adopting ...
Sergio Albert's user avatar
1 vote
1 answer
79 views

I'm trying to run calculations using multiple cores in Python on multiple platforms (Linux, macOS, Windows). I need to pass a large CustomClass Object and a dict (both readonly) to all workers. So far ...
polyte's user avatar
  • 459
0 votes
0 answers
40 views

I am now trying to use FSDP in the Hugging Face Transformers Trainer. The training script is something like train_dataset = Mydataset(...) args = TrainingArguments(...) model = LlamaForCausalLM....
xuehao-049's user avatar
0 votes
0 answers
21 views

I have the following process definition that I am trying to execute on Camunda 7.24 / CibSeven 2.1, which currently logs many OptimisticLockingException errors during execution. I could already trace down that it ...
BigMichi1's user avatar
  • 308
0 votes
1 answer
126 views

I have the following C# code: var rand = new Random(1); var range = Enumerable.Range(1, 8); var partition = Partitioner.Create(range, EnumerablePartitionerOptions.NoBuffering); foreach (var x in ...
tigrou's user avatar
  • 4,596
0 votes
1 answer
97 views

This is a bit of a slog, so bear with me. I'm currently writing a 3D S(moothed) P(article) H(ydrodynamics) simulation in Unity with a parallel HLSL backend. It's a Lagrangian method of fluid simulation,...
Ben Williams's user avatar
Tooling
0 votes
0 replies
32 views

I am running Flux 1 dev text to image model through ComfyUI in Kaggle. Everything works but I noticed that Kaggle offers a second GPU inside the notebook. If I try to run two instances of the ComfyUI ...
Bram Fran's user avatar
  • 113
1 vote
0 answers
81 views

I am reading a paragraph about the tbb::parallel_scan algorithm from the book Intel Threading Building Blocks, and I understood what the operation does serially, but I am not understanding what are ...
luczzz's user avatar
  • 446
0 votes
0 answers
73 views

While looking at this TBB guide webpage: https://www.intel.com/content/www/us/en/docs/onetbb/developer-guide-api-reference/2021-9/bandwidth-and-cache-affinity.html, they mention this ...
luczzz's user avatar
  • 446
Best practices
0 votes
2 replies
129 views

Essentially I am trying to create a dataset that is dependent on prior rows to generate values for any given row. I then would like to run this loop over many IDs for an entire dataset. Current set up ...
triangle_coder's user avatar
0 votes
0 answers
52 views

TFORM is considered a great tool for manipulating large symbolic equations. In this thread, I’d like to share my optimization problem, which concerns a very simple operation — equation expansion. ...
kozapdh's user avatar
Advice
1 vote
1 reply
33 views

I need to transfer MBTiles map tiles from one disk to another. Is there a faster way than just mv? The file is huge and the move takes a long time.
Alperen Ölçer's user avatar
0 votes
1 answer
97 views

The code below crashes with: terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >' Aborted ...
user1407220's user avatar
0 votes
1 answer
52 views

I am doing some calculations on spectra, doing some maths on each partial. This is taking a lot of time, but since the partials are all calculated independently, I wonder how to do parallel processing ...
les_h's user avatar
  • 438
1 vote
0 answers
41 views

Does anyone know how to call a single notebook with different parameters in parallel, so that all notebook runs appear in the Spark UI to make troubleshooting easier? I have one child notebook, calling from ...
SAH's user avatar
  • 11
0 votes
0 answers
25 views

I am trying to integrate socket.io with Node's HTTP alongside Node's Cluster Module. Consider the reproducible example: index.js: let cluster = require('cluster') let fs = require('fs') let http = ...
Issac Howard's user avatar
1 vote
3 answers
249 views

I have a large number of files (about 3800) that I want to run a program over. The program reads a WAV file and makes a short .TSV text file containing the WAV's lip-sync data (but that is by-the-by ...
Benjamin Rich's user avatar
0 votes
0 answers
31 views

I’m working with FORM/TFORM to automatically expand a large set of symbolic equations. My goal is to make the expansion process run in parallel on multiple CPU cores using TFORM. Here’s a simplified ...
kozapdh's user avatar
2 votes
1 answer
86 views

I have a class with methods to simulate sources across 16 detectors using the Gelsa package. In my main script, I call the method generate.sources. I am trying to use multiprocessing to speed up the ...
Nicolò Fiaba's user avatar
15 votes
0 answers
626 views

I've been developing a 2D N-body gravity simulation in C++, and I've run into an interesting performance issue. Instead of a stable frame rate, the application's update time systematically pulsates ...
Krish's user avatar
  • 159
1 vote
1 answer
102 views

I am trying to compile some of my older applications, and it doesn't compile where it encounters lambdas. I am well aware that this api is deprecated, but it compiled ok some time ago on the same ...
armagedescu's user avatar
  • 2,417
0 votes
0 answers
66 views

I’m learning how to use SIMD (Single Instruction, Multiple Data) for parallel data processing. Suppose I have a large dataset (e.g., an array of 1 million floats), and I want to process it efficiently ...
Catdev's user avatar
  • 1
8 votes
1 answer
244 views

The parallel policy states that the order of iterator evaluation is not guaranteed. However, std::find*, std::search and std::mismatch all say that they return the first iterator matching the condition. How do those ...
Dominik Kaszewski's user avatar
1 vote
1 answer
108 views

Building on this question here: the term thread divergence is used in CUDA; from my understanding it's a situation where different threads are assigned to do different tasks and this results in a big ...
bigcodeszzer's user avatar
0 votes
1 answer
80 views

I'm trying to spawn multiple parallel (not concurrent) tasks. Every task runs a PUT operation to a custom S3 storage using the AWS SDK for Rust. The function body looks like the following (the different ...
lukeflo's user avatar
  • 195
0 votes
1 answer
207 views

We have several tests which have been switched to using XUnit 3. We have an assembly-level [assembly: CaptureConsole(CaptureOut = true, CaptureError = true)] and we have several "more ...
Willy Van den Driessche's user avatar
-1 votes
0 answers
49 views

I am learning parallel programming in Haskell. My example code: module Quicksort where import Control.Parallel.Strategies import Control.Parallel qsort :: [Int] -> [Int] qsort [] = [] qsort lst@(...
kesarling's user avatar
  • 2,334
1 vote
0 answers
70 views

I use JFreeChart to create various kinds of charts en masse, based on huge amounts of data. Those charts are only meant to be written to PNG files on the hard drive; no JavaFX, Swing, AWT or other GUI ...
Pixelcode's user avatar
  • 440
2 votes
0 answers
81 views

I have a function (RSA encryption) where the end result m depends on m1 and m2, and m1 and m2 can be computed in parallel. I tried to use par and pseq, but the result is weak. Total time 20.062s ( 18....
Maq's user avatar
  • 427
0 votes
2 answers
103 views

I am trying to test future.batchtools for parallelisation in R. I have a small test job (run_futurebatchtools_job.R) as: library(future) library(future.batchtools) # Set up the future plan to use ...
Arindam Ghosh's user avatar
4 votes
1 answer
144 views

In R, I am parallelising my simulation using the packages foreach, doFuture, and doRNG. I have two nested foreach loops: the inner loop generates and analyses data for each iteration, and the outer loop ...
Trang Hien's user avatar
0 votes
0 answers
111 views

Right now I am running a model optimization to optimize one set of parameters for several sites (47 sites in total, i.e. the cost function sums over these 47 results). Site computation is independent ...
Xu Shan's user avatar
  • 325
2 votes
1 answer
192 views

I'm trying to implement short-circuited processing for an external input of java.util.Stream (think Stream.forEach() but with short-circuiting). I do not care about the order of the elements, but if ...
Basilevs's user avatar
  • 24.6k
1 vote
1 answer
117 views

Here’s the pattern I want: Dispatch multiple tasks in parallel. Aggregate all their results into a final result. Remove the intermediate results right after the chord result is ready, without ...
Temax's user avatar
  • 97
-2 votes
1 answer
184 views

While learning about parallelism, I learned that C++ supports parallelism through functions such as std::for_each, std::transform and execution policies. So if, for example, we want to divide elements of ...
Le_Square's user avatar
0 votes
0 answers
53 views

I'm working on a function to detect positive and negative events using a time series of cumulative anomalies. The function seems to work fine for vectors and produces the correct number of outputs as ...
Jose Lastra's user avatar
0 votes
0 answers
54 views

I'm trying to parallelize my R code that pulls in data from a Snowflake table, but when I do, I get an error that I have an invalid connection. I don't receive this error when I do NOT have it ...
J.Sabree's user avatar
  • 2,596
2 votes
3 answers
111 views

We are working with the following code: int i, j, k; for (i = 2; i < n; i++){ // S1 for (j = 3; j < n - 3; j++){ // S2 for (k = 4; k < n - 4; k++){ // S3 A[...
user31223185's user avatar
1 vote
0 answers
80 views

Suppose you are processing a large data set using several cores in parallel. I am looking for the most memory-efficient way to break up the data among the processors. Specifically, this would be for ...
Zzyzx's user avatar
  • 53
2 votes
0 answers
61 views

I'm writing a physics game, and I'm trying to speed up my motion calculations. Every tick of the update cycle, I call an rk4 routine which calls an ODE function 4 times, passing updated values for dt/...
Kate Goss's user avatar
2 votes
2 answers
371 views

I was excited when I read about the latest update of purrr (1.1.0) with its in_parallel capabilities. I just happen to have a time-consuming data task that runs for several minutes, because: my data ...
deschen's user avatar
  • 11.6k
1 vote
0 answers
42 views

I'm currently in MATLAB doing a set of fixed point iterations using fmincon. I use parfor multithreading to do so. However, one of my iterations in the parfor loop runs particularly slowly just because ...
ZZ Top's user avatar
  • 105
1 vote
1 answer
79 views

I have a multi-module Android project where I am migrating from JUnit 4 to JUnit 5. For CoroutineTestRule I have added CoroutineTestExtension, but it's not working well when parallel test execution is ...
OneDream Project's user avatar
2 votes
1 answer
283 views

Sorry, I was busy and made a few mistakes. The first is that the logic of the various implementations was not the same, and I adjusted accordingly; the second is that there is an overflow with float and I ...
Michael's user avatar
  • 810
1 vote
1 answer
407 views

I have been trying to generalise this jax program for solving on both CPU and GPU depending on the machine it's running on (essentially need cpu parallelisation to speed up testing versus gpu for ...
Kepler7894i's user avatar
