0 votes
0 answers
43 views

We use a SYCL command_graph to process data. The processing is configured at runtime. One of the (possible) processing steps is a real-to-complex FFT; for this we use Intel's oneAPI MKL DFT. If an FFT ...
schoeffmann
0 votes
1 answer
61 views

I'd like to clarify which memory transfers are done in two examples of SYCL code in Intel's oneAPI guide. The guide makes one point that contradicts my impression about when memory transactions are ...
xealits
  • 4,818
0 votes
0 answers
69 views

The error is caught by the for loop below: try { for (int t = 0; t < T; t++) { if (t % 2 == 0) { // Run on CPU q_cpu.submit([&](sycl::handler& h) { ...
Chua Choon Fei
3 votes
0 answers
167 views

This is my SYCL program on Windows to check whether SYCL works on my system: #include <iostream> #include <cl/sycl.hpp> #include <Windows.h> int main() { int array[5]; { ...
Supergamer
4 votes
1 answer
169 views

I'm new to SYCL. The following code compiles fine but gives the wrong result. The code computes the product of two matrices (A, RM x K and B, RK x N) into a result matrix (C, RM x N). #include <...
pablo_s
  • 43
2 votes
1 answer
94 views

I have a collection of structs, each with a data_t member type (other members omitted): struct A { using data_t = int; }; struct B { using data_t = double; }; struct C { using data_t = bool; }; // etc. ...
Dominic Kerr
0 votes
2 answers
145 views

I'm trying to run two kernels concurrently on a single Nvidia GPU using oneAPI SYCL and the Nvidia plugin. Is this possible? If not, why? Here is where I'm at so far: I'm able to run two kernels ...
rubikssolver4
3 votes
2 answers
182 views

One claimed advantage of SYCL (heterogeneous computing by Khronos) is that the source code is standard C++17. I am not sure how to understand this. Is it a strict statement or just ...
0x2207
  • 1,052
-1 votes
1 answer
97 views

I have two RTX 3090 GPUs, and I want to bind them in one context, but the code below: int main() { std::vector<sycl::device> devices = sycl::device::get_devices(); std::vector<sycl::device&...
Alice Pedom
1 vote
0 answers
64 views

Does SYCL have a feature analogous to Direct3D's, or Unity's ComputeShader.DispatchIndirect, where you can launch x kernels, where x is an integer stored in GPU memory? This avoids having to ...
denomme
  • 35
1 vote
0 answers
99 views

In my application I perform the same computation on batches of problems. I do, however, require some intermediate data to be allocated for these computations, and therefore I've resorted to function ...
Jonas la Cour
-3 votes
1 answer
146 views

I'm new to SYCL, and I'm having some trouble compiling an almost empty program. I really don't understand how to fix the redefinition of type_traits. I've tried all the compilation command options ...
Antag0nist
2 votes
0 answers
235 views

I'm trying to port some CUDA code which uses cuBLAS to SYCL, using oneMKL's BLAS. oneMKL appears to be very slow though. This is the specific snippet: cublasCheck(cublasSgemmStridedBatched(...
Mart
  • 21
1 vote
1 answer
612 views

I am trying to build a shared Linux library that can be distributed and linked, like any normal shared library. We have recently ported our HPC GPU routines from CUDA to SYCL in order to be cross-...
James
  • 434
1 vote
1 answer
389 views

I want to use the C++ library PcapPlusPlus and its header files in my SYCL code. More precisely, I want to compile it with the Intel C++ Compiler (icpx). I know how to program and know how C, Java and ...
user23683306
1 vote
2 answers
214 views

SYCL offers NDRange and hierarchical kernel parallelism abstractions. My questions: Is it true that NDRange maps better onto GPU hardware and hierarchical parallelism maps better onto ...
Ami
  • 21
2 votes
0 answers
51 views

Hi, I want to run a parallel reduction operation on my host device. When I compile using clang++ -fsycl it compiles fine, but when I run it I get the following: terminate called after throwing an ...
Selorm K.aw
-1 votes
1 answer
96 views

I am using Qt Creator 4.4.0 to develop a SYCL application. I am not able to follow the SYCL functions into the headers to check their definitions in Qt Creator, as I can with other standard headers ...
sidarth narayanan
1 vote
1 answer
223 views

I'll start with the question, in case somebody can answer without going through the whole post: What is the easiest way to start programming with a Radeon Pro VII in C++ on Windows? And for anyone ...
Guillermo Benito
0 votes
1 answer
605 views

There are other similar questions regarding this issue, but their answers do not solve my case: terminate called after throwing an instance of 'sycl::_V1::runtime_error' what(): No kernel named ...
alvarella
1 vote
0 answers
354 views

Hi, I'm trying to perform a parallel reduction with SYCL, but after every calculation it seems my device fails to copy the values back to my host device. Attached is a snippet of my code: int ddot (...
Selorm K.aw
4 votes
1 answer
258 views

I have recently ported my legacy CUDA code to SYCL using oneAPI for NVIDIA GPUs. The code runs fine but is two times slower than the native CUDA code. After profiling, I found the following. ...
Xilin Xia
0 votes
0 answers
154 views

I am developing a SYCL application using the Intel ICPX compiler. While the code executes successfully in debug mode with the -O0 optimization level, I encounter issues in the release version: the ...
peizhao qiu
0 votes
4 answers
521 views

I installed the oneAPI base kit and HPC kit (2024.0) on a public cluster to test the performance of GEMM, but I got a segmentation fault. I don't know how to fix this problem. I used the offline installer ...
jaewook kim
-1 votes
1 answer
218 views

I'm using SYCL on Intel Developer Cloud to test innovative algorithms. My questions: SYCL q.memcpy() and h.memcpy() do not work. It seems that Intel knows about it. What is the status of this issue? ...
Ami
  • 21
1 vote
1 answer
490 views

I am writing a SYCL application in which I need to distinguish between discrete GPU devices and integrated GPU devices. Is there any way, directly or indirectly, to know if the GPU device I selected (for ...
Ami
  • 21
0 votes
2 answers
158 views

I've been writing a ray tracer using SYCL for a few weeks but I'm now facing a memory corruption issue and I really can't find where it's coming from. I'm working on Windows 11 22H2 using the Intel ...
Tom Clabault
0 votes
2 answers
508 views

I am having trouble writing a CMake file to offload SYCL code to the NVIDIA backend. My CMake file currently looks like this: cmake_minimum_required(VERSION 3.22.1) set(CMAKE_C_COMPILER /opt/intel/...
peizhao qiu
0 votes
1 answer
214 views

Is it possible to copy a class containing pointers to its internal attribute using SYCL and offload it to the graphics card? Basically, I try to reference members to avoid unnecessary memory usage. I ...
peizhao qiu
0 votes
1 answer
580 views

I am trying to adopt an OOP software design strategy for a SYCL project I was working on. I got my code running in its C++ version, and then I attempted to convert it to SYCL while trying to make the ...
peizhao qiu
3 votes
2 answers
151 views

We have a specific case with the GPU programming paradigm SYCL, as described in this fix request, where we want read-only access to a buffer. Specifically, imagine a use case like: namespace ...
Fantastic Mr Fox
0 votes
0 answers
245 views

I'm trying to install hipSYCL for an Nvidia GPU. hipSYCL is installed, but it gives an illegal instruction error when I try to run the syclcc compiler. I tried running the syclcc command on the CPU and here is its ...
bhuma
  • 1
0 votes
2 answers
135 views

I'm trying to optimize my memory-bound numerical simulation kernel in OpenCL/SYCL using local memory to allow data sharing between work-items, so that I can reduce redundant global memory traffic. When ...
比尔盖子
  • 3,727
0 votes
1 answer
224 views

SYCL USM will work on a discrete GPU if and only if the GPU's hardware supports a unified virtual address space. What is the case regarding integrated GPUs? Can we assume that any integrated GPU supports USM? ...
Ami
  • 21
2 votes
1 answer
205 views

What is the point of the use_host_ptr property in SYCL? Why would the SYCL runtime not use the memory pointed to by the provided host pointer? https://registry.khronos.org/SYCL/specs/sycl-2020/html/sycl-...
ggg
  • 1,999
0 votes
1 answer
962 views

I want to run a workload on an Nvidia GPU with SYCL on Windows 10 Pro 21H2 19044.3086. The SYCL guide states that the CUDA backend is supported on Windows: Build DPC++ toolchain with support for NVIDIA CUDA To ...
Vladislav Kogan
16 votes
1 answer
2k views

In SYCL, there are three types of memory: host memory, device memory, and Unified Shared Memory (USM). For host and device memory, data exchange requires explicit copying. Meanwhile, data movement ...
比尔盖子
  • 3,727
0 votes
2 answers
119 views

I'm trying to identify Valgrind's support for different programming languages. I just want to find out whether Valgrind supports SYCL applications and, if it does, how to profile a SYCL application. If ...
bhuma
  • 1
1 vote
3 answers
178 views

In conventional C++, it's possible to create a multi-dimensional "viewer" or "wrapper" to a 1D buffer in linear memory by (1) defining a custom ArrayWrapper class, (2) overriding ...
比尔盖子
  • 3,727
1 vote
0 answers
330 views

I'm new to SYCL and trying to run a very simple vector addition program using ComputeCpp. #include <sycl/sycl.hpp> #include <iostream> class vector_addition; class vector_initialization; ...
Dmytro
  • 340
0 votes
2 answers
358 views

I have "*.c" files generated at runtime with a function implementation int foo(int, int): extern "C"{ int foo(int a, int b) { return a + b; } } I want to use these ...
Anon232
  • 23
0 votes
1 answer
1k views

Currently I am working on a project using DPC++. I have worked for a while in the Intel DevCloud. I haven't had any problems using computing resources. When I select a GPU, it works as expected. However, ...
MoMellouky
0 votes
1 answer
480 views

I'm trying to code my first SYCL app: just some falling sand. The details aren't important. If a cell has sand and the cell beneath is empty, move the sand; else bottom left or bottom right, or if no ...
Aaron
  • 91
0 votes
1 answer
185 views

When trying to allocate more than 256MB in one allocation on a discrete Intel GPU using SYCL/DPC++ on Linux, I get a runtime error and the program exits immediately, despite having significantly more ...
Nigel Overmars
0 votes
3 answers
459 views

In the process of learning SYCL/DPC++, I wrote a SYCL GPU-enabled dot product code (full code on GitHub). #include <iostream> #include <sstream> #include <cmath> #include <CL/sycl....
wyphan
  • 216
1 vote
1 answer
265 views

What is the contiguous dimension in an N-dimensional SYCL kernel, i.e. the dimension in which threads of a work-group are expected to belong to the same warp/wavefront? I would have expected it to be ...
Limmershin
0 votes
1 answer
352 views

Context: I'm porting a complex CUDA application to SYCL which uses multiple cudaStream to launch the kernels. In addition, it uses the default stream in some cases, forcing a device-wide ...
Luigi-Crisci
0 votes
1 answer
383 views

I have a program with a variety of kernels. In production these kernels run on a GPU device and require JIT (just-in-time) compilation because we use specialisation constants. For testing we run on ...
Fantastic Mr Fox
2 votes
1 answer
770 views

Apologies for the slightly jokey title, but I couldn't find another way to concisely describe the question. I work in a team that use predominantly OpenCL code with a CPU fallback. For the most part ...
Louis Child
0 votes
1 answer
273 views

I was running this code using SYCL when this error regarding recursion came up: "error: SYCL kernel cannot call a recursive function". I am not sure what is causing this error. I used ...
Pksingh
  • 53