About 50 results
Open links in new tab
  1. GPU Pro Tip: CUDA 7 Streams Simplify Concurrency

    Jan 23, 2015 · CUDA Applications manage concurrency by executing asynchronous commands in streams, sequences of commands that execute in order. Different streams may execute their …

  2. How is CUDA stream implemented? - NVIDIA Developer Forums

    Nov 4, 2021 · A cuda stream is mostly a device side activity. It coordinates work on the device. It should not require a thread per stream in all cases. The CUDA driver will create additional threads from time …

  3. Multistream in cudagraph capturing - CUDA Programming and …

    Feb 6, 2025 · This shows that running stream 1 during capturing is not available, so I tested this code after erasing the line “with torch.cuda.stream (stream1):”, and it worked successfully.

  4. Question on Stream, Connection and Performance - CUDA …

    Feb 23, 2024 · I contemplate two feasible approaches: Approach 1: Set CUDA_DEVICE_MAX_CONNECTIONS to 32 and create 32 streams, with the hope that each …

  5. Thread vs Stream what is the difference? - CUDA Programming and ...

    Dec 2, 2010 · A stream is an in-order channel of GPU operations. Every context has a default stream. However sometimes the application might need to leverage certain GPU features like: Concurrent …

  6. Cuda multi stream schedule - NVIDIA Developer Forums

    Oct 11, 2023 · When launch independent kernels in different stream, how these kernels dispatch to sm, multi-streams kernel can run in one sm? For Hopper Arch, assume block_per_sm = 2 for this kernel; …

  7. Questions of CUDA stream priority - NVIDIA Developer Forums

    Apr 18, 2023 · The test launches two threads with low-priority CUDA streams and one thread with a higher-priority CUDA stream. A timer is set for a duration of 5000 ms. During this duration, the …

  8. CUDA Streams, cuStreamWaitValue32, and Deadlocks in Inter-Kernel ...

    Sep 1, 2025 · Hi Everyone, I am trying to implement a mechanism for inter-kernel communication using a flag. My goal is for one kernel to signal another kernel via cuStreamWaitValue32, and have the …

  9. Using multi streams in cuda graph, the execution order is uncontrolled

    May 17, 2022 · I am using cuda graph stream capture API to implement a small demo with multi streams. Referenced by the CUDA Programming Guide here, I wrote the complete code. In my …

  10. How many streams? Maximum number of streams - CUDA …

    Nov 27, 2008 · Hi, is there a limitation on how many streams I can use in an application? If I increase the number of streams in the simpleStreams example from the Cuda SDK to 256, the test fails. I want to …