No, it's "cudaStreamSetAttribute".
Be careful with modifications if you don't know what you are doing exactly. The algorithm is not so straight as the classic one, for example, if you damage loop handling it will work for small ranges but fail for high ranges.
Discovered a problem
I didn't modify your source code, I ran it directly using the program you compiled
However, after running for a period of time, a large number of errors occurred.
DPs buffer overflow,some points lost, increase DP value!
DPs buffer overflow,some points lost, increase DP value!
DPs buffer overflow,some points lost, increase DP value!
At the beginning, it was normal, but the error message above kept appearing. May I ask why?