Friday, November 25, 2011

Uses too much local data?

ptxas error   : Entry function '_Z11Kernel_NamePfS_S_S_S_i' uses too much local data (0x4530 bytes, 0x4000 max)

According to links below:
http://kirschp.blogspot.com/2008/02/aircrack-speed-up-with-cuda.html
http://forums.nvidia.com/index.php?showtopic=196742

(avidday again):
There is a 16kb per thread local memory limit.
After checking my kernel:
BLOCK_SIZE 52 
(2 * (BLOCK_SIZE/2) + BLOCK_SIZE * (BLOCK_SIZE/2) * 3 )  * 4bytes = 17712


Checking under MATLAB:
hex2dec('4530') = 17712, goes over 16kb per thread limit.

No comments:

Post a Comment