ptxas error : Entry function '_Z11Kernel_NamePfS_S_S_S_i' uses too much local data (0x4530 bytes, 0x4000 max)
According to links below:
http://kirschp.blogspot.com/2008/02/aircrack-speed-up-with-cuda.html
http://forums.nvidia.com/index.php?showtopic=196742
(avidday again):
There is a 16kb per thread local memory limit.
After checking my kernel:
BLOCK_SIZE 52
(2 * (BLOCK_SIZE/2) + BLOCK_SIZE * (BLOCK_SIZE/2) * 3 ) * 4bytes = 17712
Checking under MATLAB:
hex2dec('4530') = 17712, goes over 16kb per thread limit.
No comments:
Post a Comment