-
SHARED_MEMORY_TILED_VERSION_2.0
Major changing is using the sign bit to do less access to the image. This version moves some code into point2d for better design, creates a point2d 8int specific version, fix the 1d algorihtm and add restrict qualifier to go faster
-
SHARED_MEMORY_TILED_VERSION_1.0
64x32 tiles using shared memory is the quickest, the full final merge is not yet implemented