Using cuSparse library for Tridiagonal solver on a CUDA compatible GPU with compute capability 1.1, has decreased the performance drastically, up to 50 times slower when compared with a traditional serial solver on a Core 2 Duo CPU.
Is it because of the low compute compatibility or is my implementation wrong?