cuda_tridiangonal_solver_PCR Implement Parallel Cyclic Reduction in CUDA-C for a Tridiagonal matrix equation