Functions to perform the Thomas Algorithm on tridiagonal systems in batches with CUDA. Code was used in the following paper:
- Efficient Interleaved Batch Matrix Solvers for CUDA (preprint http://arxiv.org/abs/1909.04539)
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Functions to perform the Thomas Algorithm on tridiagonal systems in batches with CUDA. Code was used in the following paper: