QUDA 1.0.0

quda::blas::multicaxpy_< NXZ, Float2, FloatN >


Public Member Functions
| | multicaxpy_ (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW) |
| __device__ __host__ void | operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j) |
| | where the reduction is usually computed and any auxiliary operations |
| int | streams () |
| | total number of input and output streams |
| int | flops () |
Public Member Functions inherited from quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >
| virtual __device__ __host__ void | init () |
| | pre-computation routine before the main loop |
Public Attributes
| const int | NYW |
Definition at line 160 of file multi_blas_core.cuh.
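| quda::blas::multicaxpy_< NXZ, Float2, FloatN >::multicaxpy_ (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW) | inline |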
Definition at line 163 of file multi_blas_core.cuh.
| int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::flops () | inline |
Definition at line 180 of file multi_blas_core.cuh.
References quda::blas::MultiBlasArg< NXZ, SpinorX, SpinorY, SpinorZ, SpinorW, Functor >::NYW.
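| __device__ __host__ void quda::blas::multicaxpy_< NXZ, Float2, FloatN >::operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j) | inline, virtual |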
where the reduction is usually computed and any auxiliary operations
Implements quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >.
Definition at line 168 of file multi_blas_core.cuh.
References quda::blas::_caxpy(), quda::blas::Amatrix_d, quda::blas::Amatrix_h, and MAX_MULTI_BLAS_N.
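
The references above suggest that operator() applies a complex axpy update to one (x, y) pair, using a coefficient selected by the indices i and j. A minimal host-side sketch of the overall operation, assuming it amounts to y_j += a_ij * x_i for every pair (i, j), might look like the following. The names `multi_caxpy_reference` and `Field`, and the coefficient layout `a[i * nyw + j]`, are hypothetical and not taken from QUDA; the sketch does not reproduce QUDA's coefficient storage (Amatrix_d, Amatrix_h) or its MAX_MULTI_BLAS_N stride.

```cpp
#include <cassert>
#include <complex>
#include <cstddef>
#include <vector>

using Complex = std::complex<double>;
using Field = std::vector<Complex>; // hypothetical stand-in for a spinor field

// Hypothetical host-side reference: y[j] += a(i, j) * x[i] for every pair (i, j).
// The layout a[i * nyw + j] is an assumption of this sketch, not QUDA's layout.
void multi_caxpy_reference(const std::vector<Complex> &a,
                           const std::vector<Field> &x,
                           std::vector<Field> &y)
{
  const std::size_t nxz = x.size(); // number of input fields
  const std::size_t nyw = y.size(); // number of updated fields
  assert(a.size() == nxz * nyw);

  for (std::size_t j = 0; j < nyw; ++j) {
    for (std::size_t i = 0; i < nxz; ++i) {
      assert(x[i].size() == y[j].size());
      const Complex coeff = a[i * nyw + j];
      // Element-wise complex axpy: y_j <- y_j + coeff * x_i
      for (std::size_t k = 0; k < y[j].size(); ++k) {
        y[j][k] += coeff * x[i][k];
      }
    }
  }
}
```

In this sketch the two outer loops play the role of the i and j arguments of operator(), and the innermost loop stands in for the per-element work that the GPU kernel would distribute across threads.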

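| int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::streams () | inline |
total number of input and output streams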
Definition at line 179 of file multi_blas_core.cuh.
References quda::blas::MultiBlasArg< NXZ, SpinorX, SpinorY, SpinorZ, SpinorW, Functor >::NYW.
| const int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW |
Definition at line 161 of file multi_blas_core.cuh.