QUDA 0.9.0


Public Member Functions
| | multicaxpy_ (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW) |
| __device__ __host__ void | operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j) |
| | Where the reduction is usually computed, along with any auxiliary operations. |
| int | streams () |
| | Total number of input and output streams. |
| int | flops () |
Public Member Functions inherited from quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >
| virtual __device__ __host__ void | init () |
| | Pre-computation routine before the main loop. |
Public Attributes
| const int | NYW |
Definition at line 94 of file multi_blas_quda.cu.
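| quda::blas::multicaxpy_< NXZ, Float2, FloatN >::multicaxpy_ (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW) | inline |
Definition at line 97 of file multi_blas_quda.cu.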
| int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::flops () | inline |
Definition at line 113 of file multi_blas_quda.cu.
References quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW.

| __device__ __host__ void quda::blas::multicaxpy_< NXZ, Float2, FloatN >::operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j) | inline virtual |
Where the reduction is usually computed, along with any auxiliary operations.
Implements quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >.
Definition at line 101 of file multi_blas_quda.cu.
References quda::blas::_caxpy(), a, Amatrix_d, Amatrix_h, fused_exterior_ndeg_tm_dslash_cuda_gen::i, MAX_MULTI_BLAS_N, quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW, x, and y.

| int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::streams () | inline |
Total number of input and output streams.
Definition at line 112 of file multi_blas_quda.cu.
References quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW.
| const int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW |
Definition at line 95 of file multi_blas_quda.cu.
Referenced by quda::blas::multicaxpy_< NXZ, Float2, FloatN >::flops(), quda::blas::multicaxpy_< NXZ, Float2, FloatN >::operator()(), and quda::blas::multicaxpy_< NXZ, Float2, FloatN >::streams().
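
The page gives only signatures, so the following is a minimal host-side sketch of the operation a multi-caxpy functor is generally understood to apply: every output vector y[j] accumulates a complex-scaled copy of every input vector x[i]. It is not QUDA code. The names multi_caxpy_reference and multi_caxpy_real_flops, the use of std::vector and std::complex, and the coefficient layout a[i * NYW + j] are all assumptions made for illustration; the layout QUDA uses for Amatrix_d and Amatrix_h may differ.

```cpp
#include <cassert>
#include <complex>
#include <cstddef>
#include <vector>

using Complex = std::complex<double>;
using Field = std::vector<Complex>;  // one vector of complex elements

// Hypothetical reference routine (not part of QUDA).
// For every input index i < NXZ and output index j < NYW it performs
//   y[j][k] += a(i, j) * x[i][k]   for all elements k.
// Assumption: the coefficient for the pair (i, j) is stored at a[i * NYW + j].
inline void multi_caxpy_reference(const std::vector<Complex> &a,
                                  const std::vector<Field> &x,
                                  std::vector<Field> &y)
{
  const std::size_t NXZ = x.size();  // number of input vectors
  const std::size_t NYW = y.size();  // number of output vectors
  assert(a.size() == NXZ * NYW);

  for (std::size_t j = 0; j < NYW; ++j) {
    for (std::size_t i = 0; i < NXZ; ++i) {
      assert(x[i].size() == y[j].size());
      const Complex coeff = a[i * NYW + j];
      for (std::size_t k = 0; k < x[i].size(); ++k) {
        y[j][k] += coeff * x[i][k];  // one complex multiply-add
      }
    }
  }
}

// Hypothetical flop count for the loop above, not the value returned by
// flops(): a complex multiply-add costs 8 real flops (6 for the multiply,
// 2 for the add), and there are NXZ * NYW of them per complex element.
inline std::size_t multi_caxpy_real_flops(std::size_t NXZ, std::size_t NYW,
                                          std::size_t n_complex)
{
  return 8 * NXZ * NYW * n_complex;
}
```

With two one-element inputs x[0] = (1, 2) and x[1] = (3, 0), a single output y[0] = (0, 1), and coefficients a = {(0, 1), (2, 0)}, the routine leaves y[0] equal to (4, 2). In the documented functor, operator() handles a single (i, j) pair per call, so the two outer loops of this sketch correspond to the indices i and j passed to it.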