QUDA
1.0.0
|
Public Member Functions | |
multicaxpy_ (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW) | |
__device__ __host__ void | operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j) |
where the reduction is usually computed and any auxiliary operations More... | |
int | streams () |
int | flops () |
total number of input and output streams More... | |
![]() | |
virtual __device__ __host__ void | init () |
pre-computation routine before the main loop More... | |
Public Attributes | |
const int | NYW |
Definition at line 160 of file multi_blas_core.cuh.
|
inline |
Definition at line 163 of file multi_blas_core.cuh.
|
inline |
total number of input and output streams
Definition at line 180 of file multi_blas_core.cuh.
References quda::blas::MultiBlasArg< NXZ, SpinorX, SpinorY, SpinorZ, SpinorW, Functor >::NYW.
|
inlinevirtual |
where the reduction is usually computed and any auxiliary operations
Implements quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >.
Definition at line 168 of file multi_blas_core.cuh.
References quda::blas::_caxpy(), quda::blas::Amatrix_d, quda::blas::Amatrix_h, and MAX_MULTI_BLAS_N.
|
inline |
Definition at line 179 of file multi_blas_core.cuh.
References quda::blas::MultiBlasArg< NXZ, SpinorX, SpinorY, SpinorZ, SpinorW, Functor >::NYW.
const int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW |
Definition at line 161 of file multi_blas_core.cuh.