QUDA
1.0.0
|
Public Member Functions | |
multicaxpyz_ (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW) | |
__device__ __host__ void | operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j) |
where the reduction is usually computed and any auxiliary operations More... | |
int | streams () |
int | flops () |
total number of input and output streams More... | |
![]() | |
virtual __device__ __host__ void | init () |
pre-computation routine before the main loop More... | |
Public Attributes | |
const int | NYW |
Functor to perform the operation z = a * x + y (complex-valued)
Definition at line 187 of file multi_blas_core.cuh.
|
inline |
Definition at line 190 of file multi_blas_core.cuh.
|
inline |
total number of input and output streams
Definition at line 209 of file multi_blas_core.cuh.
References quda::blas::MultiBlasArg< NXZ, SpinorX, SpinorY, SpinorZ, SpinorW, Functor >::NYW.
|
inlinevirtual |
where the reduction is usually computed and any auxiliary operations
Implements quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >.
Definition at line 195 of file multi_blas_core.cuh.
References quda::blas::_caxpy(), quda::blas::Amatrix_d, quda::blas::Amatrix_h, and MAX_MULTI_BLAS_N.
|
inline |
Definition at line 208 of file multi_blas_core.cuh.
References quda::blas::MultiBlasArg< NXZ, SpinorX, SpinorY, SpinorZ, SpinorW, Functor >::NYW.
const int quda::blas::multicaxpyz_< NXZ, Float2, FloatN >::NYW |
Definition at line 188 of file multi_blas_core.cuh.