QUDA
1.0.0
|
Public Types | |
typedef scalar< Float2 >::type | real |
Public Member Functions | |
multi_caxpyBxpz_ (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW) | |
__device__ __host__ void | operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j) |
where the reduction is usually computed and any auxiliary operations More... | |
int | streams () |
int | flops () |
total number of input and output streams More... | |
![]() | |
virtual __device__ __host__ void | init () |
pre-computation routine before the main loop More... | |
Public Attributes | |
const int | NYW |
Functor performing the operations y[i] = a*x[i] + y[i] and z[i] = b*x[i] + z[i]
Definition at line 247 of file multi_blas_core.cuh.
typedef scalar<Float2>::type quda::blas::multi_caxpyBxpz_< NXZ, Float2, FloatN >::real |
Definition at line 248 of file multi_blas_core.cuh.
|
inline |
Definition at line 251 of file multi_blas_core.cuh.
|
inline |
total number of input and output streams
Definition at line 273 of file multi_blas_core.cuh.
References quda::blas::MultiBlasArg< NXZ, SpinorX, SpinorY, SpinorZ, SpinorW, Functor >::NYW.
|
inlinevirtual |
where the reduction is usually computed and any auxiliary operations
Implements quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >.
Definition at line 258 of file multi_blas_core.cuh.
References quda::blas::_caxpy(), quda::blas::Amatrix_d, quda::blas::Amatrix_h, quda::blas::Bmatrix_d, quda::blas::Bmatrix_h, and MAX_MULTI_BLAS_N.
|
inline |
Definition at line 272 of file multi_blas_core.cuh.
const int quda::blas::multi_caxpyBxpz_< NXZ, Float2, FloatN >::NYW |
Definition at line 249 of file multi_blas_core.cuh.