QUDA
1.0.0
|
Public Member Functions | |
axpyBzpcx_ (const Float2 &a, const Float2 &b, const Float2 &c) | |
__device__ __host__ void | operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, FloatN &v) |
where the reduction is usually computed and any auxiliary operations More... | |
![]() | |
virtual __device__ __host__ void | init () |
pre-computation routine before the main loop More... | |
Static Public Member Functions | |
static int | streams () |
static int | flops () |
total number of input and output streams More... | |
Public Attributes | |
const Float2 | a |
const Float2 | b |
const Float2 | c |
Functor performing the operations: y[i] = a*x[i] + y[i]; x[i] = b*z[i] + c*x[i]
Definition at line 230 of file blas_core.cuh.
|
inline |
Definition at line 234 of file blas_core.cuh.
|
inlinestatic |
total number of input and output streams
Definition at line 241 of file blas_core.cuh.
|
inlinevirtual |
where the reduction is usually computed and any auxiliary operations
Implements quda::blas::BlasFunctor< Float2, FloatN >.
Definition at line 235 of file blas_core.cuh.
|
inlinestatic |
Definition at line 240 of file blas_core.cuh.
const Float2 quda::blas::axpyBzpcx_< Float2, FloatN >::a |
Definition at line 231 of file blas_core.cuh.
const Float2 quda::blas::axpyBzpcx_< Float2, FloatN >::b |
Definition at line 232 of file blas_core.cuh.
const Float2 quda::blas::axpyBzpcx_< Float2, FloatN >::c |
Definition at line 233 of file blas_core.cuh.