QUDA
1.0.0
|
Public Member Functions | |
cabxpyAx_ (const Float2 &a, const Float2 &b, const Float2 &c) | |
__device__ __host__ void | operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, FloatN &v) |
where the reduction is usually computed and any auxiliary operations More... | |
![]() | |
virtual __device__ __host__ void | init () |
pre-computation routine before the main loop More... | |
Static Public Member Functions | |
static int | streams () |
static int | flops () |
total number of input and output streams More... | |
Public Attributes | |
const Float2 | a |
const Float2 | b |
Functor performing the operation y[i] += a*b*x[i], x[i] *= a
Definition at line 315 of file blas_core.cuh.
|
inline |
Definition at line 318 of file blas_core.cuh.
|
inlinestatic |
total number of input and output streams
Definition at line 325 of file blas_core.cuh.
|
inlinevirtual |
where the reduction is usually computed and any auxiliary operations
Implements quda::blas::BlasFunctor< Float2, FloatN >.
Definition at line 319 of file blas_core.cuh.
References quda::blas::_caxpy().
|
inlinestatic |
Definition at line 324 of file blas_core.cuh.
const Float2 quda::blas::cabxpyAx_< Float2, FloatN >::a |
Definition at line 316 of file blas_core.cuh.
const Float2 quda::blas::cabxpyAx_< Float2, FloatN >::b |
Definition at line 317 of file blas_core.cuh.