QUDA
1.0.0
|
Public Member Functions | |
caxpydotzy (const Float2 &a, const Float2 &b) | |
__device__ __host__ void | operator() (ReduceType &sum, FloatN &x, FloatN &y, FloatN &z, FloatN &w, FloatN &v) |
where the reduction is usually computed and any auxiliary operations More... | |
![]() | |
virtual __device__ __host__ void | pre () |
pre-computation routine called before the "M-loop" More... | |
virtual __device__ __host__ void | post (ReduceType &sum) |
post-computation routine called after the "M-loop" More... | |
Static Public Member Functions | |
static int | streams () |
static int | flops () |
total number of input and output streams More... | |
Public Attributes | |
Float2 | a |
double caxpyDotzyCuda(float a, float *x, float *y, float *z, n){} First performs the operation y[i] = a*x[i] + y[i] Second returns the dot product (z,y)
Definition at line 368 of file reduce_core.cuh.
|
inline |
Definition at line 370 of file reduce_core.cuh.
|
inlinestatic |
total number of input and output streams
Definition at line 377 of file reduce_core.cuh.
|
inlinevirtual |
where the reduction is usually computed and any auxiliary operations
Implements quda::blas::ReduceFunctor< ReduceType, Float2, FloatN >.
Definition at line 371 of file reduce_core.cuh.
References quda::blas::Caxpy_(), and quda::sum().
|
inlinestatic |
Definition at line 376 of file reduce_core.cuh.
Float2 quda::blas::caxpydotzy< ReduceType, Float2, FloatN >::a |
Definition at line 369 of file reduce_core.cuh.