|
| axpyCGNorm2 (const Float2 &a, const Float2 &b) |
|
__device__ __host__ void | operator() (ReduceType &sum, FloatN &x, FloatN &y, FloatN &z, FloatN &w, FloatN &v) |
| where the reduction is usually computed and any auxiliary operations More...
|
|
virtual __device__ __host__ void | pre () |
| pre-computation routine called before the "M-loop" More...
|
|
virtual __device__ __host__ void | post (ReduceType &sum) |
| post-computation routine called after the "M-loop" More...
|
|
template<typename ReduceType, typename Float2, typename FloatN>
struct quda::blas::axpyCGNorm2< ReduceType, Float2, FloatN >
Specialized kernel for the modified CG norm computation for computing beta. Computes y = y + a*x and returns norm(y) and dot(y, delta(y)) where delta(y) is the difference between the input and out y vector.
Definition at line 444 of file reduce_core.cuh.