QUDA
1.0.0
Public Types | |
typedef scalar< ReduceType >::type | real |
Public Member Functions | |
xpyHeavyQuarkResidualNorm_ (const Float2 &a, const Float2 &b) | |
__device__ __host__ void | pre () |
pre-computation routine called before the "M-loop" | |
__device__ __host__ void | operator() (ReduceType &sum, FloatN &x, FloatN &y, FloatN &z, FloatN &w, FloatN &v) |
where the reduction is usually computed, along with any auxiliary operations | |
__device__ __host__ void | post (ReduceType &sum) |
sum the solution and residual norms, and compute the heavy-quark norm | |
Static Public Member Functions | |
static int | streams () |
static int | flops () |
total number of input and output streams | |
Public Attributes | |
Float2 | a |
Float2 | b |
ReduceType | aux |
Variant of the HeavyQuarkResidualNorm kernel: this takes three arguments. The first two are summed together to form the solution, and the third is the residual vector. This removes the need for an additional xpy call in the solvers, improving performance.
Definition at line 510 of file reduce_core.cuh.
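The description above can be illustrated with a small host-only reference. This is a minimal sketch, not QUDA code: the names `HQNorms` and `xpy_heavy_quark_residual_norm`, the flat `std::vector<double>` layout, and the `site_len` parameter are all hypothetical. It also assumes that the heavy-quark norm is accumulated as a per-site ratio of residual norm to solution norm, with a fallback of 1.0 for a site whose solution norm is zero; the page itself does not spell this out.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical result type: the three quantities the reduction is assumed to return.
struct HQNorms {
  double solution_norm2; // sum over sites of |x + y|^2
  double residual_norm2; // sum over sites of |r|^2
  double heavy_quark;    // sum over sites of |r|^2 / |x + y|^2 (assumed definition)
};

// Host-only reference. x and y are summed to form the solution, r is the residual.
// Each site is assumed to hold site_len consecutive real components, and
// y and r are assumed to be at least as long as x.
inline HQNorms xpy_heavy_quark_residual_norm(const std::vector<double> &x,
                                             const std::vector<double> &y,
                                             const std::vector<double> &r,
                                             std::size_t site_len)
{
  HQNorms out{0.0, 0.0, 0.0};
  const std::size_t n_sites = x.size() / site_len;
  for (std::size_t s = 0; s < n_sites; ++s) {
    double sol2 = 0.0; // |x + y|^2 on this site
    double res2 = 0.0; // |r|^2 on this site
    for (std::size_t i = 0; i < site_len; ++i) {
      const std::size_t k = s * site_len + i;
      const double sol = x[k] + y[k]; // fused "xpy": no separate pass over memory
      sol2 += sol * sol;
      res2 += r[k] * r[k];
    }
    out.solution_norm2 += sol2;
    out.residual_norm2 += res2;
    // Assumption: a site with zero solution norm contributes 1.0.
    out.heavy_quark += (sol2 > 0.0) ? res2 / sol2 : 1.0;
  }
  return out;
}
```

Forming `x + y` inside the same loop that accumulates the norms is what saves the extra pass: a separate xpy call would read and write the vectors once more before the reduction could start.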
typedef scalar<ReduceType>::type quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::real |
Definition at line 511 of file reduce_core.cuh.
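quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::xpyHeavyQuarkResidualNorm_ (const Float2 &a, const Float2 &b) | inline |
Definition at line 515 of file reduce_core.cuh.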
static int quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::flops () | inline, static |
total number of input and output streams
Definition at line 538 of file reduce_core.cuh.
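__device__ __host__ void quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::operator() (ReduceType &sum, FloatN &x, FloatN &y, FloatN &z, FloatN &w, FloatN &v) | inline, virtual |
where the reduction is usually computed, along with any auxiliary operations
Implements quda::blas::ReduceFunctor< ReduceType, Float2, FloatN >.
Definition at line 523 of file reduce_core.cuh.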
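__device__ __host__ void quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::post (ReduceType &sum) | inline, virtual |
sum the solution and residual norms, and compute the heavy-quark norm
Reimplemented from quda::blas::ReduceFunctor< ReduceType, Float2, FloatN >.
Definition at line 530 of file reduce_core.cuh.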
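__device__ __host__ void quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::pre () | inline, virtual |
pre-computation routine called before the "M-loop"
Reimplemented from quda::blas::ReduceFunctor< ReduceType, Float2, FloatN >.
Definition at line 517 of file reduce_core.cuh.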
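The `pre()`, `operator()` and `post()` members suggest a three-phase protocol: reset per-site state, accumulate inside the "M-loop", then fold the result into the running sum. The sketch below is a host-only analogue of that protocol under stated assumptions, not the QUDA implementation. `Sum3`, `HostXpyHQ` and `reduce_sites` are hypothetical names, the functor takes scalars rather than `FloatN` references, and it is assumed that one pre/loop/post cycle runs per site and that `aux` holds the per-site solution and residual norms.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for ReduceType: three accumulated components.
struct Sum3 {
  double x; // solution norm
  double y; // residual norm
  double z; // heavy-quark norm
};

// Hypothetical host-only analogue of the functor's three phases.
struct HostXpyHQ {
  Sum3 aux{0.0, 0.0, 0.0}; // per-site scratch, mirrors the aux attribute

  // Phase 1: clear the per-site accumulators before the M-loop.
  void pre()
  {
    aux.x = 0.0;
    aux.y = 0.0;
  }

  // Phase 2: one M-loop iteration; x + y forms the solution, r is the residual.
  void operator()(double x, double y, double r)
  {
    const double sol = x + y;
    aux.x += sol * sol;
    aux.y += r * r;
  }

  // Phase 3: add the site norms to the running sum and form the site ratio.
  // Assumption: a site with zero solution norm contributes 1.0.
  void post(Sum3 &sum) const
  {
    sum.x += aux.x;
    sum.y += aux.y;
    sum.z += (aux.x > 0.0) ? aux.y / aux.x : 1.0;
  }
};

// Hypothetical driver: one pre / M-loop / post cycle per site of m components.
inline Sum3 reduce_sites(const std::vector<double> &x, const std::vector<double> &y,
                         const std::vector<double> &r, std::size_t m)
{
  Sum3 sum{0.0, 0.0, 0.0};
  HostXpyHQ f;
  for (std::size_t s = 0; s < x.size() / m; ++s) {
    f.pre();
    for (std::size_t i = 0; i < m; ++i) {
      const std::size_t k = s * m + i;
      f(x[k], y[k], r[k]);
    }
    f.post(sum);
  }
  return sum;
}
```

Keeping the site-local norms in `aux` rather than writing straight into `sum` is what lets `post()` form a per-site ratio; accumulating directly into the global sum would only yield the ratio of the totals.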
static int quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::streams () | inline, static |
Definition at line 537 of file reduce_core.cuh.
Float2 quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::a |
Definition at line 512 of file reduce_core.cuh.
ReduceType quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::aux |
Definition at line 514 of file reduce_core.cuh.
Float2 quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::b |
Definition at line 513 of file reduce_core.cuh.