QUDA 0.9.0


Public Types | |
| typedef scalar< ReduceType >::type | real |
Public Member Functions | |
| xpyHeavyQuarkResidualNorm_ (const Float2 &a, const Float2 &b) | |
| __device__ __host__ void | pre () |
| pre-computation routine called before the "M-loop" More... | |
| __device__ __host__ void | operator() (ReduceType &sum, FloatN &x, FloatN &y, FloatN &z, FloatN &w, FloatN &v) |
| where the reduction is usually computed and any auxiliary operations are performed More... | |
| __device__ __host__ void | post (ReduceType &sum) |
| sum the solution and residual norms, and compute the heavy-quark norm More... | |
Static Public Member Functions | |
| static int | streams () |
| total number of input and output streams More... | |
| static int | flops () |
Public Attributes | |
| Float2 | a |
| Float2 | b |
| ReduceType | aux |
Variant of the HeavyQuarkResidualNorm kernel: this takes three arguments. The first two are summed together to form the solution, and the third is the residual vector. This removes the need for an additional xpy call in the solvers, improving performance.
Definition at line 719 of file reduce_quda.cu.
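The pre() / operator() / post() sequence described above can be sketched on the host as follows. This is a minimal illustration under stated assumptions, not the QUDA implementation: `Reduce3`, `Site`, `XpyHeavyQuarkSketch` and `reduce_all` are hypothetical names, the per-site data is a plain vector of real components, the sum x + y is formed on the fly without being written back, and a site whose solution norm is zero is assumed to contribute 1 to the heavy-quark term.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-ins for illustration only; these are not the QUDA types.
struct Reduce3 { double x, y, z; };  // plays the role of ReduceType
using Site = std::vector<double>;    // real components stored for one lattice site

struct XpyHeavyQuarkSketch {
  // Per-site scratch: aux.x = |x + y|^2 (solution), aux.y = |r|^2 (residual).
  Reduce3 aux{0.0, 0.0, 0.0};

  // Reset the per-site accumulators before the inner loop.
  void pre() { aux.x = 0.0; aux.y = 0.0; }

  // Fused step: form the solution x + y on the fly and accumulate both norms,
  // so no separate xpy pass over the data is needed.
  void operator()(const Site &x, const Site &y, const Site &r) {
    assert(x.size() == y.size() && x.size() == r.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
      const double s = x[i] + y[i];
      aux.x += s * s;
      aux.y += r[i] * r[i];
    }
  }

  // Add the site norms to the global sums and accumulate the per-site ratio.
  // Assumption: a site with zero solution norm contributes 1.
  void post(Reduce3 &sum) const {
    sum.x += aux.x;
    sum.y += aux.y;
    sum.z += (aux.x > 0.0) ? aux.y / aux.x : 1.0;
  }
};

// Serial driver standing in for the GPU reduction over all sites.
Reduce3 reduce_all(const std::vector<Site> &x, const std::vector<Site> &y,
                   const std::vector<Site> &r) {
  assert(x.size() == y.size() && x.size() == r.size());
  Reduce3 sum{0.0, 0.0, 0.0};
  XpyHeavyQuarkSketch f;
  for (std::size_t s = 0; s < x.size(); ++s) {
    f.pre();
    f(x[s], y[s], r[s]);
    f.post(sum);
  }
  return sum;
}
```

In this sketch, sum.x and sum.y end up holding the global solution and residual norms squared, while sum.z holds the sum of per-site ratios from which a heavy-quark residual could be formed.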
| typedef scalar<ReduceType>::type quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::real |
Definition at line 720 of file reduce_quda.cu.
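| quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::xpyHeavyQuarkResidualNorm_ (const Float2 &a, const Float2 &b) | inline |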
Definition at line 724 of file reduce_quda.cu.
| static int quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::flops () | inline static |
Definition at line 739 of file reduce_quda.cu.
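| __device__ __host__ void quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::operator() (ReduceType &sum, FloatN &x, FloatN &y, FloatN &z, FloatN &w, FloatN &v) | inline virtual |
where the reduction is usually computed and any auxiliary operations are performed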
Implements quda::blas::ReduceFunctor< ReduceType, Float2, FloatN >.
Definition at line 728 of file reduce_quda.cu.
References quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::aux, x, y, and z.
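| __device__ __host__ void quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::post (ReduceType &sum) | inline virtual |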
sum the solution and residual norms, and compute the heavy-quark norm
Reimplemented from quda::blas::ReduceFunctor< ReduceType, Float2, FloatN >.
Definition at line 733 of file reduce_quda.cu.
References quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::aux, and sum().

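| __device__ __host__ void quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::pre () | inline virtual |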
pre-computation routine called before the "M-loop"
Reimplemented from quda::blas::ReduceFunctor< ReduceType, Float2, FloatN >.
Definition at line 726 of file reduce_quda.cu.
References quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::aux.
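| static int quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::streams () | inline static |
total number of input and output streams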
Definition at line 738 of file reduce_quda.cu.
| Float2 quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::a |
Definition at line 721 of file reduce_quda.cu.
| ReduceType quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::aux |
Definition at line 723 of file reduce_quda.cu.
Referenced by quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::operator()(), quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::post(), and quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::pre().
| Float2 quda::blas::xpyHeavyQuarkResidualNorm_< ReduceType, Float2, FloatN >::b |
Definition at line 722 of file reduce_quda.cu.