|
QUDA
1.0.0
|


Public Types | |
| typedef scalar< Float2 >::type | real |
Public Member Functions | |
| CdotCopy (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW) | |
| __device__ __host__ void | operator() (ReduceType &sum, FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j) |
| where the reduction is usually computed and any auxiliary operations More... | |
Public Member Functions inherited from quda::blas::MultiReduceFunctor< NXZ, ReduceType, Float2, FloatN > | |
| virtual __device__ __host__ void | pre () |
| pre-computation routine called before the "M-loop" More... | |
| virtual __device__ __host__ void | post (ReduceType &sum) |
| post-computation routine called after the "M-loop" More... | |
Static Public Member Functions | |
| static int | streams () |
| static int | flops () |
| total number of input and output streams More... | |
Public Attributes | |
| const int | NYW |
Definition at line 249 of file multi_reduce_core.cuh.
| typedef scalar<Float2>::type quda::blas::CdotCopy< NXZ, ReduceType, Float2, FloatN >::real |
Definition at line 250 of file multi_reduce_core.cuh.
|
inline |
Definition at line 252 of file multi_reduce_core.cuh.
|
inlinestatic |
total number of input and output streams
Definition at line 264 of file multi_reduce_core.cuh.
|
inlinevirtual |
where the reduction is usually computed and any auxiliary operations
Implements quda::blas::MultiReduceFunctor< NXZ, ReduceType, Float2, FloatN >.
Definition at line 257 of file multi_reduce_core.cuh.
References quda::sum().

|
inlinestatic |
Definition at line 263 of file multi_reduce_core.cuh.
| const int quda::blas::CdotCopy< NXZ, ReduceType, Float2, FloatN >::NYW |
Definition at line 251 of file multi_reduce_core.cuh.
1.8.13