QUDA  0.9.0
Public Member Functions | Public Attributes | List of all members
quda::blas::multicaxpy_< NXZ, Float2, FloatN > Struct Template Reference
Inheritance diagram for quda::blas::multicaxpy_< NXZ, Float2, FloatN >:
Inheritance graph
[legend]
Collaboration diagram for quda::blas::multicaxpy_< NXZ, Float2, FloatN >:
Collaboration graph
[legend]

Public Member Functions

 multicaxpy_ (const coeff_array< Complex > &a, const coeff_array< Complex > &b, const coeff_array< Complex > &c, int NYW)
 
__device__ __host__ void operator() (FloatN &x, FloatN &y, FloatN &z, FloatN &w, const int i, const int j)
 where the reduction is usually computed and any auxiliary operations More...
 
int streams ()
 
int flops ()
 total number of input and output streams More...
 
- Public Member Functions inherited from quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >
virtual __device__ __host__ void init ()
 pre-computation routine before the main loop More...
 

Public Attributes

const int NYW
 

Detailed Description

template<int NXZ, typename Float2, typename FloatN>
struct quda::blas::multicaxpy_< NXZ, Float2, FloatN >

Definition at line 94 of file multi_blas_quda.cu.

Constructor & Destructor Documentation

◆ multicaxpy_()

template<int NXZ, typename Float2 , typename FloatN >
quda::blas::multicaxpy_< NXZ, Float2, FloatN >::multicaxpy_ ( const coeff_array< Complex > &  a,
const coeff_array< Complex > &  b,
const coeff_array< Complex > &  c,
int  NYW 
)
inline

Definition at line 97 of file multi_blas_quda.cu.

Member Function Documentation

◆ flops()

template<int NXZ, typename Float2 , typename FloatN >
int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::flops ( )
inline

total number of input and output streams

Definition at line 113 of file multi_blas_quda.cu.

References quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW.

◆ operator()()

template<int NXZ, typename Float2 , typename FloatN >
__device__ __host__ void quda::blas::multicaxpy_< NXZ, Float2, FloatN >::operator() ( FloatN &  x,
FloatN &  y,
FloatN &  z,
FloatN &  w,
const int  i,
const int  j 
)
inlinevirtual

where the reduction is usually computed and any auxiliary operations

Implements quda::blas::MultiBlasFunctor< NXZ, Float2, FloatN >.

Definition at line 101 of file multi_blas_quda.cu.

References quda::blas::_caxpy(), a, Amatrix_d, Amatrix_h, fused_exterior_ndeg_tm_dslash_cuda_gen::i, MAX_MULTI_BLAS_N, quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW, x, and y.

Here is the call graph for this function:

◆ streams()

template<int NXZ, typename Float2 , typename FloatN >
int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::streams ( )
inline

Member Data Documentation

◆ NYW

template<int NXZ, typename Float2 , typename FloatN >
const int quda::blas::multicaxpy_< NXZ, Float2, FloatN >::NYW

The documentation for this struct was generated from the following file: