QUDA  1.0.0
All Classes Namespaces Files Functions Variables Typedefs Enumerations Enumerator Friends Macros
Public Member Functions | Protected Member Functions | Protected Attributes | Static Protected Attributes | List of all members
quda::Dslash5< Float, nColor, Arg > Class Template Reference
Inheritance diagram for quda::Dslash5< Float, nColor, Arg >:
Inheritance graph
[legend]
Collaboration diagram for quda::Dslash5< Float, nColor, Arg >:
Collaboration graph
[legend]

Public Member Functions

 Dslash5 (Arg &arg, const ColorSpinorField &meta)
 
virtual ~Dslash5 ()
 
template<typename T >
void launch (T *f, const TuneParam &tp, Arg &arg, const cudaStream_t &stream)
 
void apply (const cudaStream_t &stream)
 
void initTuneParam (TuneParam &param) const
 
void defaultTuneParam (TuneParam &param) const
 
TuneKey tuneKey () const
 
- Public Member Functions inherited from quda::TunableVectorYZ
 TunableVectorYZ (unsigned int vector_length_y, unsigned int vector_length_z)
 
bool advanceBlockDim (TuneParam &param) const
 
void resizeVector (int y, int z) const
 
void resizeStep (int y, int z) const
 
- Public Member Functions inherited from quda::TunableVectorY
 TunableVectorY (unsigned int vector_length_y)
 
void resizeVector (int y) const
 
void resizeStep (int y) const
 
- Public Member Functions inherited from quda::Tunable
 Tunable ()
 
virtual ~Tunable ()
 
virtual void preTune ()
 
virtual void postTune ()
 
virtual int tuningIter () const
 
virtual std::string paramString (const TuneParam &param) const
 
virtual std::string perfString (float time) const
 
virtual bool advanceTuneParam (TuneParam &param) const
 
void checkLaunchParam (TuneParam &param)
 
CUresult jitifyError () const
 
CUresult & jitifyError ()
 

Protected Member Functions

long long flops () const
 
long long bytes () const
 
bool tuneGridDim () const
 
unsigned int minThreads () const
 
int blockStep () const
 
int blockMin () const
 
unsigned int sharedBytesPerThread () const
 
unsigned int maxSharedBytesPerBlock () const
 The maximum shared memory that a CUDA thread block can use in the autotuner. This isn't necessarily the same as maxDynamicSharedMemoryPerBlock since that may need explicit opt in to enable (by calling setMaxDynamicSharedBytes for the kernel in question). If the CUDA kernel in question does this opt in then this function can be overloaded to return maxDynamicSharedBytesPerBlock. More...
 
- Protected Member Functions inherited from quda::TunableVectorY
virtual unsigned int sharedBytesPerBlock (const TuneParam &param) const
 
- Protected Member Functions inherited from quda::Tunable
virtual bool tuneAuxDim () const
 
virtual bool tuneSharedBytes () const
 
virtual bool advanceGridDim (TuneParam &param) const
 
virtual unsigned int maxBlockSize (const TuneParam &param) const
 
virtual unsigned int maxGridSize () const
 
virtual unsigned int minGridSize () const
 
virtual int gridStep () const
 gridStep sets the step size when iterating the grid size in advanceGridDim. More...
 
virtual void resetBlockDim (TuneParam &param) const
 
unsigned int maxBlocksPerSM () const
 For some reason this can't be queried from the device properties, so here we set set this. Based on Table 14 of the CUDA Programming Guide 10.0 (Technical Specifications per Compute Capability) More...
 
template<typename F >
void setMaxDynamicSharedBytesPerBlock (F *func) const
 Enable the maximum dynamic shared bytes for the kernel "func" (values given by maxDynamicSharedBytesPerBlock()). More...
 
unsigned int maxDynamicSharedBytesPerBlock () const
 This can't be correctly queried in CUDA for all architectures so here we set set this. Based on Table 14 of the CUDA Programming Guide 10.0 (Technical Specifications per Compute Capability). More...
 
virtual bool advanceSharedBytes (TuneParam &param) const
 
virtual bool advanceAux (TuneParam &param) const
 
int writeAuxString (const char *format,...)
 

Protected Attributes

Argarg
 
const ColorSpinorFieldmeta
 
- Protected Attributes inherited from quda::TunableVectorY
unsigned int vector_length_y
 
unsigned int step_y
 
bool tune_block_x
 
- Protected Attributes inherited from quda::Tunable
char aux [TuneKey::aux_n]
 
CUresult jitify_error
 

Static Protected Attributes

static constexpr bool shared = true
 
static constexpr bool var_inverse = true
 

Detailed Description

template<typename Float, int nColor, typename Arg>
class quda::Dslash5< Float, nColor, Arg >

Definition at line 20 of file dslash5_domain_wall.cu.

Constructor & Destructor Documentation

◆ Dslash5()

template<typename Float, int nColor, typename Arg>
quda::Dslash5< Float, nColor, Arg >::Dslash5 ( Arg arg,
const ColorSpinorField meta 
)
inline

◆ ~Dslash5()

template<typename Float, int nColor, typename Arg>
virtual quda::Dslash5< Float, nColor, Arg >::~Dslash5 ( )
inlinevirtual

Definition at line 118 of file dslash5_domain_wall.cu.

Member Function Documentation

◆ apply()

template<typename Float, int nColor, typename Arg>
void quda::Dslash5< Float, nColor, Arg >::apply ( const cudaStream_t &  stream)
inlinevirtual

◆ blockMin()

template<typename Float, int nColor, typename Arg>
int quda::Dslash5< Float, nColor, Arg >::blockMin ( ) const
inlineprotectedvirtual

Reimplemented from quda::Tunable.

Definition at line 80 of file dslash5_domain_wall.cu.

◆ blockStep()

template<typename Float, int nColor, typename Arg>
int quda::Dslash5< Float, nColor, Arg >::blockStep ( ) const
inlineprotectedvirtual

Reimplemented from quda::Tunable.

Definition at line 79 of file dslash5_domain_wall.cu.

◆ bytes()

template<typename Float, int nColor, typename Arg>
long long quda::Dslash5< Float, nColor, Arg >::bytes ( ) const
inlineprotectedvirtual

◆ defaultTuneParam()

template<typename Float, int nColor, typename Arg>
void quda::Dslash5< Float, nColor, Arg >::defaultTuneParam ( TuneParam param) const
inlinevirtual

◆ flops()

template<typename Float, int nColor, typename Arg>
long long quda::Dslash5< Float, nColor, Arg >::flops ( ) const
inlineprotectedvirtual

◆ initTuneParam()

template<typename Float, int nColor, typename Arg>
void quda::Dslash5< Float, nColor, Arg >::initTuneParam ( TuneParam param) const
inlinevirtual

◆ launch()

template<typename Float, int nColor, typename Arg>
template<typename T >
void quda::Dslash5< Float, nColor, Arg >::launch ( T *  f,
const TuneParam tp,
Arg arg,
const cudaStream_t &  stream 
)
inline

◆ maxSharedBytesPerBlock()

template<typename Float, int nColor, typename Arg>
unsigned int quda::Dslash5< Float, nColor, Arg >::maxSharedBytesPerBlock ( ) const
inlineprotectedvirtual

The maximum shared memory that a CUDA thread block can use in the autotuner. This isn't necessarily the same as maxDynamicSharedMemoryPerBlock since that may need explicit opt in to enable (by calling setMaxDynamicSharedBytes for the kernel in question). If the CUDA kernel in question does this opt in then this function can be overloaded to return maxDynamicSharedBytesPerBlock.

Returns
The maximum shared bytes limit per block the autotung will utilize.

Reimplemented from quda::Tunable.

Definition at line 93 of file dslash5_domain_wall.cu.

References quda::M5_INV_DWF, quda::M5_INV_MOBIUS, quda::M5_INV_ZMOBIUS, quda::Tunable::maxDynamicSharedBytesPerBlock(), and quda::Tunable::maxSharedBytesPerBlock().

Here is the call graph for this function:

◆ minThreads()

template<typename Float, int nColor, typename Arg>
unsigned int quda::Dslash5< Float, nColor, Arg >::minThreads ( ) const
inlineprotectedvirtual

Reimplemented from quda::Tunable.

Definition at line 78 of file dslash5_domain_wall.cu.

◆ sharedBytesPerThread()

template<typename Float, int nColor, typename Arg>
unsigned int quda::Dslash5< Float, nColor, Arg >::sharedBytesPerThread ( ) const
inlineprotectedvirtual

Reimplemented from quda::TunableVectorY.

Definition at line 81 of file dslash5_domain_wall.cu.

References quda::M5_INV_DWF, quda::M5_INV_MOBIUS, quda::M5_INV_ZMOBIUS, nColor, and quda::ColorSpinorField::Nspin().

Referenced by quda::Dslash5< Float, nColor, Arg >::defaultTuneParam(), and quda::Dslash5< Float, nColor, Arg >::initTuneParam().

Here is the call graph for this function:
Here is the caller graph for this function:

◆ tuneGridDim()

template<typename Float, int nColor, typename Arg>
bool quda::Dslash5< Float, nColor, Arg >::tuneGridDim ( ) const
inlineprotectedvirtual

Reimplemented from quda::Tunable.

Definition at line 77 of file dslash5_domain_wall.cu.

◆ tuneKey()

template<typename Float, int nColor, typename Arg>
TuneKey quda::Dslash5< Float, nColor, Arg >::tuneKey ( ) const
inlinevirtual

Implements quda::Tunable.

Definition at line 212 of file dslash5_domain_wall.cu.

References quda::Tunable::aux, and quda::LatticeField::VolString().

Here is the call graph for this function:

Member Data Documentation

◆ arg

template<typename Float, int nColor, typename Arg>
Arg& quda::Dslash5< Float, nColor, Arg >::arg
protected

Definition at line 24 of file dslash5_domain_wall.cu.

Referenced by quda::ApplyDslash5().

◆ meta

template<typename Float, int nColor, typename Arg>
const ColorSpinorField& quda::Dslash5< Float, nColor, Arg >::meta
protected

Definition at line 25 of file dslash5_domain_wall.cu.

◆ shared

template<typename Float, int nColor, typename Arg>
constexpr bool quda::Dslash5< Float, nColor, Arg >::shared = true
staticprotected

Definition at line 26 of file dslash5_domain_wall.cu.

◆ var_inverse

template<typename Float, int nColor, typename Arg>
constexpr bool quda::Dslash5< Float, nColor, Arg >::var_inverse = true
staticprotected

Whether to use variable or fixed coefficient algorithm. Must be true if using ZMOBIUS

Definition at line 29 of file dslash5_domain_wall.cu.


The documentation for this class was generated from the following file: