QUDA  0.9.0
Public Member Functions | Private Member Functions | Private Attributes | List of all members
quda::DslashCoarsePolicyTune Class Reference
Inheritance diagram for quda::DslashCoarsePolicyTune:
Inheritance graph
[legend]
Collaboration diagram for quda::DslashCoarsePolicyTune:
Collaboration graph
[legend]

Public Member Functions

 DslashCoarsePolicyTune (DslashCoarseLaunch &dslash)
 
virtual ~DslashCoarsePolicyTune ()
 
void apply (const cudaStream_t &stream)
 
int tuningIter () const
 
bool advanceAux (TuneParam &param) const
 
bool advanceTuneParam (TuneParam &param) const
 
void initTuneParam (TuneParam &param) const
 
void defaultTuneParam (TuneParam &param) const
 
TuneKey tuneKey () const
 
long long flops () const
 
long long bytes () const
 
- Public Member Functions inherited from quda::Tunable
 Tunable ()
 
virtual ~Tunable ()
 
virtual void preTune ()
 
virtual void postTune ()
 
virtual std::string paramString (const TuneParam &param) const
 
virtual std::string perfString (float time) const
 
void checkLaunchParam (TuneParam &param)
 

Private Member Functions

bool tuneGridDim () const
 
bool tuneAuxDim () const
 
unsigned int sharedBytesPerThread () const
 
unsigned int sharedBytesPerBlock (const TuneParam &param) const
 

Private Attributes

DslashCoarseLaunchdslash
 

Additional Inherited Members

- Protected Member Functions inherited from quda::Tunable
virtual unsigned int minThreads () const
 
virtual bool tuneSharedBytes () const
 
virtual bool advanceGridDim (TuneParam &param) const
 
virtual unsigned int maxBlockSize () const
 
virtual unsigned int maxGridSize () const
 
virtual unsigned int minGridSize () const
 
virtual int blockStep () const
 
virtual int blockMin () const
 
virtual bool advanceBlockDim (TuneParam &param) const
 
unsigned int maxBlocksPerSM () const
 For reason this can't be queried from the device properties, so here we set set this. Based on Table 14 of the CUDA Programming Guide 9.0 (Technical Specifications per Compute Capability) More...
 
virtual bool advanceSharedBytes (TuneParam &param) const
 
int writeAuxString (const char *format,...)
 
- Protected Attributes inherited from quda::Tunable
char aux [TuneKey::aux_n]
 

Detailed Description

Definition at line 959 of file dslash_coarse.cu.

Constructor & Destructor Documentation

◆ DslashCoarsePolicyTune()

quda::DslashCoarsePolicyTune::DslashCoarsePolicyTune ( DslashCoarseLaunch dslash)
inline

◆ ~DslashCoarsePolicyTune()

virtual quda::DslashCoarsePolicyTune::~DslashCoarsePolicyTune ( )
inlinevirtual

Definition at line 1027 of file dslash_coarse.cu.

References quda::setPolicyTuning().

Here is the call graph for this function:

Member Function Documentation

◆ advanceAux()

bool quda::DslashCoarsePolicyTune::advanceAux ( TuneParam param) const
inlinevirtual

Reimplemented from quda::Tunable.

Definition at line 1044 of file dslash_coarse.cu.

References param, and quda::policy.

Referenced by advanceTuneParam().

Here is the caller graph for this function:

◆ advanceTuneParam()

bool quda::DslashCoarsePolicyTune::advanceTuneParam ( TuneParam param) const
inlinevirtual

Reimplemented from quda::Tunable.

Definition at line 1055 of file dslash_coarse.cu.

References advanceAux(), and param.

Here is the call graph for this function:

◆ apply()

void quda::DslashCoarsePolicyTune::apply ( const cudaStream_t &  stream)
inlinevirtual

Implements quda::Tunable.

Definition at line 1029 of file dslash_coarse.cu.

References quda::TuneParam::aux, quda::config, dslash, errorQuda, getTuning(), quda::policy, QUDA_DEBUG_VERBOSE, and quda::tuneLaunch().

Here is the call graph for this function:

◆ bytes()

long long quda::DslashCoarsePolicyTune::bytes ( ) const
inlinevirtual

Reimplemented from quda::Tunable.

Definition at line 1080 of file dslash_coarse.cu.

◆ defaultTuneParam()

void quda::DslashCoarsePolicyTune::defaultTuneParam ( TuneParam param) const
inlinevirtual

sets default values for when tuning is disabled

Reimplemented from quda::Tunable.

Definition at line 1062 of file dslash_coarse.cu.

References quda::config, quda::Tunable::defaultTuneParam(), and param.

Here is the call graph for this function:

◆ flops()

long long quda::DslashCoarsePolicyTune::flops ( ) const
inlinevirtual

Implements quda::Tunable.

Definition at line 1071 of file dslash_coarse.cu.

◆ initTuneParam()

void quda::DslashCoarsePolicyTune::initTuneParam ( TuneParam param) const
inlinevirtual

Reimplemented from quda::Tunable.

Definition at line 1057 of file dslash_coarse.cu.

References quda::config, quda::Tunable::initTuneParam(), and param.

Here is the call graph for this function:

◆ sharedBytesPerBlock()

unsigned int quda::DslashCoarsePolicyTune::sharedBytesPerBlock ( const TuneParam param) const
inlineprivatevirtual

Implements quda::Tunable.

Definition at line 966 of file dslash_coarse.cu.

◆ sharedBytesPerThread()

unsigned int quda::DslashCoarsePolicyTune::sharedBytesPerThread ( ) const
inlineprivatevirtual

Implements quda::Tunable.

Definition at line 965 of file dslash_coarse.cu.

◆ tuneAuxDim()

bool quda::DslashCoarsePolicyTune::tuneAuxDim ( ) const
inlineprivatevirtual

Reimplemented from quda::Tunable.

Definition at line 964 of file dslash_coarse.cu.

◆ tuneGridDim()

bool quda::DslashCoarsePolicyTune::tuneGridDim ( ) const
inlineprivatevirtual

Reimplemented from quda::Tunable.

Definition at line 963 of file dslash_coarse.cu.

◆ tuneKey()

TuneKey quda::DslashCoarsePolicyTune::tuneKey ( ) const
inlinevirtual

Implements quda::Tunable.

Definition at line 1067 of file dslash_coarse.cu.

References quda::Tunable::aux.

Referenced by DslashCoarsePolicyTune().

Here is the caller graph for this function:

◆ tuningIter()

int quda::DslashCoarsePolicyTune::tuningIter ( ) const
inlinevirtual

Reimplemented from quda::Tunable.

Definition at line 1042 of file dslash_coarse.cu.

Member Data Documentation

◆ dslash

DslashCoarseLaunch& quda::DslashCoarsePolicyTune::dslash
private

Definition at line 961 of file dslash_coarse.cu.

Referenced by apply(), and DslashCoarsePolicyTune().


The documentation for this class was generated from the following file: