|
QUDA
0.9.0
|
#include <tune_quda.h>


Public Member Functions | |
| TunableVectorYZ (unsigned int vector_length_y, unsigned int vector_length_z) | |
| bool | advanceBlockDim (TuneParam ¶m) const |
| void | initTuneParam (TuneParam ¶m) const |
| void | defaultTuneParam (TuneParam ¶m) const |
| void | resizeVector (int y, int z) |
Public Member Functions inherited from quda::TunableVectorY | |
| TunableVectorY (unsigned int vector_length_y) | |
| void | resizeVector (int y) |
Public Member Functions inherited from quda::Tunable | |
| Tunable () | |
| virtual | ~Tunable () |
| virtual TuneKey | tuneKey () const =0 |
| virtual void | apply (const cudaStream_t &stream)=0 |
| virtual void | preTune () |
| virtual void | postTune () |
| virtual int | tuningIter () const |
| virtual std::string | paramString (const TuneParam ¶m) const |
| virtual std::string | perfString (float time) const |
| virtual bool | advanceTuneParam (TuneParam ¶m) const |
| void | checkLaunchParam (TuneParam ¶m) |
Private Attributes | |
| unsigned | vector_length_z |
Additional Inherited Members | |
Protected Member Functions inherited from quda::TunableVectorY | |
| virtual unsigned int | sharedBytesPerThread () const |
| virtual unsigned int | sharedBytesPerBlock (const TuneParam ¶m) const |
Protected Member Functions inherited from quda::Tunable | |
| virtual long long | flops () const =0 |
| virtual long long | bytes () const |
| virtual unsigned int | minThreads () const |
| virtual bool | tuneGridDim () const |
| virtual bool | tuneAuxDim () const |
| virtual bool | tuneSharedBytes () const |
| virtual bool | advanceGridDim (TuneParam ¶m) const |
| virtual unsigned int | maxBlockSize () const |
| virtual unsigned int | maxGridSize () const |
| virtual unsigned int | minGridSize () const |
| virtual int | blockStep () const |
| virtual int | blockMin () const |
| unsigned int | maxBlocksPerSM () const |
| For reason this can't be queried from the device properties, so here we set set this. Based on Table 14 of the CUDA Programming Guide 9.0 (Technical Specifications per Compute Capability) More... | |
| virtual bool | advanceSharedBytes (TuneParam ¶m) const |
| virtual bool | advanceAux (TuneParam ¶m) const |
| int | writeAuxString (const char *format,...) |
Protected Attributes inherited from quda::TunableVectorY | |
| unsigned int | vector_length_y |
Protected Attributes inherited from quda::Tunable | |
| char | aux [TuneKey::aux_n] |
Definition at line 400 of file tune_quda.h.
|
inline |
Definition at line 405 of file tune_quda.h.
|
inlinevirtual |
Reimplemented from quda::TunableVectorY.
Definition at line 408 of file tune_quda.h.
References quda::TunableVectorY::advanceBlockDim(), deg_tm_dslash_cuda_gen::block(), deviceProp, param, ret, and vector_length_z.

|
inlinevirtual |
sets default values for when tuning is disabled
Reimplemented from quda::TunableVectorY.
Definition at line 441 of file tune_quda.h.
References quda::TunableVectorY::defaultTuneParam(), param, and vector_length_z.

|
inlinevirtual |
Reimplemented from quda::TunableVectorY.
Definition at line 433 of file tune_quda.h.
References quda::TunableVectorY::initTuneParam(), param, and vector_length_z.

Definition at line 448 of file tune_quda.h.
References quda::TunableVectorY::resizeVector(), vector_length_z, y, and z.
Referenced by quda::CalculateY< from_coarse, Float, fineSpin, fineColor, coarseSpin, coarseColor, Arg >::setComputeType().


|
mutableprivate |
Definition at line 402 of file tune_quda.h.
Referenced by advanceBlockDim(), defaultTuneParam(), initTuneParam(), and resizeVector().
1.8.14