|
QUDA
0.9.0
|


Public Member Functions | |
| CopySpinorEx (CopySpinorExArg< OutOrder, InOrder, Basis > &arg, const ColorSpinorField &meta, QudaFieldLocation location) | |
| virtual | ~CopySpinorEx () |
| void | apply (const cudaStream_t &stream) |
| TuneKey | tuneKey () const |
| long long | flops () const |
| long long | bytes () const |
Private Member Functions | |
| unsigned int | sharedBytesPerThread () const |
| unsigned int | sharedBytesPerBlock (const TuneParam ¶m) const |
| bool | advanceSharedBytes (TuneParam ¶m) const |
| bool | tuneGridDim () const |
| unsigned int | minThreads () const |
Private Member Functions inherited from quda::Tunable | |
| Tunable () | |
| virtual | ~Tunable () |
| virtual void | preTune () |
| virtual void | postTune () |
| virtual int | tuningIter () const |
| virtual std::string | paramString (const TuneParam ¶m) const |
| virtual std::string | perfString (float time) const |
| virtual void | initTuneParam (TuneParam ¶m) const |
| virtual void | defaultTuneParam (TuneParam ¶m) const |
| virtual bool | advanceTuneParam (TuneParam ¶m) const |
| void | checkLaunchParam (TuneParam ¶m) |
| virtual bool | tuneAuxDim () const |
| virtual bool | tuneSharedBytes () const |
| virtual bool | advanceGridDim (TuneParam ¶m) const |
| virtual unsigned int | maxBlockSize () const |
| virtual unsigned int | maxGridSize () const |
| virtual unsigned int | minGridSize () const |
| virtual int | blockStep () const |
| virtual int | blockMin () const |
| virtual bool | advanceBlockDim (TuneParam ¶m) const |
| unsigned int | maxBlocksPerSM () const |
| For reason this can't be queried from the device properties, so here we set set this. Based on Table 14 of the CUDA Programming Guide 9.0 (Technical Specifications per Compute Capability) More... | |
| virtual bool | advanceAux (TuneParam ¶m) const |
| int | writeAuxString (const char *format,...) |
Private Attributes | |
| CopySpinorExArg< OutOrder, InOrder, Basis > | arg |
| const ColorSpinorField & | meta |
| QudaFieldLocation | location |
Private Attributes inherited from quda::Tunable | |
| char | aux [TuneKey::aux_n] |
Definition at line 236 of file extended_color_spinor_utilities.cu.
|
inline |
Definition at line 250 of file extended_color_spinor_utilities.cu.
References quda::arg().

|
inlinevirtual |
Definition at line 254 of file extended_color_spinor_utilities.cu.
|
inlineprivatevirtual |
The goal here is to throttle the number of thread blocks per SM by over-allocating shared memory (in order to improve L2 utilization, etc.). We thus request the smallest amount of dynamic shared memory that guarantees throttling to a given number of blocks, in order to allow some extra leeway.
Reimplemented from quda::Tunable.
Definition at line 245 of file extended_color_spinor_utilities.cu.
|
inlinevirtual |
Implements quda::Tunable.
Definition at line 256 of file extended_color_spinor_utilities.cu.
References quda::arg(), quda::TuneParam::block, getTuning(), getVerbosity(), quda::TuneParam::grid, QUDA_CPU_FIELD_LOCATION, QUDA_CUDA_FIELD_LOCATION, quda::TuneParam::shared_bytes, quda::stream, and quda::tuneLaunch().
Referenced by quda::copySpinorEx().


|
inlinevirtual |
Reimplemented from quda::Tunable.
Definition at line 270 of file extended_color_spinor_utilities.cu.
References quda::arg().

|
inlinevirtual |
Implements quda::Tunable.
Definition at line 269 of file extended_color_spinor_utilities.cu.
|
inlineprivatevirtual |
Reimplemented from quda::Tunable.
Definition at line 247 of file extended_color_spinor_utilities.cu.
References quda::arg().

|
inlineprivatevirtual |
Implements quda::Tunable.
Definition at line 244 of file extended_color_spinor_utilities.cu.
|
inlineprivatevirtual |
Implements quda::Tunable.
Definition at line 243 of file extended_color_spinor_utilities.cu.
|
inlineprivatevirtual |
Reimplemented from quda::Tunable.
Definition at line 246 of file extended_color_spinor_utilities.cu.
|
inlinevirtual |
Implements quda::Tunable.
Definition at line 267 of file extended_color_spinor_utilities.cu.
References quda::LatticeField::VolString().

|
private |
Definition at line 238 of file extended_color_spinor_utilities.cu.
|
private |
Definition at line 240 of file extended_color_spinor_utilities.cu.
|
private |
Definition at line 239 of file extended_color_spinor_utilities.cu.
1.8.14