|
QUDA
0.9.0
|


Public Member Functions | |
| CopyGaugeEx (CopyGaugeExArg< OutOrder, InOrder > &arg, const GaugeField &meta, QudaFieldLocation location) | |
| virtual | ~CopyGaugeEx () |
| void | apply (const cudaStream_t &stream) |
| TuneKey | tuneKey () const |
| long long | flops () const |
| long long | bytes () const |
Private Member Functions | |
| unsigned int | sharedBytesPerThread () const |
| unsigned int | sharedBytesPerBlock (const TuneParam ¶m) const |
| bool | tuneGridDim () const |
| unsigned int | minThreads () const |
Private Member Functions inherited from quda::Tunable | |
| Tunable () | |
| virtual | ~Tunable () |
| virtual void | preTune () |
| virtual void | postTune () |
| virtual int | tuningIter () const |
| virtual std::string | paramString (const TuneParam ¶m) const |
| virtual std::string | perfString (float time) const |
| virtual void | initTuneParam (TuneParam ¶m) const |
| virtual void | defaultTuneParam (TuneParam ¶m) const |
| virtual bool | advanceTuneParam (TuneParam ¶m) const |
| void | checkLaunchParam (TuneParam ¶m) |
| virtual bool | tuneAuxDim () const |
| virtual bool | tuneSharedBytes () const |
| virtual bool | advanceGridDim (TuneParam ¶m) const |
| virtual unsigned int | maxBlockSize () const |
| virtual unsigned int | maxGridSize () const |
| virtual unsigned int | minGridSize () const |
| virtual int | blockStep () const |
| virtual int | blockMin () const |
| virtual bool | advanceBlockDim (TuneParam ¶m) const |
| unsigned int | maxBlocksPerSM () const |
| For reason this can't be queried from the device properties, so here we set set this. Based on Table 14 of the CUDA Programming Guide 9.0 (Technical Specifications per Compute Capability) More... | |
| virtual bool | advanceSharedBytes (TuneParam ¶m) const |
| virtual bool | advanceAux (TuneParam ¶m) const |
| int | writeAuxString (const char *format,...) |
Private Attributes | |
| CopyGaugeExArg< OutOrder, InOrder > | arg |
| const GaugeField & | meta |
| QudaFieldLocation | location |
Private Attributes inherited from quda::Tunable | |
| char | aux [TuneKey::aux_n] |
Definition at line 110 of file copy_gauge_extended.cu.
|
inline |
Definition at line 123 of file copy_gauge_extended.cu.
References quda::arg().

|
inlinevirtual |
Definition at line 127 of file copy_gauge_extended.cu.
|
inlinevirtual |
Implements quda::Tunable.
Definition at line 129 of file copy_gauge_extended.cu.
References quda::arg(), quda::TuneParam::block, getTuning(), getVerbosity(), quda::TuneParam::grid, QUDA_CPU_FIELD_LOCATION, QUDA_CUDA_FIELD_LOCATION, quda::TuneParam::shared_bytes, quda::stream, and quda::tuneLaunch().

|
inlinevirtual |
Reimplemented from quda::Tunable.
Definition at line 148 of file copy_gauge_extended.cu.
References quda::arg().

|
inlinevirtual |
Implements quda::Tunable.
Definition at line 147 of file copy_gauge_extended.cu.
|
inlineprivatevirtual |
Reimplemented from quda::Tunable.
Definition at line 120 of file copy_gauge_extended.cu.
References quda::arg().

|
inlineprivatevirtual |
Implements quda::Tunable.
Definition at line 117 of file copy_gauge_extended.cu.
|
inlineprivatevirtual |
Implements quda::Tunable.
Definition at line 116 of file copy_gauge_extended.cu.
|
inlineprivatevirtual |
Reimplemented from quda::Tunable.
Definition at line 119 of file copy_gauge_extended.cu.
|
inlinevirtual |
Implements quda::Tunable.
Definition at line 143 of file copy_gauge_extended.cu.
References quda::LatticeField::VolString().

|
private |
Definition at line 111 of file copy_gauge_extended.cu.
|
private |
Definition at line 113 of file copy_gauge_extended.cu.
|
private |
Definition at line 112 of file copy_gauge_extended.cu.
1.8.14