QUDA  0.9.0
Public Member Functions | Private Member Functions | Private Attributes | List of all members
quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend > Class Template Reference
Inheritance diagram for quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >:
Inheritance graph
[legend]
Collaboration diagram for quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >:
Collaboration graph
[legend]

Public Member Functions

 CopySpinorEx (CopySpinorExArg< OutOrder, InOrder, Basis > &arg, const ColorSpinorField &meta, QudaFieldLocation location)
 
virtual ~CopySpinorEx ()
 
void apply (const cudaStream_t &stream)
 
TuneKey tuneKey () const
 
long long flops () const
 
long long bytes () const
 

Private Member Functions

unsigned int sharedBytesPerThread () const
 
unsigned int sharedBytesPerBlock (const TuneParam &param) const
 
bool advanceSharedBytes (TuneParam &param) const
 
bool tuneGridDim () const
 
unsigned int minThreads () const
 
- Private Member Functions inherited from quda::Tunable
 Tunable ()
 
virtual ~Tunable ()
 
virtual void preTune ()
 
virtual void postTune ()
 
virtual int tuningIter () const
 
virtual std::string paramString (const TuneParam &param) const
 
virtual std::string perfString (float time) const
 
virtual void initTuneParam (TuneParam &param) const
 
virtual void defaultTuneParam (TuneParam &param) const
 
virtual bool advanceTuneParam (TuneParam &param) const
 
void checkLaunchParam (TuneParam &param)
 
virtual bool tuneAuxDim () const
 
virtual bool tuneSharedBytes () const
 
virtual bool advanceGridDim (TuneParam &param) const
 
virtual unsigned int maxBlockSize () const
 
virtual unsigned int maxGridSize () const
 
virtual unsigned int minGridSize () const
 
virtual int blockStep () const
 
virtual int blockMin () const
 
virtual bool advanceBlockDim (TuneParam &param) const
 
unsigned int maxBlocksPerSM () const
 For reason this can't be queried from the device properties, so here we set set this. Based on Table 14 of the CUDA Programming Guide 9.0 (Technical Specifications per Compute Capability) More...
 
virtual bool advanceAux (TuneParam &param) const
 
int writeAuxString (const char *format,...)
 

Private Attributes

CopySpinorExArg< OutOrder, InOrder, Basis > arg
 
const ColorSpinorFieldmeta
 
QudaFieldLocation location
 
- Private Attributes inherited from quda::Tunable
char aux [TuneKey::aux_n]
 

Detailed Description

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
class quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >

Definition at line 236 of file extended_color_spinor_utilities.cu.

Constructor & Destructor Documentation

◆ CopySpinorEx()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::CopySpinorEx ( CopySpinorExArg< OutOrder, InOrder, Basis > &  arg,
const ColorSpinorField meta,
QudaFieldLocation  location 
)
inline

Definition at line 250 of file extended_color_spinor_utilities.cu.

References quda::arg().

Here is the call graph for this function:

◆ ~CopySpinorEx()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
virtual quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::~CopySpinorEx ( )
inlinevirtual

Definition at line 254 of file extended_color_spinor_utilities.cu.

Member Function Documentation

◆ advanceSharedBytes()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
bool quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::advanceSharedBytes ( TuneParam param) const
inlineprivatevirtual

The goal here is to throttle the number of thread blocks per SM by over-allocating shared memory (in order to improve L2 utilization, etc.). We thus request the smallest amount of dynamic shared memory that guarantees throttling to a given number of blocks, in order to allow some extra leeway.

Reimplemented from quda::Tunable.

Definition at line 245 of file extended_color_spinor_utilities.cu.

◆ apply()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
void quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::apply ( const cudaStream_t &  stream)
inlinevirtual

Implements quda::Tunable.

Definition at line 256 of file extended_color_spinor_utilities.cu.

References quda::arg(), quda::TuneParam::block, getTuning(), getVerbosity(), quda::TuneParam::grid, QUDA_CPU_FIELD_LOCATION, QUDA_CUDA_FIELD_LOCATION, quda::TuneParam::shared_bytes, quda::stream, and quda::tuneLaunch().

Referenced by quda::copySpinorEx().

Here is the call graph for this function:
Here is the caller graph for this function:

◆ bytes()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
long long quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::bytes ( ) const
inlinevirtual

Reimplemented from quda::Tunable.

Definition at line 270 of file extended_color_spinor_utilities.cu.

References quda::arg().

Here is the call graph for this function:

◆ flops()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
long long quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::flops ( ) const
inlinevirtual

Implements quda::Tunable.

Definition at line 269 of file extended_color_spinor_utilities.cu.

◆ minThreads()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
unsigned int quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::minThreads ( ) const
inlineprivatevirtual

Reimplemented from quda::Tunable.

Definition at line 247 of file extended_color_spinor_utilities.cu.

References quda::arg().

Here is the call graph for this function:

◆ sharedBytesPerBlock()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
unsigned int quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::sharedBytesPerBlock ( const TuneParam param) const
inlineprivatevirtual

Implements quda::Tunable.

Definition at line 244 of file extended_color_spinor_utilities.cu.

◆ sharedBytesPerThread()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
unsigned int quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::sharedBytesPerThread ( ) const
inlineprivatevirtual

Implements quda::Tunable.

Definition at line 243 of file extended_color_spinor_utilities.cu.

◆ tuneGridDim()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
bool quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::tuneGridDim ( ) const
inlineprivatevirtual

Reimplemented from quda::Tunable.

Definition at line 246 of file extended_color_spinor_utilities.cu.

◆ tuneKey()

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
TuneKey quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::tuneKey ( ) const
inlinevirtual

Implements quda::Tunable.

Definition at line 267 of file extended_color_spinor_utilities.cu.

References quda::LatticeField::VolString().

Here is the call graph for this function:

Member Data Documentation

◆ arg

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
CopySpinorExArg<OutOrder,InOrder,Basis> quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::arg
private

Definition at line 238 of file extended_color_spinor_utilities.cu.

◆ location

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
QudaFieldLocation quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::location
private

Definition at line 240 of file extended_color_spinor_utilities.cu.

◆ meta

template<typename FloatOut, typename FloatIn, int Ns, int Nc, typename OutOrder, typename InOrder, typename Basis, bool extend>
const ColorSpinorField& quda::CopySpinorEx< FloatOut, FloatIn, Ns, Nc, OutOrder, InOrder, Basis, extend >::meta
private

Definition at line 239 of file extended_color_spinor_utilities.cu.


The documentation for this class was generated from the following file: