QUDA  0.9.0
Public Member Functions | Private Member Functions | Private Attributes | List of all members
quda::QudaMemCopy Class Reference
Inheritance diagram for quda::QudaMemCopy:
Inheritance graph
[legend]
Collaboration diagram for quda::QudaMemCopy:
Collaboration graph
[legend]

Public Member Functions

 QudaMemCopy (void *dst, const void *src, size_t count, cudaMemcpyKind kind, const char *func, const char *file, const char *line)
 
virtual ~QudaMemCopy ()
 
void apply (const cudaStream_t &stream)
 
bool advanceTuneParam (TuneParam &param) const
 
TuneKey tuneKey () const
 
long long flops () const
 
long long bytes () const
 
- Public Member Functions inherited from quda::Tunable
 Tunable ()
 
virtual ~Tunable ()
 
virtual void preTune ()
 
virtual void postTune ()
 
virtual int tuningIter () const
 
virtual std::string paramString (const TuneParam &param) const
 
virtual std::string perfString (float time) const
 
virtual void initTuneParam (TuneParam &param) const
 
virtual void defaultTuneParam (TuneParam &param) const
 
void checkLaunchParam (TuneParam &param)
 

Private Member Functions

unsigned int sharedBytesPerThread () const
 
unsigned int sharedBytesPerBlock (const TuneParam &param) const
 

Private Attributes

void * dst
 
const void * src
 
const size_t count
 
const cudaMemcpyKind kind
 
const char * name
 

Additional Inherited Members

- Protected Member Functions inherited from quda::Tunable
virtual unsigned int minThreads () const
 
virtual bool tuneGridDim () const
 
virtual bool tuneAuxDim () const
 
virtual bool tuneSharedBytes () const
 
virtual bool advanceGridDim (TuneParam &param) const
 
virtual unsigned int maxBlockSize () const
 
virtual unsigned int maxGridSize () const
 
virtual unsigned int minGridSize () const
 
virtual int blockStep () const
 
virtual int blockMin () const
 
virtual bool advanceBlockDim (TuneParam &param) const
 
unsigned int maxBlocksPerSM () const
 For reason this can't be queried from the device properties, so here we set set this. Based on Table 14 of the CUDA Programming Guide 9.0 (Technical Specifications per Compute Capability) More...
 
virtual bool advanceSharedBytes (TuneParam &param) const
 
virtual bool advanceAux (TuneParam &param) const
 
int writeAuxString (const char *format,...)
 
- Protected Attributes inherited from quda::Tunable
char aux [TuneKey::aux_n]
 

Detailed Description

Definition at line 29 of file quda_cuda_api.cpp.

Constructor & Destructor Documentation

◆ QudaMemCopy()

quda::QudaMemCopy::QudaMemCopy ( void *  dst,
const void *  src,
size_t  count,
cudaMemcpyKind  kind,
const char *  func,
const char *  file,
const char *  line 
)
inline

Definition at line 41 of file quda_cuda_api.cpp.

References quda::Tunable::aux, errorQuda, func, kind, name, strcat(), and strcpy().

Here is the call graph for this function:

◆ ~QudaMemCopy()

virtual quda::QudaMemCopy::~QudaMemCopy ( )
inlinevirtual

Definition at line 71 of file quda_cuda_api.cpp.

Member Function Documentation

◆ advanceTuneParam()

bool quda::QudaMemCopy::advanceTuneParam ( TuneParam param) const
inlinevirtual

Reimplemented from quda::Tunable.

Definition at line 99 of file quda_cuda_api.cpp.

◆ apply()

void quda::QudaMemCopy::apply ( const cudaStream_t &  stream)
inlinevirtual

Implements quda::Tunable.

Definition at line 73 of file quda_cuda_api.cpp.

References count, dst, errorQuda, getTuning(), getVerbosity(), kind, memcpy(), src, and quda::tuneLaunch().

Here is the call graph for this function:

◆ bytes()

long long quda::QudaMemCopy::bytes ( ) const
inlinevirtual

Reimplemented from quda::Tunable.

Definition at line 109 of file quda_cuda_api.cpp.

References count, and kind.

◆ flops()

long long quda::QudaMemCopy::flops ( ) const
inlinevirtual

Implements quda::Tunable.

Definition at line 108 of file quda_cuda_api.cpp.

◆ sharedBytesPerBlock()

unsigned int quda::QudaMemCopy::sharedBytesPerBlock ( const TuneParam param) const
inlineprivatevirtual

Implements quda::Tunable.

Definition at line 38 of file quda_cuda_api.cpp.

◆ sharedBytesPerThread()

unsigned int quda::QudaMemCopy::sharedBytesPerThread ( ) const
inlineprivatevirtual

Implements quda::Tunable.

Definition at line 37 of file quda_cuda_api.cpp.

◆ tuneKey()

TuneKey quda::QudaMemCopy::tuneKey ( ) const
inlinevirtual

Implements quda::Tunable.

Definition at line 101 of file quda_cuda_api.cpp.

References quda::Tunable::aux, count, name, strcpy(), and quda::u64toa().

Here is the call graph for this function:

Member Data Documentation

◆ count

const size_t quda::QudaMemCopy::count
private

Definition at line 33 of file quda_cuda_api.cpp.

Referenced by apply(), bytes(), and tuneKey().

◆ dst

void* quda::QudaMemCopy::dst
private

Definition at line 31 of file quda_cuda_api.cpp.

Referenced by apply().

◆ kind

const cudaMemcpyKind quda::QudaMemCopy::kind
private

Definition at line 34 of file quda_cuda_api.cpp.

Referenced by apply(), bytes(), and QudaMemCopy().

◆ name

const char* quda::QudaMemCopy::name
private

Definition at line 35 of file quda_cuda_api.cpp.

Referenced by QudaMemCopy(), and tuneKey().

◆ src

const void* quda::QudaMemCopy::src
private

Definition at line 32 of file quda_cuda_api.cpp.

Referenced by apply().


The documentation for this class was generated from the following file: