Difference between revisions of "Openacc"

From Tau Wiki

Revision as of 19:06, 15 March 2012


Matrix Multiply

TAU supports the OpenACC directives available in the PGI compilers, version 12.3 and later. To configure TAU:

./configure -c++=pgCC -cc=pgcc -fortran=pgi -cuda=<path to CUDA 4.1 or greater>
make install
export TAU_MAKEFILE=<path to TAU>/x86_64/lib/Makefile.tau-cupti-pgi
export TAU_OPTIONS='-optVerbose -optShared'
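
The TAU compiler wrappers read TAU_MAKEFILE, so it can be worth checking that the variable points at a real file before compiling. The following is only a minimal sketch: the helper name check_tau_makefile is hypothetical and is not part of TAU.

```shell
# Hypothetical helper (not part of TAU): verify that TAU_MAKEFILE is set
# and names a readable file before invoking the compiler wrappers.
check_tau_makefile() {
    if [ -z "${TAU_MAKEFILE:-}" ]; then
        echo "TAU_MAKEFILE is not set" >&2
        return 1
    fi
    if [ ! -r "$TAU_MAKEFILE" ]; then
        echo "TAU_MAKEFILE does not point to a readable file: $TAU_MAKEFILE" >&2
        return 1
    fi
    echo "Using TAU makefile: $TAU_MAKEFILE"
}
```

Calling check_tau_makefile after the export lines above prints the path in use, or returns a non-zero status with a message on standard error.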

Compile:

tau_f90.sh -ta=nvidia matmult.f90 -o matmult

Run:

tau_exec -T serial,cupti,pgi -cupti ./matmult

Use the TAU analysis tools to view the performance data:

pprof
paraprof
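
The compile, run, and view steps above can be chained so that a failure in one step stops the rest. This is a sketch that uses only the commands shown on this page; the function names run_step and profile_matmult and the DRY_RUN variable are hypothetical conveniences, not TAU features.

```shell
# Hypothetical wrapper around the steps on this page.
# Set DRY_RUN=1 to print each command instead of executing it.
run_step() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

profile_matmult() {
    # Compile with the TAU Fortran wrapper; stop if compilation fails.
    run_step tau_f90.sh -ta=nvidia matmult.f90 -o matmult || return 1
    # Run under tau_exec with CUPTI measurement enabled.
    run_step tau_exec -T serial,cupti,pgi -cupti ./matmult || return 1
    # Show the text profile; paraprof can be launched separately for the GUI.
    run_step pprof
}
```

With DRY_RUN=1 set, profile_matmult only echoes the three commands, which is a quick way to review them before running on a GPU node.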