Cray
Cray XK6/XK7
Cray XK6 machines
GPU performance tracking
1. Configuring TAU:
module load cudatoolkit ./configure -arch=craycnl -cuda="$CRAY_CUDATOOLKIT_DIR" -cudalibrary="$CRAY_CUDATOOLKIT_POST_LINK_OPTS" -bfd=download
Setup your environment:
export PATH=<path to tau2>/craycnl/bin:$PATH export LD_LIBRARY_PATH=<path to tau2>/craycnl/lib:$LD_LIBRARY_PATH
2. CUDA
Build as normally would, and modify your run command to be:
aprun -N 1 tau_exec -T serial,cupti -cupti ./matmult
3. OpenACC
Both PGI and Cray uses the CUDA driver API to interact with the GPU, so setup TAU to collect those calls:
export TAU_CUPTI_API=driver
Compile as normally would and run with tau_exec as well:
aprun -N 1 tau_exec -T serial,cupti -cupti ./matmult
4. Viewing profiles
You can view TAU profile either through pprof (text-basied) or paraprof (GUI).
5. Tracing
Traces can be capture by setting:
export TAU_TRACE=1
before running your application. The traces need to be post-processed as well, issue these commands:
aprun -N 1 tau_exec -T serial,cupti -cupti ./matmult tau_multimerge tau2slog2 tau.trc tau.edf -o stencil2d.slog2 jumpshot stencil2d.slog2
Jumpshot is a commond Trace visualizer bundled with TAU.