The purpose of this programming assignment is to gain experience in using performance analysis tools for parallel programs. For this assignment, you will run an existing parallel code, LULESH and analyze its performance using HPCToolkit and Hatchet.
You can get LULESH by cloning its git repository as follows:
git clone https://github.com/LLNL/LULESH.git
You can use CMake to build LULESH on deepthought2 by following these steps:
mkdir build
cd build
cmake -DCMAKE_BUILD_TYPE=Release -DCMAKE_CXX_COMPILER=mpicxx -DCMAKE_CXX_FLAGS="-g -O3" -DMPI_CXX_COMPILER=`which mpicxx` -DWITH_OPENMP=Off -DWITH_SILO=Off ..
make
This should produce an executable lulesh2.0
in the build directory.
Lets say you want to run LULESH on 8 processes for 10 iterations/timesteps. This would be the mpirun line:
mpirun -np 8 ./lulesh2.0 -i 10 -p
HPCToolkit is available on deepthought2 via the hpctoolkit/gcc
module. You can use HPCtoolkit to collect profiling data for a parallel program in three steps.
hpcstruct exe
mpirun -np <num_ranks> hpcrun -e WALLCLOCK@5000 ./exe <args>
mpirun -np 1 hpcprof-mpi --metric-db=yes -S exe.hpcstruct -I <path_to_src> <measurements-directory>
hpcprof-mpi
using its from_hpctoolkit
reader.
You can install Hatchet on deepthought2 using:
module load python/3.7.7
pip install hatchet --user
.
OR If you want to use the latest (development) version of hatchet on deepthought2 or your local computer, you can install it by following these steps:
module load python/3.7.7 (only works on deepthought2)
pip install textx --user
git clone https://github.com/hatchet/hatchet.git
Then add the directory where you installed hatchet your PYTHONPATH
and then do:
cd hatchet/
./install.sh
-s 45
and compare the performance of these executions with those in the default mode. Identify the functions/code regions where the code spends disproportionately more time compared to the default mode in task 1.
-s 45
, -s 22
, and -s 15
respectively), and compare the performance of various executions. Identify the functions/code regions that scale poorly as you run on more processes in this strong scaling mpde. Compare the results with the functions you identified in task 1.
You must submit the following files and no other files:
LastName-FirstName-assign2
), compress it to .tar.gz (LastName-FirstName-assign2.tar.gz
) and upload that to ELMS.
Do not include irrelevant files in the tarball.
Replace LastName
and FirstName
with your last and first name, respectively.
The project will be graded as follows:
Component | Percentage |
---|---|
Analysis 1 | 30 |
Analysis 2 | 30 |
Analysis 3 | 30 |
Writeup | 10 |