This document describes how to set up for the Bug cluster. If you need more information, look at the UMIACS cluster guide and the Bug cluster guide first.
$ ssh -l username brood00.umiacs.umd.edu $ cd $HOME $ ssh-keygen -t rsa1 -N "" -f $HOME/.ssh/identity $ ssh-keygen -t rsa -N "" -f $HOME/.ssh/id_rsa $ ssh-keygen -t dsa -N "" -f $HOME/.ssh/id_dsa $ cd .ssh $ touch authorized_keys authorized_keys2 $ cat identity.pub >> authorized_keys $ cat id_rsa.pub id_dsa.pub >> authorized_keys2 $ chmod 640 authorized_keys authorized_keys2
setenv PATH "/usr/local/stow/lam-7.1.4-gm/bin:${PATH}" setenv LD_LIBRARY_PATH "/usr/local/stow/lam-7.1.4-gm/lib:${HOME}"We are adding your home directory to your library path because we need to create a link that renames a library file. From your home directory, run:
$ ln -s /opt/UMtorque/lib/libtorque.so.2.0.0 libtorque.so.0You should see a file named libtorque.so.0 in your home directory. Log out and log back in. Make sure your shell is now finding the correct binaries; if the output of the which command isn't the same as below, you've done something in this step wrong.
$ which mpicc /usr/local/stow/lam-7.1.4-gm/bin/mpicc
$ mpicc -o foo foo.ccompiles foo.c into an executable called foo.
#!/bin/tcsh #PBS -l walltime=00:02:00 #PBS -l nodes=4 cd ~/<YOUR EXECUTABLE DIRECTORY> mpiexec -machinefile ${PBS_NODEFILE} -ssi rpi gm C <YOUR EXECUTABLE AND PARAMETERS>The second and third line are directives to the batch system specifying that your program will require 2 minutes of wall-time on each of 4 nodes. To run this job on the cluster, type:
$ qsub foo.sh NNNNN.queen.umiacs.umd.eduThe NNNNN that get returned is your job number. You can check the status of all your running jobs with the qstat command, and you can delete a job with qdel NNNNN. The standard output and standard error streams of your job are redirected to files named foo.sh.oNNNNN and foo.sh.eNNNNN, respectively. There will be a couple extra lines in the former file which read:
Warning: no access to tty (Bad file descriptor). Thus no job control in this shell.This will always show up at the beginning of the output file; it's just part of the way the system runs. On a normal execution the error file will be empty.