| 9 | 1. in palm_wd, create one item for each remote host in the following three lists hostname, username and description, e.g. |
| 10 | {{{ |
| 11 | hostname = ["hlogin.hlrn.de", "blogin.hlrn.de"] |
| 12 | username = ["nikname" , "nikname" ] |
| 13 | description = ["Hannover" , "Berlin" ] |
| 14 | }}} |
| 15 | here hostname is the IP of the remote host (assuming that a passwordless login via ssh-key is available), username is the user name on the remote host, and description is an arbritrary string to identify the host. |
| 16 | |
| 17 | Additionally, the update_frequency can be adjusted: |
| 18 | {{{ |
| 19 | update_frequency = 600*1000 |
| 20 | }}} |
| 21 | |
| 22 | 2. in palm_wdd, system-specific configurations must be made. The default is configured to be used on the Cray-XC40 at HLRN-III and reads |
| 23 | {{{ |
| 24 | cmd_readqueue = "showq | egrep " |
| 25 | cmd_tmpdir = "/gfs1/tmp/" |
| 26 | cmd_canceljob = "canceljob" |
| 27 | cmd_checkjob = "checkjob" |
| 28 | cmd_realname_grep = "AName" |
| 29 | cmd_starttime = "showstart" |
| 30 | cmd_starttime_grep = "start in" |
| 31 | }}} |
| 32 | For other hosts, the parameters above must be adjusted appropriately. |
| 33 | |
| 34 | 3. Copy palm_wdd into the $HOME directory of each of the remote hosts, i.e. for HLRN-III: |
| 35 | {{{ |
| 36 | scp palm_wdd, nikname@hlogin.hlrn.de |
| 37 | scp palm_wdd, nikname@blogin.hlrn.de |
| 38 | }}} |
| 39 | |
| 40 | 4. Create database files for the watchdog in your working directory: |
| 41 | {{{ |
| 42 | cp $PALM_BIN/palm_wd_files/.wd.olddata $HOME/palm/current_version |
| 43 | cp $PALM_BIN/palm_wd_files/.wd.newdata $HOME/palm/current_version |
| 44 | }}} |