Notes for PPPL TRANSP Support Personnel

Index
Crashed Runs
Expired Proxy
Recovering from other problems
Other Tools

Recovering from other problems:

1. Check logs:
    a) $LOGDIR/<petrel*>/<runid>.log
    b) $RESULTDIR/<tok>/<runid>.pbsout
    c) $LOGDIR/pbslog/

2. Run completed somehow but post-processing needs to be done:
   a) be sure to do: source ~pshare/globus/.userrc_csh 
   b)  tr_recover.pl  <runid> <tok> <year> all 
      tr_recover.pl with all will prompt you for plotcon, etc.
      and does Steps 3 and 4 below.

3. Run completed, but stopped before writing to mdsplus, or failed during
   mdsplot:
    a) <runid>mds.sh
       if crash directory is empty, run manually, e.g.:
       mdsplot T T transp_nstx s transpgrid.pppl.gov n 123450101 q 12345A01
       Get server, tree and mds-shot from:
       $QSHARE/<tok>/<runid>/<runid>_<tok>.REQUEST
       Be sure to check in the REQUEST file the user really wants MDSplus
       output.
    If run did not complete:
    b) source ~pshare/globus/.userrc_csh (or . ~pshare/globus/.userrc)
       finishup <runid>
    c) follow 4.

4. Run stopped after finishup:
     source ~pshare/globus/.userrc_csh (or . ~pshare/globus/.userrc)
     tr_recover.pl <runid> <tok> <year> 
       copies all Output files to $ARCDIR
       corrects status
       sends email to user

Top

The triage program manages the bug tracking of a stopped run. The stopped run can be locked to prevent other developers from simultaneously debugging a run, a cause can be attached which is communicated to the user through the stopped job web page and an action assigned for controling when a stopped job is cleaned up. more info.

Common Errors

a list of TRANSP fatal errors

tr_save

If you made a "private run" with your own TRANSP version, and want to permanently archive it, use tr_save

Pre-requisite:
1. TR.DAT, TR.INF and *.CDF files must be on a cluster directory, that can be accessed by pshare;
  you must provide a TR.INF file
2. Ufiles, as pointed to by TR.DAT must be accessible by pshare
Now do:
1. login as pshare
2. cd < location of your run >
3. tr_save <runid> <tok> <yr>

If you restarted an aborted run of another user, with your private TRANSP code, as "yourself", and want to archive it, see Archiving Notes

mds_get_inf

to retrieve Namelist, Contents (TF.PLN), or other text nodes from MDSplus, or to get names of Ufiles.

Looking up user - pshr####

On transpgrid:/etc/grid-security/mdsip.hosts
On cluster: ~pshare/PSHR.LIST

To extract email:
grep pshr#### ~pshare/PSHR.LIST | cut -d: -f3

Submitting Collaboratory Runs

See Scripts for Experts

Top

Home

Notes for PPPL TRANSP Support Personnel

Crashed Runs:

Expired Proxy:

Recovering from other problems:

Other Tools: