Condor Job on Hold
How to get the output:
condor_q| awk '/ H / {print $1}'|xargs -n 1 condor_q -long | grep HoldReason
Error: Failed to open file
Error Message
Error from starter on slot?@n???.atlas.local: Failed to open '/home/user/path/to/file' as standard output: No such file or directory (errno 2)
Investigation
On the node we found this error message in /var/log/syslog:
Jun 25 17:59:26 n0036 automount[1560]: >> Cannot create link /etc/mtab~
Jun 25 17:59:26 n0036 automount[1560]: >> Perhaps there is a stale lock file?
Jun 25 17:59:26 n0036 automount[1560]: failed to mount autofs path /home
Jun 25 17:59:26 n0036 automount[1560]: /home: mount failed!
Maybe there was a stale file around, but when checking it was not there.
Cure
/etc/init.d/autofs reload
State
unresolved since the initial cause is still unknown
--
CarstenAulbert - 26 Jun 2008