Yes, please... if one of my units fails or otherwise goes off line, it basically brings the entire cluster down... very annoying.
this is exactly what I wanted to say
Guys, as a temporary workaround, you can wrap your cgminer call in a .bat file (Windows), or the Linux equivalent. Something like this (again, Windows code):
:loop
cgminer
waitfor /t 10 signame
goto loop
If cgminer quits for any reason (such as a Single going offline), the batch file will pause for 10 seconds (adjust this to whatever you want) and then runs cgminer again. If you ever want to stop the infinite loop and get out of the batch file, hit
and answer 'y' to the prompt that appears (again, Windows).
Epoch, cgminer crashes. If process is still running, your script will not work.
It will only work when cgminer quits gracefully or calls exit()/abort().
That is not what I am seeing. I have tried this exact scenario on 4 separate Windows 7/64 machines. When I unplug one of the Singles, cgminer simply quits. Perhaps not gracefully, but it does not hang or lock up. It merely quits ... essentially back to the command prompt. My script then gets control and loops back to re-run it; it has worked fine on 4 different machines.
Some of the recent earlier versions of cgminer used to crash/lock-up. But the recent releases (certainly 2.3.6) do not and merely quit to the command prompt.
I realize that if cgminer hangs, this workaround will not work. The real solution is a fix in cgminer. I offer this looping .bat file as something that may work for some of us (Windows 7 at least, perhaps not Linux ... that's something I cannot comment on).
edit: sorry, Luke, I don't mean to go off-topic here with cgminer comments ... I'll stay back on track with bfgminer!
I haven't 'unplugged a Single' running with bfgminer yet to see how it behaves ... I'll try that this evening.