cool, I'll check it out
It looks good, however I did see one small issue. I raised the MHz on the flaky soft reset die to stress it and bfgminer wouldn't load. It got stuck in a hard reset loop when the die failed. I was able to change the advanced setting to get out of the loop, but maybe need to delay the hard reset routine for a few min to let bfgminer load when there is a bad die that's overclocked or not turned off
ok I just noticed something ... If I get a bunch of "Got nonce for unknown work in slot xx" errors scrolling then it goes straight to hard reset ... (this may tie in with the bgfminer not loading loop).
other than that everything is working fine
I'll get another cup of coffee & watch this for awhile, see if I can notice anything else that might help you troubleshoot it
...... ok this time the "Got nonce for unknown work in slot xx" errors didn't trigger an instant hard reset and it reset successfully, must have been coincidence earlier
ok looks like just getting random hard resets instead of running thru the soft reset loop. Most of the time it runs thru the loop. If I disable the flaky die there aren't any problems
btw, that die is set at 150 MHz so it has less amp draw
So, for the die thats at 150mhz ... are either of the DCDC's less than 5amp output for more than a few mins? If so, thats why its carrying out the hard reset. Paste contents of /var/log/monitordcdc.log if ya could =) thanks
If it turns out that one of the DCDC's is putting out current but the other is not, I may just change the comparison to DCDC1 AND DCDC2 less than threshold rather than OR. Because both will be less than 5 if the die is not working at all.
I bet that's it, in adv settings one die is showing zero amps, the other die isn't even listed -- I know it's hashing because I turned off the good dies and got 10 mhs in bfgminer for that bad die, about half what it's suppose to be
here's the end of the log, it's getting pretty big
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "200" } }
STATUS=S,When=1439479313,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "200" } }
STATUS=S,When=1439479352,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
KnC: Frequency change FAILED! { "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "200" } }
Soft reset failed, initiatng hard reset
Stopping bfgminer.
Power cycling ASIC# 5
INFO: Attempt to power down dc/dc
INFO: Attempt to power UP dc/dc
Starting bfgminer.
[2015-08-13 15:23:46] Die 5- restarted
Manually disabled die detected, skipping dead die detection. ASIC# 4, DIE# 2
Manually disabled die detected, skipping dead die detection. ASIC# 4, DIE# 4
Moving on with dead die test, no manual disabled die found
[2015-08-13 15:28:51] Die 5-1 requires restart
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "150" } }
STATUS=S,When=1439479743,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "150" } }
STATUS=S,When=1439479781,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "150" } }
STATUS=S,When=1439479818,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "150" } }
STATUS=S,When=1439479857,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "150" } }
STATUS=S,When=1439479897,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "150" } }
STATUS=S,When=1439479936,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
{ "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "150" } }
STATUS=S,When=1439479973,Code=92,Msg=PGA 0 set OK: Die setup Ok; asic 5 die 1 cmd RECONFIGURE,Description=bfgminer 5.2.0|
Attempting softreset of ASIC# 5 DIE# 1
KnC: Frequency change FAILED! { "asic_6_voltage": { "die2": "-0.0366" }, "asic_6_frequency": { "die2": "150" } }
Soft reset failed, initiatng hard reset
I was scrolling thru the log and saw this "Failed multiple soft reset attempts, performing hard reset" -- above it there were 10 soft reset attempts like above -- however there aren't any die configuration failed error messages in bfgminer
.... ok I just confirmed in bfgminer -- 10 successful soft restarts and the 11th goes straight to hard restart :|
... it's not detecting a successful restart because of the low/zero amp issue??