Topic: Miner status - no ASIC readings (Read 307 times)

legendary
Activity: 3500
Merit: 3249
Happy New year 🤗
December 07, 2018, 03:21:03 PM
#8
Quote
I have yet to see an explanation on what HW in the pool section even means :-(

This is what I found:

Quote
HW = A total count of hardware errors while running. There will always be some and this number will keep going up.

Diffa = This is the percentage of errors or lost shares based on the total.

So HW means hardware errors. According to the image, though, the hardware errors listed per hashboard in the miner section are all zero. That section is supposed to show the total hardware errors for each hashboard, and HW under the pools should be the total hardware errors across all hashboards.

Check the image below for reference.



If the HW (hardware error) count under the hashboard is zero, the hashboard's error detection might be broken.
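
To make the comparison concrete, here is a minimal sketch of the check described above: add up the per-hashboard HW counts and compare the sum with the HW figure under the pools. The function name and the numbers are made up for illustration; they are not taken from the screenshots in this thread or from any miner API.

```python
def hw_gap(pool_hw, board_hw):
    """Return pool HW minus the sum of the per-hashboard HW counts.

    pool_hw  -- HW value shown in the pool section (entered by hand here)
    board_hw -- dict mapping chain number to the HW shown for that hashboard
    """
    return pool_hw - sum(board_hw.values())


# Hypothetical readings: every hashboard reports zero, the pool section does not.
boards = {5: 0, 6: 0, 7: 0}
pool_hw = 1200

gap = hw_gap(pool_hw, boards)
if gap != 0:
    print(f"Pool HW and per-board HW disagree by {gap}")
```

If the two sections really count the same thing, the gap should be zero. A large gap with all boards at zero would fit the idea that the per-board error reporting is not working.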
full member
Activity: 538
Merit: 175
December 07, 2018, 01:11:19 PM
#7
One of my friends mentioned that for some reason the LPM firmware was causing some miners to suddenly appear as if both fans are dead. You could always try running on the non-lpm firmware and see if that makes a difference.
newbie
Activity: 3
Merit: 0
December 06, 2018, 05:43:53 PM
#6
BitMaxz, thank you for that guide!

Unfortunately, I went through every image in the guide and in the comment section and I have yet to find someone with the same issue as mine. I also don't see examples of people whose HW under pool doesn't add up to the HW section at the bottom. Does that make sense? https://drive.google.com/file/d/12p2UpIrtz8zCF3v2tKlxsYFNJSdKIbku/view?usp=sharing

I have yet to see an explanation on what HW in the pool section even means :-(
legendary
Activity: 3500
Merit: 3249
Happy New year 🤗
December 04, 2018, 06:36:56 PM
#5
Code:
do read temp on Chain[5]
Chain[5] Chip[62] TempTypeID=55 middle offset=32
Chain[5] Chip[62] local Temp=60
read failed on Chain[5] Chip[1] middle Temp old value:0
Special fix Chain[5] Chip[1] middle Temp = 15
Done read temp on Chain[5]
do read temp on Chain[6]
Done read temp on Chain[6]
do read temp on Chain[7]
Done read temp on Chain[7]
set FAN speed according to: temp_highest=0 temp_top1[PWM_T]=0 temp_top1[TEMP_POS_LOCAL]=0 temp_change=-62 fix_fan_steps=0
set full FAN speed...
FAN PWM: 100
read_temp_func Done!
CRC error counter=36578

This is a hardware issue. Only chain 5 returns a temp result, and a weird one at that; chains 6 and 7 return no temp result at all. Then, after the temp read and the fan speed setting, it shows a lot of errors: "CRC error counter=36578".
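
If you want to scan a long kernel log for the same pattern, a rough sketch like the one below could help. It only assumes the line formats visible in the excerpt above; the function name `summarize` and the trimmed sample text are mine, and other firmware versions may log differently.

```python
import re

# Sample trimmed from the excerpt above; paste the full kernel log here instead.
LOG = """\
do read temp on Chain[5]
Chain[5] Chip[62] TempTypeID=55 middle offset=32
Chain[5] Chip[62] local Temp=60
read failed on Chain[5] Chip[1] middle Temp old value:0
Special fix Chain[5] Chip[1] middle Temp = 15
Done read temp on Chain[5]
do read temp on Chain[6]
Done read temp on Chain[6]
do read temp on Chain[7]
Done read temp on Chain[7]
CRC error counter=36578
"""


def summarize(log):
    """Collect local temps and failed reads per chain, plus the last CRC counter."""
    chains = {}
    crc = None
    for line in log.splitlines():
        m = re.match(r"do read temp on Chain\[(\d+)\]", line)
        if m:
            # A chain that is polled but never reports stays with empty lists.
            chains.setdefault(int(m.group(1)), {"temps": [], "failed": 0})
            continue
        m = re.match(r"Chain\[(\d+)\] Chip\[\d+\] local Temp=(\d+)", line)
        if m:
            entry = chains.setdefault(int(m.group(1)), {"temps": [], "failed": 0})
            entry["temps"].append(int(m.group(2)))
            continue
        m = re.match(r"read failed on Chain\[(\d+)\]", line)
        if m:
            entry = chains.setdefault(int(m.group(1)), {"temps": [], "failed": 0})
            entry["failed"] += 1
            continue
        m = re.match(r"CRC error counter=(\d+)", line)
        if m:
            crc = int(m.group(1))
    return chains, crc


chains, crc = summarize(LOG)
silent = [c for c, info in chains.items() if not info["temps"]]
print("chains with no local temp reading:", silent)
print("CRC error counter:", crc)
```

Chains that were polled but returned no local temp end up in `silent`, which is the symptom described above for chains 6 and 7.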

If you have a spare S9, try replacing the PSU, both fans, and the cables, then check it again.

Also, check this guide on how to troubleshoot the miner: Possible problems and solutions for S7, S9 (fixed frequency) & L3+
newbie
Activity: 3
Merit: 0
December 03, 2018, 06:57:36 PM
#4
I upgraded the firmware on all the machines when I first set them up. See the details from the overview screen below.

Quote
It's showing you have zero fans in that screenshot, which will cause all kinds of odd problems. I'm not sure how you have an uptime over a few minutes with no fans reporting in.

It loses the fan speed response when it locks up. The uptime doesn't keep ticking past the point when it errors out, the individual ASIC chip readings don't come up, and the fans show no speed. Any of these is a big red flag, but I have no reason to believe that the fan speed reading stopping is what causes the rest of it; I think it is another symptom of whatever is going on here. Anything you can think of that would be causing it? In each of these cases, there are tons of HW in the pool section, but not necessarily in the bottom section where each board reports its errors.

Does anyone know of an S9 status page manual? The user guide isn't very thorough. I can't find anything on the HW section under pools.



Quote
If you can paste your kernel logs here or in paste.com and put the link here, we will try to check whether the miner has a software-related issue or a hardware issue...

Assuming that was the firmware you wanted to see, here is a REALLLLY long log that I hope helps shine some light on the issue. This is so strange!

https://drive.google.com/file/d/1eNrP9_NyB8gPdfHLrDxRWnDiEdd6Qwo8/view?usp=sharing
full member
Activity: 294
Merit: 129
December 03, 2018, 04:41:53 PM
#3
It's showing you have zero fans in that screenshot, which will cause all kinds of odd problems. I'm not sure how you have an uptime over a few minutes with no fans reporting in.
legendary
Activity: 3500
Merit: 3249
Happy New year 🤗
December 03, 2018, 03:09:53 PM
#2
I think you can fix this problem by flashing new firmware. The program might be corrupted, which is why your dashboard gives you buggy results.

If you can paste your kernel logs here or in paste.com and put the link here, we will try to check whether the miner has a software-related issue or a hardware issue. But first, try flashing it with the auto_freq firmware.

You can download the firmware from Bitmain here: https://service.bitmain.com/support/download
newbie
Activity: 3
Merit: 0
December 03, 2018, 02:01:43 PM
#1
This is an error I am seeing quite often now. You will notice the chip status is not returned. Interestingly enough, the temp section that shows 15 is always the same. No idea why 15 is the magic number either. If I click to refresh the miner status, the uptime value doesn't change. It is basically stuck like this with no hashing until I restart it. Then it will work again for some amount of time before usually having the same error a few days later.

You will note the HW errors from the bottom section don't equal the ones in the pool section.

https://drive.google.com/file/d/15XBD89kNZflGXeJJIzD-UlmeKG1Lk_ms/view?usp=sharing