Author

Topic: Fault on Antminer S17e (Read 205 times)

member
Activity: 166
Merit: 82
EET/NASA intern 2013 Bitmain/MicroBT/IPC cert
July 26, 2022, 01:44:46 PM
#6
To move forward we need to know if the power supply is outputting the required 20V. The control board puts out an additional  3.3V through the data cable to power the pic which collects the temperature data via I2C. And almost finally, the control board is powered from the power supply by the 6 wire 12V line. Plenty of failure points for this error. Let’s start with the first. Do you have a voltage meter?
jr. member
Activity: 56
Merit: 12
July 24, 2022, 10:23:33 PM
#5
I was wondering if you tried replacing the control board? The temperature sensing circuit of the Antminer 17 series hash board is in the form of I2C serial bus, which is powered by the control board. Therefore, if the control board fails, it will not be able to supply power to the temperature sensor, and the temperature sensor parameters will not be collected.
legendary
Activity: 4102
Merit: 7765
'The right to privacy matters'
June 14, 2022, 09:22:53 AM
#4
pull two boards and try with one board.

see if that works.

if it does thank the good board out place it in a safe place

go to board two. run it and see if it works.

if it works take it out and place in a safe place.

go to board three run it and see if it works.

if it works the issue is either over heating or weak psu.


so try two boards see if it works.

so if two boards work settle for that and sell the third board.


my guess is nothing will work and the psu is bad.

also look long and hard at fans on the psu maybe one is failing or failed.
newbie
Activity: 4
Merit: 9
June 14, 2022, 09:03:11 AM
#3
This may be a fault caused by the PSU. It is recommended to try another PSU.
legendary
Activity: 3234
Merit: 2943
Block halving is coming.
May 14, 2022, 08:03:22 PM
#2
The fault might be overheating that is why the sensor is lost based on the kernel logs you posted above.

What I would you to do is try to flash it first with the latest firmware available from Bitmain here https://shop.bitmain.com/support/download

Then post the result here if the issue is still there I suggest you to try test all hashboard one by one you can follow the guide from Bitmain here "Test hash board one by one Guide"
newbie
Activity: 1
Merit: 0
May 14, 2022, 08:55:43 AM
#1
Hi guys,

I've been advised to look for help here on issues with my S17e miner. As I power it up, it works for a period of time, then stops hashing. It sometimes starts hashing again for a short period of time sometimes it doesn't. If I switch the power off and turn it back on it starts hashing again. I have copied parts of the Kernel log bellow, maybe you you will be able to advise me with potential causes of this issue




chain 2 domain 12:   d0 0.311,   d1 0.309,   d2 0.310,   d3 0.308,   sum = 1.238241
chain 2 domain 13:   d0 0.312,   d1 0.307,   d2 0.309,   d3 0.310,   sum = 1.237549
chain 2 domain 14:   d0 0.314,   d1 0.312,   d2 0.313,   d3 0.314,   sum = 1.254110
2022-05-12 09:05:08:driver-hash-chip.c:493:check_adc_voltage: PASS domainn volt check: request 0.800 (index 0, open_core)
2022-05-12 09:05:08:frequency.c:442:inc_asic_diff_freq_by_steps: chain = 0, start = 365, freq_step = 5
2022-05-12 09:05:19:frequency.c:442:inc_asic_diff_freq_by_steps: chain = 1, start = 395, freq_step = 5
2022-05-12 09:05:26:frequency.c:442:inc_asic_diff_freq_by_steps: chain = 2, start = 365, freq_step = 5
2022-05-12 09:05:37:driver-btm-api.c:642:set_timeout: freq = 425, percent = 90, hcn = 235928, timeout = 554
2022-05-12 09:05:37:power_api.c:353:set_to_working_voltage_by_steps: Set to voltage raw 1860, step by step.
2022-05-12 09:05:53:thread.c:1515:create_check_system_status_thread: create thread
2022-05-12 09:05:53:driver-btm-api.c:2571:bitmain_soc_init: Init done!
2022-05-12 09:05:59:driver-btm-api.c:247:set_miner_status: STATUS_OKAY
2022-05-12 09:06:00:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 62323
2022-05-12 09:06:00:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 60000
2022-05-12 09:06:02:driver-btm-api.c:1610:dhash_chip_send_job: Version num 8
2022-05-12 09:06:02:driver-btm-api.c:1754:dhash_chip_send_job: stime.tv_sec 1652346362, block_ntime 1652346251
2022-05-12 09:36:37:thread.c:233:calc_hashrate_avg: avg rate is 62533.67 in 30 mins
2022-05-12 09:36:37:temperature.c:440:temp_statistics_show:   pcb temp 42~67  chip temp 58~71
2022-05-12 10:07:14:thread.c:233:calc_hashrate_avg: avg rate is 62541.08 in 30 mins
2022-05-12 10:07:14:temperature.c:440:temp_statistics_show:   pcb temp 43~67  chip temp 59~72
2022-05-12 10:37:46:thread.c:233:calc_hashrate_avg: avg rate is 62595.34 in 30 mins
2022-05-12 10:37:46:temperature.c:440:temp_statistics_show:   pcb temp 46~71  chip temp 61~75
2022-05-12 11:08:16:thread.c:233:calc_hashrate_avg: avg rate is 62737.02 in 30 mins
2022-05-12 11:08:16:temperature.c:440:temp_statistics_show:   pcb temp 46~70  chip temp 60~75
2022-05-12 11:38:49:thread.c:233:calc_hashrate_avg: avg rate is 63035.07 in 30 mins
2022-05-12 11:38:49:temperature.c:440:temp_statistics_show:   pcb temp 46~72  chip temp 62~76
2022-05-12 11:48:12:thread.c:1136:asic_status_monitor_thread: ERROR: chain 0 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:12:thread.c:1136:asic_status_monitor_thread: ERROR: chain 1 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:12:thread.c:1136:asic_status_monitor_thread: ERROR: chain 2 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 0, chip = 27, reg = 0
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 0, chip = 27, reg = 1
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 1, chip = 35, reg = 0
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 1, chip = 35, reg = 1
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 2, chip = 99, reg = 0
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 2, chip = 99, reg = 1
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 3, chip = 107, reg = 0
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 3, chip = 107, reg = 1
2022-05-12 11:48:13:temperature.c:754:get_temp_info: ERROR: chain 0 can get NONE temp info or temp value abnormal, power it off
2022-05-12 11:48:13:thread.c:1172:asic_status_monitor_thread: chain 0 can't get enough hashrate reg val for 0 times.
2022-05-12 11:48:13:thread.c:1172:asic_status_monitor_thread: chain 1 can't get enough hashrate reg val for 0 times.
2022-05-12 11:48:13:thread.c:1172:asic_status_monitor_thread: chain 2 can't get enough hashrate reg val for 0 times.
2022-05-12 11:48:13:driver-hash-chip.c:591:recalc_invalid_volt: chain 0 domain 00 column 00 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:13:driver-hash-chip.c:591:recalc_invalid_volt: chain 0 domain 00 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.





2022-05-12 11:48:16:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 14 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:16:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 14 column 02 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:16:thread.c:1172:asic_status_monitor_thread: chain 0 can't get enough hashrate reg val for 1 times.
2022-05-12 11:48:16:thread.c:1172:asic_status_monitor_thread: chain 1 can't get enough hashrate reg val for 1 times.
2022-05-12 11:48:16:thread.c:1172:asic_status_monitor_thread: chain 2 can't get enough hashrate reg val for 1 times.
2022-05-12 11:48:16:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 14 column 03 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:16:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 14 column 04 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.






2022-05-12 11:48:16:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 12 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:16:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 13 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:16:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 14 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:17:thread.c:1136:asic_status_monitor_thread: ERROR: chain 0 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:17:thread.c:1136:asic_status_monitor_thread: ERROR: chain 1 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:17:thread.c:1136:asic_status_monitor_thread: ERROR: chain 2 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:17:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 41848
2022-05-12 11:48:17:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 40000
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 0, chip = 27, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 0, chip = 27, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 1, chip = 35, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 1, chip = 35, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 2, chip = 99, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 2, chip = 99, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 3, chip = 107, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 3, chip = 107, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 0, chip = 27, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 0, chip = 27, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 1, chip = 35, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 1, chip = 35, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 2, chip = 99, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 2, chip = 99, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 3, chip = 107, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 3, chip = 107, reg = 1
2022-05-12 11:48:18:temperature.c:754:get_temp_info: ERROR: chain 1 can get NONE temp info or temp value abnormal, power it off
2022-05-12 11:48:18:thread.c:1172:asic_status_monitor_thread: chain 1 can't get enough hashrate reg val for 2 times.
2022-05-12 11:48:18:thread.c:1172:asic_status_monitor_thread: chain 2 can't get enough hashrate reg val for 2 times.
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 1 domain 00 column 00 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 1 domain 00 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.






chain 1 domain 12: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 1 domain 13: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 1 domain 14: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
2022-05-12 11:48:19:thread.c:1136:asic_status_monitor_thread: ERROR: chain 1 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:19:thread.c:1136:asic_status_monitor_thread: ERROR: chain 2 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 00 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 02 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.







2022-05-12 11:48:20:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 12 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:20:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 13 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:20:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 14 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:21:temperature.c:754:get_temp_info: ERROR: chain 2 can get NONE temp info or temp value abnormal, power it off
2022-05-12 11:48:21:thread.c:1172:asic_status_monitor_thread: chain 2 can't get enough hashrate reg val for 3 times.
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 00 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 20475
2022-05-12 11:48:21:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 20000
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 02 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 03 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 04 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.







2022-05-12 11:48:21:driver-hash-chip.c:707:dump_adc_voltage_v2: chain 2 domain 14 core_domain 1: all asic timeout or overrange.
2022-05-12 11:48:21:driver-hash-chip.c:707:dump_adc_voltage_v2: chain 2 domain 14 core_domain 2: all asic timeout or overrange.
2022-05-12 11:48:21:driver-hash-chip.c:707:dump_adc_voltage_v2: chain 2 domain 14 core_domain 3: all asic timeout or overrange.
2022-05-12 11:48:21:driver-hash-chip.c:737:dump_adc_voltage_v2: get ADC_DATAOUT from chain 2 with 540 regs timeout.
chain 2 domain  0: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  1: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  2: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  3: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  4: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  5: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  6: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  7: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  8: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain  9: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 10: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 11: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 12: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 13: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 14: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 00 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 01 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 02 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 03 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 04 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 05 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 06 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 07 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 08 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 09 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 10 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 11 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 12 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 13 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 14 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:23:driver-btm-api.c:247:set_miner_status: ERROR_TEMP_LOST
2022-05-12 11:48:23:driver-btm-api.c:176:stop_mining: stop mining: no chain exists, maybe caused by sensor lost

2022-05-12 11:48:23:thread.c:1588:cancel_check_miner_status_thread: cancel thread
2022-05-12 11:48:23:thread.c:1583:cancel_check_system_status_thread: cancel thread
2022-05-12 11:48:23:thread.c:1572:cancel_read_nonce_reg_thread: cancel thread
2022-05-12 11:48:23:thread.c:1593:cancel_asic_status_monitor_thread: cancel thread
2022-05-12 11:48:23:driver-btm-api.c:147:killall_hashboard: ****power off hashboard****
2022-05-12 11:48:23:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 0
2022-05-12 11:48:23:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 0
2022-05-12 11:48:24:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 0
2022-05-12 11:48:24:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 0
2022-05-12 11:48:25:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 0
2022-05-12 11:48:25:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 0
2022-05-12 11:48:26:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 0
Jump to: