The miner is just over a month old and is showing issues with Chain[0].
The kernel log shows the first error:
temperature.c:697:get_temp_info: read temp sensor failed: chain = 0, sensor = 0, chip = 64, reg = 0
Then it shuts down that chip, and gets stuck in retrying to check temp.
It worked normally a few weeks ago, and even worked with 2/3 chips yesterday with 2/3 hash rate.
I have followed the five steps in the first post on this website but still nothing.
Could anyone help ?
2020-01-15 11:31:28 thread.c:105:pic_heart_beat_thread: chain[0] heart beat fail 5 times.
2020-01-15 11:31:30 power_api.c:86:get_average_voltage: chain[0], voltage is: 0.000000
2020-01-15 11:31:32 power_api.c:86:get_average_voltage: chain[1], voltage is: 17.781328
2020-01-15 11:31:33 temperature.c:697:get_temp_info: read temp sensor failed: chain = 0, sensor = 2, chip = 168, reg = 1
2020-01-15 11:31:35 power_api.c:86:get_average_voltage: chain[2], voltage is: 18.001758
2020-01-15 11:31:35 power_api.c:97:get_average_voltage: aveage voltage is: 11.927695
2020-01-15 11:31:35 power_api.c:110:check_voltage: target_vol = 17.90, actural_vol = 11.93, more than 1.0v diff.
2020-01-15 11:31:36 power_api.c:124:check_voltage_multi: retry time: 6
2020-01-15 11:31:40 temperature.c:697:get_temp_info: read temp sensor failed: chain = 0, sensor = 3, chip = 184, reg = 0
2020-01-15 11:31:48 power_api.c:86:get_average_voltage: chain[0], voltage is: 0.000000
2020-01-15 11:31:50 power_api.c:86:get_average_voltage: chain[1], voltage is: 17.787451
2020-01-15 11:31:53 temperature.c:697:get_temp_info: read temp sensor failed: chain = 0, sensor = 3, chip = 184, reg = 1
2020-01-15 11:31:53 power_api.c:86:get_average_voltage: chain[2], voltage is: 18.014004
2020-01-15 11:31:53 power_api.c:97:get_average_voltage: aveage voltage is: 11.933818
2020-01-15 11:31:53 power_api.c:110:check_voltage: target_vol = 17.90, actural_vol = 11.93, more than 1.0v diff.
2020-01-15 11:31:54 power_api.c:124:check_voltage_multi: retry time: 7
2020-01-15 11:31:54 thread.c:642:check_temperature: over max temp, pcb temp 69 (max 80), chip temp 107(max 103)
2020-01-15 11:31:54 driver-btm-api.c:201:set_miner_status: ERROR_TEMP_TOO_HIGH
2020-01-15 11:31:54 driver-btm-api.c:142:stop_mining: stop mining: over max temp
2020-01-15 11:31:54 thread.c:824:cancel_temperature_monitor_thread: cancel thread
2020-01-15 11:31:54 thread.c:834:cancel_read_nonce_reg_thread: cancel thread
2020-01-15 11:31:54 driver-btm-api.c:128:killall_hashboard: ****power off hashboard****
2020-01-15 11:31:56 thread.c:105:pic_heart_beat_thread: chain[0] heart beat fail 6 times.
2020-01-15 11:32:10 power_api.c:86:get_average_voltage: chain[0], voltage is: 0.000000
2020-01-15 11:32:12 power_api.c:86:get_average_voltage: chain[1], voltage is: 17.775205
2020-01-15 11:32:15 power_api.c:86:get_average_voltage: chain[2], voltage is: 18.007881
2020-01-15 11:32:15 power_api.c:97:get_average_voltage: aveage voltage is: 11.927695
2020-01-15 11:32:15 power_api.c:110:check_voltage: target_vol = 17.90, actural_vol = 11.93, more than 1.0v diff.
2020-01-15 11:32:16 power_api.c:124:check_voltage_multi: retry time: 8
2020-01-15 11:32:29 thread.c:105:pic_heart_beat_thread: chain[0] heart beat fail 7 times.
2020-01-15 11:32:30 power_api.c:86:get_average_voltage: chain[0], voltage is: 0.000000
2020-01-15 11:32:32 power_api.c:86:get_average_voltage: chain[1], voltage is: 17.799697
2020-01-15 11:32:35 power_api.c:86:get_average_voltage: chain[2], voltage is: 18.050742
2020-01-15 11:32:35 power_api.c:97:get_average_voltage: aveage voltage is: 11.950146
2020-01-15 11:32:35 power_api.c:110:check_voltage: target_vol = 17.90, actural_vol = 11.95, more than 1.0v diff.
2020-01-15 11:32:36 power_api.c:124:check_voltage_multi: retry time: 9
2020-01-15 11:32:48 power_api.c:86:get_average_voltage: chain[0], voltage is: 0.000000
2020-01-15 11:32:50 power_api.c:86:get_average_voltage: chain[1], voltage is: 17.799697
2020-01-15 11:32:53 power_api.c:86:get_average_voltage: chain[2], voltage is: 18.050742
2020-01-15 11:32:53 power_api.c:97:get_average_voltage: aveage voltage is: 11.950146
2020-01-15 11:32:53 power_api.c:110:check_voltage: target_vol = 17.90, actural_vol = 11.95, more than 1.0v diff.
2020-01-15 11:32:54 power_api.c:124:check_voltage_multi: retry time: 10
2020-01-15 11:32:56 thread.c:105:pic_heart_beat_thread: chain[0] heart beat fail 8 times.
2020-01-15 11:33:07 power_api.c:86:get_average_voltage: chain[0], voltage is: 0.000000
2020-01-15 11:33:08 power_api.c:86:get_average_voltage: chain[1], voltage is: 17.799697
2020-01-15 11:33:10 power_api.c:86:get_average_voltage: chain[2], voltage is: 18.050742
2020-01-15 11:33:10 power_api.c:97:get_average_voltage: aveage voltage is: 11.950146
2020-01-15 11:33:10 power_api.c:110:check_voltage: target_vol = 17.90, actural_vol = 11.95, more than 1.0v diff.
2020-01-15 11:33:11 power_api.c:124:check_voltage_multi: retry time: 11
UPDATE:
I disassembled the miner and disconnected the power from Chain [2], now the miner works fine, including Chain[0] which had the problem before, does this mean that there is a power problem ?