Hello guys, I want to post my problem here to see if anybody can help me.
I bought 2 Antminer S9 on February. The machines were working perfectly until last weekend I unplugged them to move them and install them in another location. When I connected the two machines, the machine 1 works good but the machine 2 wasn't working, I was getting an error with the FAN num 2.
I started to troubleshooting the machine 2 and I started by switching the FAN 1 and 2 and then the problem moves to FAN num 1. So I thought it was the FAN. I bought a new FAN, installed the new FAN and I'm still getting the same error. I even switched the FAN from the working machine 1 to the not working machine 2, and I'm having the same error with the FAN in the machine 2, and the FAN in the machine 1 (originally from machine 2) are working normally and the machine is working at this moment.
I don't know if there is anything else I can try to fix the problem, or I just need to send it to support. I saw several topics related with this problem, and I read a lot of posts but with any result. I don't know if anybody can help me with this problem.
I'm attaching the LOG of the kernel from the S9 with the problem.
[ 0.000000] Booting Linux on physical CPU 0x0
[ 0.000000] Linux version 3.14.0-xilinx-ge8a2f71-dirty (lzq@armdev2) (gcc version 4.8.3 20140320 (prerelease) (Sourcery CodeBench Lite 2014.05-23) ) #82 SMP PREEMPT Tue May 16 19:49:53 CST 2017
[ 0.000000] CPU: ARMv7 Processor [413fc090] revision 0 (ARMv7), cr=18c5387d
[ 0.000000] CPU: PIPT / VIPT nonaliasing data cache, VIPT aliasing instruction cache
[ 0.000000] Machine model: Xilinx Zynq
[ 0.000000] cma: CMA: reserved 128 MiB at 16800000
[ 0.000000] Memory policy: Data cache writealloc
[ 0.000000] On node 0 totalpages: 126976
[ 0.000000] free_area_init_node: node 0, pgdat c0740a40, node_mem_map debd8000
[ 0.000000] Normal zone: 992 pages used for memmap
[ 0.000000] Normal zone: 0 pages reserved
[ 0.000000] Normal zone: 126976 pages, LIFO batch:31
[ 0.000000] PERCPU: Embedded 8 pages/cpu @debc1000 s9088 r8192 d15488 u32768
[ 0.000000] pcpu-alloc: s9088 r8192 d15488 u32768 alloc=8*4096
[ 0.000000] pcpu-alloc: [0] 0 [0] 1
[ 0.000000] Built 1 zonelists in Zone order, mobility grouping on. Total pages: 125984
[ 0.000000] Kernel command line: noinitrd mem=496M console=ttyPS0,115200 root=ubi0:rootfs ubi.mtd=1 rootfstype=ubifs rw rootwait
[ 0.000000] PID hash table entries: 2048 (order: 1, 8192 bytes)
[ 0.000000] Dentry cache hash table entries: 65536 (order: 6, 262144 bytes)
[ 0.000000] Inode-cache hash table entries: 32768 (order: 5, 131072 bytes)
[ 0.000000] Memory: 364356K/507904K available (5032K kernel code, 283K rwdata, 1916K rodata, 204K init, 258K bss, 143548K reserved, 0K highmem)
[ 0.000000] Virtual kernel memory layout:
[ 0.000000] vector : 0xffff0000 - 0xffff1000 ( 4 kB)
[ 0.000000] fixmap : 0xfff00000 - 0xfffe0000 ( 896 kB)
[ 0.000000] vmalloc : 0xdf800000 - 0xff000000 ( 504 MB)
[ 0.000000] lowmem : 0xc0000000 - 0xdf000000 ( 496 MB)
[ 0.000000] pkmap : 0xbfe00000 - 0xc0000000 ( 2 MB)
[ 0.000000] modules : 0xbf000000 - 0xbfe00000 ( 14 MB)
[ 0.000000] .text : 0xc0008000 - 0xc06d1374 (6949 kB)
[ 0.000000] .init : 0xc06d2000 - 0xc0705380 ( 205 kB)
[ 0.000000] .data : 0xc0706000 - 0xc074cf78 ( 284 kB)
[ 0.000000] .bss : 0xc074cf84 - 0xc078d9fc ( 259 kB)
[ 0.000000] Preemptible hierarchical RCU implementation.
[ 0.000000] Dump stacks of tasks blocking RCU-preempt GP.
[ 0.000000] RCU restricting CPUs from NR_CPUS=4 to nr_cpu_ids=2.
[ 0.000000] RCU: Adjusting geometry for rcu_fanout_leaf=16, nr_cpu_ids=2
[ 0.000000] NR_IRQS:16 nr_irqs:16 16
[ 0.000000] ps7-slcr mapped to df802000
[ 0.000000] zynq_clock_init: clkc starts at df802100
[ 0.000000] Zynq clock init
[ 0.000015] sched_clock: 64 bits at 333MHz, resolution 3ns, wraps every 3298534883328ns
[ 0.000310] ps7-ttc #0 at df804000, irq=43
[ 0.000617] Console: colour dummy device 80x30
[ 0.000649] Calibrating delay loop... 1325.46 BogoMIPS (lpj=6627328)
[ 0.040207] pid_max: default: 32768 minimum: 301
[ 0.040428] Mount-cache hash table entries: 1024 (order: 0, 4096 bytes)
[ 0.040448] Mountpoint-cache hash table entries: 1024 (order: 0, 4096 bytes)
[ 0.042576] CPU: Testing write buffer coherency: ok
[ 0.042911] CPU0: thread -1, cpu 0, socket 0, mpidr 80000000
[ 0.042971] Setting up static identity map for 0x4c4b00 - 0x4c4b58
[ 0.043196] L310 cache controller enabled
[ 0.043215] l2x0: 8 ways, CACHE_ID 0x410000c8, AUX_CTRL 0x72760000, Cache size: 512 kB
[ 0.121035] CPU1: Booted secondary processor
[ 0.210228] CPU1: thread -1, cpu 1, socket 0, mpidr 80000001
[ 0.210358] Brought up 2 CPUs
[ 0.210378] SMP: Total of 2 processors activated.
[ 0.210386] CPU: All CPU(s) started in SVC mode.
[ 0.211047] devtmpfs: initialized
[ 0.213464] VFP support v0.3: implementor 41 architecture 3 part 30 variant 9 rev 4
[ 0.214674] regulator-dummy: no parameters
[ 0.221941] NET: Registered protocol family 16
[ 0.224130] DMA: preallocated 256 KiB pool for atomic coherent allocations
[ 0.226431] cpuidle: using governor ladder
[ 0.226444] cpuidle: using governor menu
[ 0.233862] syscon f8000000.ps7-slcr: regmap [mem 0xf8000000-0xf8000fff] registered
[ 0.235390] hw-breakpoint: found 5 (+1 reserved) breakpoint and 1 watchpoint registers.
[ 0.235404] hw-breakpoint: maximum watchpoint size is 4 bytes.
[ 0.235517] zynq-ocm f800c000.ps7-ocmc: ZYNQ OCM pool: 256 KiB @ 0xdf880000
[ 0.257200] bio: create slab at 0
[ 0.258630] vgaarb: loaded
[ 0.259324] SCSI subsystem initialized
[ 0.260973] usbcore: registered new interface driver usbfs
[ 0.261339] usbcore: registered new interface driver hub
[ 0.261572] usbcore: registered new device driver usb
[ 0.262097] media: Linux media interface: v0.10
[ 0.262252] Linux video capture interface: v2.00
[ 0.262497] pps_core: LinuxPPS API ver. 1 registered
[ 0.262509] pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti
[ 0.262632] PTP clock support registered
[ 0.262991] EDAC MC: Ver: 3.0.0
[ 0.264022] Advanced Linux Sound Architecture Driver Initialized.
[ 0.266868] DMA-API: preallocated 4096 debug entries
[ 0.266882] DMA-API: debugging enabled by kernel config
[ 0.266954] Switched to clocksource arm_global_timer
[ 0.287227] NET: Registered protocol family 2
[ 0.288174] TCP established hash table entries: 4096 (order: 2, 16384 bytes)
[ 0.288231] TCP bind hash table entries: 4096 (order: 3, 32768 bytes)
[ 0.288316] TCP: Hash tables configured (established 4096 bind 4096)
[ 0.288363] TCP: reno registered
[ 0.288381] UDP hash table entries: 256 (order: 1, 8192 bytes)
[ 0.288412] UDP-Lite hash table entries: 256 (order: 1, 8192 bytes)
[ 0.288650] NET: Registered protocol family 1
[ 0.289001] RPC: Registered named UNIX socket transport module.
[ 0.289013] RPC: Registered udp transport module.
[ 0.289022] RPC: Registered tcp transport module.
[ 0.289030] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 0.289043] PCI: CLS 0 bytes, default 64
[ 0.289485] hw perfevents: enabled with ARMv7 Cortex-A9 PMU driver, 7 counters available
[ 0.291504] futex hash table entries: 512 (order: 3, 32768 bytes)
[ 0.293599] jffs2: version 2.2. (NAND) © 2001-2006 Red Hat, Inc.
[ 0.293790] msgmni has been set to 967
[ 0.294566] io scheduler noop registered
[ 0.294578] io scheduler deadline registered
[ 0.294615] io scheduler cfq registered (default)
[ 0.307382] dma-pl330 f8003000.ps7-dma: Loaded driver for PL330 DMAC-2364208
[ 0.307402] dma-pl330 f8003000.ps7-dma: DBUFF-128x8bytes Num_Chans-8 Num_Peri-4 Num_Events-16
[ 0.432360] e0001000.serial: ttyPS0 at MMIO 0xe0001000 (irq = 82, base_baud = 3124999) is a xuartps
[ 1.000285] console [ttyPS0] enabled
[ 1.004547] xdevcfg f8007000.ps7-dev-cfg: ioremap 0xf8007000 to df866000
[ 1.012193] [drm] Initialized drm 1.1.0 20060810
[ 1.029126] brd: module loaded
[ 1.038497] loop: module loaded
[ 1.048348] e1000e: Intel(R) PRO/1000 Network Driver - 2.3.2-k
[ 1.054101] e1000e: Copyright(c) 1999 - 2013 Intel Corporation.
[ 1.062065] libphy: XEMACPS mii bus: probed
[ 1.066436] ------------- phy_id = 0x3625e62
[ 1.071219] xemacps e000b000.ps7-ethernet: pdev->id -1, baseaddr 0xe000b000, irq 54
[ 1.079833] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
[ 1.086463] ehci-pci: EHCI PCI platform driver
[ 1.093746] zynq-dr e0002000.ps7-usb: Unable to init USB phy, missing?
[ 1.100590] usbcore: registered new interface driver usb-storage
[ 1.107472] mousedev: PS/2 mouse device common for all mice
[ 1.113561] i2c /dev entries driver
[ 1.120483] zynq-edac f8006000.ps7-ddrc: ecc not enabled
[ 1.125963] cpufreq_cpu0: failed to get cpu0 regulator: -19
[ 1.131888] Xilinx Zynq CpuIdle Driver started
[ 1.136740] sdhci: Secure Digital Host Controller Interface driver
[ 1.142947] sdhci: Copyright(c) Pierre Ossman
[ 1.147244] sdhci-pltfm: SDHCI platform and OF driver helper
[ 1.154021] mmc0: no vqmmc regulator found
[ 1.158083] mmc0: no vmmc regulator found
[ 1.196972] mmc0: SDHCI controller on e0100000.ps7-sdio [e0100000.ps7-sdio] using ADMA
[ 1.205668] usbcore: registered new interface driver usbhid
[ 1.211186] usbhid: USB HID core driver
[ 1.215890] nand: device found, Manufacturer ID: 0x2c, Chip ID: 0xda
[ 1.222180] nand: Micron MT29F2G08ABAEAWP
[ 1.226150] nand: 256MiB, SLC, page size: 2048, OOB size: 64
[ 1.232101] Bad block table found at page 131008, version 0x01
[ 1.238349] Bad block table found at page 130944, version 0x01
[ 1.244404] 3 ofpart partitions found on MTD device pl353-nand
[ 1.250175] Creating 3 MTD partitions on "pl353-nand":
[ 1.255275] 0x000000000000-0x000002000000 : "BOOT.bin-env-dts-kernel"
[ 1.263360] 0x000002000000-0x00000b000000 : "angstram-rootfs"
[ 1.270687] 0x00000b000000-0x000010000000 : "upgrade-rootfs"
[ 1.279641] TCP: cubic registered
[ 1.282875] NET: Registered protocol family 17
[ 1.287593] Registering SWP/SWPB emulation handler
[ 1.293466] regulator-dummy: disabling
[ 1.297867] UBI: attaching mtd1 to ubi0
[ 1.824513] UBI: scanning is finished
[ 1.836147] UBI: attached mtd1 (name "angstram-rootfs", size 144 MiB) to ubi0
[ 1.843230] UBI: PEB size: 131072 bytes (128 KiB), LEB size: 126976 bytes
[ 1.849992] UBI: min./max. I/O unit sizes: 2048/2048, sub-page size 2048
[ 1.856656] UBI: VID header offset: 2048 (aligned 2048), data offset: 4096
[ 1.863533] UBI: good PEBs: 1152, bad PEBs: 0, corrupted PEBs: 0
[ 1.869515] UBI: user volume: 1, internal volumes: 1, max. volumes count: 128
[ 1.876622] UBI: max/mean erase counter: 2/0, WL threshold: 4096, image sequence number: 1283732989
[ 1.885665] UBI: available PEBs: 0, total reserved PEBs: 1152, PEBs reserved for bad PEB handling: 40
[ 1.894880] UBI: background thread "ubi_bgt0d" started, PID 1080
[ 1.894885] drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
[ 1.898842] ALSA device list:
[ 1.898846] No soundcards found.
[ 1.915258] UBIFS: background thread "ubifs_bgt0_0" started, PID 1082
[ 1.944427] UBIFS: recovery needed
[ 2.028011] UBIFS: recovery completed
[ 2.031681] UBIFS: mounted UBI device 0, volume 0, name "rootfs"
[ 2.037628] UBIFS: LEB size: 126976 bytes (124 KiB), min./max. I/O unit sizes: 2048 bytes/2048 bytes
[ 2.046724] UBIFS: FS size: 128626688 bytes (122 MiB, 1013 LEBs), journal size 9023488 bytes (8 MiB, 72 LEBs)
[ 2.056626] UBIFS: reserved for root: 0 bytes (0 KiB)
[ 2.061666] UBIFS: media format: w4/r0 (latest is w4/r0), UUID C72F8006-6DFF-46BA-BBE9-380960A89F92, small LPT model
[ 2.072943] VFS: Mounted root (ubifs filesystem) on device 0:11.
[ 2.080367] devtmpfs: mounted
[ 2.083474] Freeing unused kernel memory: 204K (c06d2000 - c0705000)
[ 2.922498] random: dd urandom read with 0 bits of entropy available
[ 3.316977]
[ 3.316977] bcm54xx_config_init
[ 3.927000]
[ 3.927000] bcm54xx_config_init
[ 6.927830] xemacps e000b000.ps7-ethernet: Set clk to 24999999 Hz
[ 6.933921] xemacps e000b000.ps7-ethernet: link up (100/FULL)
[ 22.625708] In axi fpga driver!
[ 22.628810] request_mem_region OK!
[ 22.632164] AXI fpga dev virtual address is 0xdf9fc000
[ 22.637306] *base_vir_addr = 0x8c510
[ 22.652713] In fpga mem driver!
[ 22.655785] request_mem_region OK!
[ 22.659380] fpga mem virtual address is 0xe2000000
[ 23.446439]
[ 23.446439] bcm54xx_config_init
[ 24.076379]
[ 24.076379] bcm54xx_config_init
[ 27.077062] xemacps e000b000.ps7-ethernet: Set clk to 24999999 Hz
[ 27.083080] xemacps e000b000.ps7-ethernet: link up (100/FULL)
This is XILINX board. Totalram: 507527168
Detect 512MB control board of XILINX
Find Force Freq=600 will search Freq again!
DETECT HW version=0008c510
miner ID : 81234440c6650881c
config file found, will disable freq setting.
Miner Type = S9
AsicType = 1387
real AsicNum = 63
Find /tmp/searcherror: F:1 The last log is below:
This is XILINX board. Totalram: 507527168
Detect 512MB control board of XILINX
Find Force Freq=600 will search Freq again!
DETECT HW version=0008c510
miner ID : 81234440c6650881c
config file found, will disable freq setting.
Miner Type = S9
AsicType = 1387
real AsicNum = 63
use critical mode to search freq...
get PLUG ON=0x000000e0
Find hashboard on Chain[5]
Find hashboard on Chain[6]
Find hashboard on Chain[7]
set_reset_allhashboard = 0x0000ffff
Check chain[5] PIC fw version=0x03
Check chain[6] PIC fw version=0x03
Check chain[7] PIC fw version=0x03
chain[5]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
Start search freq on chain[5]...
Check chain[5] PIC fw version=0x03
chain[6]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
Start search freq on chain[6]...
Check chain[6] PIC fw version=0x03
chain[7]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
Start search freq on chain[7]...
Check chain[7] PIC fw version=0x03
get PIC voltage=91 on chain[5], value=890
get PIC voltage=91 on chain[6], value=890
get PIC voltage=91 on chain[7], value=890
set voltage=870 search freq on chain[5]
now set pic voltage=125 on chain[5]
set voltage=870 search freq on chain[6]
now set pic voltage=125 on chain[6]
set voltage=870 search freq on chain[7]
now set pic voltage=125 on chain[7]
enable_pic_dac on chain[5]
enable_pic_dac on chain[6]
enable_pic_dac on chain[7]
set_reset_allhashboard = 0x00000000
chain[5] temp offset record: 62,0,0,0,0,0,35,28
chain[5] temp chip I2C addr=0x98
chain[5] has no middle temp, use special fix mode.
chain[6] temp offset record: 62,0,0,0,0,0,35,28
chain[6] temp chip I2C addr=0x98
chain[6] has no middle temp, use special fix mode.
chain[7] temp offset record: 62,0,0,0,0,0,35,28
chain[7] temp chip I2C addr=0x98
chain[7] has no middle temp, use special fix mode.
set_reset_allhashboard = 0x0000ffff
set_reset_allhashboard = 0x00000000
CRC error counter=0
set command mode to VIL
--- check asic number
check chain[5]: asicNum = 63
check chain[6]: asicNum = 63
check chain[7]: asicNum = 63
After Get ASIC NUM CRC error counter=0
set_baud=0
SEARCH_FREQ_TEST_LEVEL mode set freq=650 voltage=870 on chain[5]
SEARCH_FREQ_TEST_LEVEL mode set freq=650 voltage=870 on chain[6]
SEARCH_FREQ_TEST_LEVEL mode set freq=650 voltage=870 on chain[7]
The min freq=650
set real timeout 56, need sleep=408576
start send works on chain[5]
start send works on chain[6]
start send works on chain[7]
get send work num :57456 on Chain[5]
get send work num :57456 on Chain[6]
get send work num :57456 on Chain[7]
wait recv nonce on chain[5]
wait recv nonce on chain[6]
wait recv nonce on chain[7]
get nonces on chain[5]
require nonce number:912
require validnonce number:57456
freq[00]=650 freq[01]=650 freq[02]=650 freq[03]=650 freq[04]=650 freq[05]=650 freq[06]=650 freq[07]=650
freq[08]=650 freq[09]=650 freq[10]=650 freq[11]=650 freq[12]=650 freq[13]=650 freq[14]=650 freq[15]=650
freq[16]=650 freq[17]=650 freq[18]=650 freq[19]=650 freq[20]=650 freq[21]=650 freq[22]=650 freq[23]=650
freq[24]=650 freq[25]=650 freq[26]=650 freq[27]=650 freq[28]=650 freq[29]=650 freq[30]=650 freq[31]=650
freq[32]=650 freq[33]=650 freq[34]=650 freq[35]=650 freq[36]=650 freq[37]=650 freq[38]=650 freq[39]=650
freq[40]=650 freq[41]=650 freq[42]=650 freq[43]=650 freq[44]=650 freq[45]=650 freq[46]=650 freq[47]=650
freq[48]=650 freq[49]=650 freq[50]=650 freq[51]=650 freq[52]=650 freq[53]=650 freq[54]=650 freq[55]=650
freq[56]=650 freq[57]=650 freq[58]=650 freq[59]=650 freq[60]=650 freq[61]=650 freq[62]=650
total valid nonce number:56665
total send work number:57456
require valid nonce number:57456
repeated_nonce_num:0
err_nonce_num:13803
last_nonce_num:2890
get nonces on chain[6]
require nonce number:912
require validnonce number:57456
freq[00]=650 freq[01]=650 freq[02]=650 freq[03]=650 freq[04]=650 freq[05]=650 freq[06]=650 freq[07]=650
freq[08]=650 freq[09]=650 freq[10]=650 freq[11]=650 freq[12]=650 freq[13]=650 freq[14]=650 freq[15]=650
freq[16]=650 freq[17]=650 freq[18]=650 freq[19]=650 freq[20]=650 freq[21]=650 freq[22]=650 freq[23]=650
freq[24]=650 freq[25]=650 freq[26]=650 freq[27]=650 freq[28]=650 freq[29]=650 freq[30]=650 freq[31]=650
freq[32]=650 freq[33]=650 freq[34]=650 freq[35]=650 freq[36]=650 freq[37]=650 freq[38]=650 freq[39]=650
freq[40]=650 freq[41]=650 freq[42]=650 freq[43]=650 freq[44]=650 freq[45]=650 freq[46]=650 freq[47]=650
freq[48]=650 freq[49]=650 freq[50]=650 freq[51]=650 freq[52]=650 freq[53]=650 freq[54]=650 freq[55]=650
freq[56]=650 freq[57]=650 freq[58]=650 freq[59]=650 freq[60]=650 freq[61]=650 freq[62]=650
total valid nonce number:57216
total send work number:57456
require valid nonce number:57456
repeated_nonce_num:0
err_nonce_num:13966
last_nonce_num:2937
get nonces on chain[7]
require nonce number:912
require validnonce number:57456
freq[00]=650 freq[01]=650 freq[02]=650 freq[03]=650 freq[04]=650 freq[05]=650 freq[06]=650 freq[07]=650
freq[08]=650 freq[09]=650 freq[10]=650 freq[11]=650 freq[12]=650 freq[13]=650 freq[14]=650 freq[15]=650
freq[16]=650 freq[17]=650 freq[18]=650 freq[19]=650 freq[20]=650 freq[21]=650 freq[22]=650 freq[23]=650
freq[24]=650 freq[25]=650 freq[26]=650 freq[27]=650 freq[28]=650 freq[29]=650 freq[30]=650 freq[31]=650
freq[32]=650 freq[33]=650 freq[34]=650 freq[35]=650 freq[36]=650 freq[37]=650 freq[38]=650 freq[39]=650
freq[40]=650 freq[41]=650 freq[42]=650 freq[43]=650 freq[44]=650 freq[45]=650 freq[46]=650 freq[47]=650
freq[48]=650 freq[49]=650 freq[50]=650 freq[51]=650 freq[52]=650 freq[53]=650 freq[54]=650 freq[55]=650
freq[56]=650 freq[57]=650 freq[58]=650 freq[59]=650 freq[60]=650 freq[61]=650 freq[62]=650
total valid nonce number:57242
total send work number:57456
require valid nonce number:57456
repeated_nonce_num:0
err_nonce_num:13944
last_nonce_num:2859
checkBoardState chain[5] rate=4471 bad_chip_num=2 lowest rate=4666 ret=0
Detect Higher Rate = 4471 > 0 on chain[5] freq=650 voltage=870
checkBoardState chain[6] rate=4623 bad_chip_num=0 lowest rate=4666 ret=0
Detect Higher Rate = 4623 > 0 on chain[6] freq=650 voltage=870
checkBoardState chain[7] rate=4626 bad_chip_num=0 lowest rate=4666 ret=0
Detect Higher Rate = 4626 > 0 on chain[7] freq=650 voltage=870
After TEST CRC error counter=0
check FAN Speed: fan[2] speed=5880
check FAN Speed: fan[2] speed=5880
check FAN Speed: fan[2] speed=5880
check FAN ERROR: fan num=1 , ought to be 2
Thanks.