Hello guys, I'd like to ask for the community's help again.
The short version of my story is as follows:
I've got a small mining operation, and a couple of days ago I needed to make space for a new miner. While I was at it, I decided to rearrange my Antminers for better airflow. I unplugged them, moved them to their new places, plugged them back in, and I
DID NOT change anything else: same router, internet connection, PSU, cables, everything. Since then they have started to misbehave. By misbehaving I mean they stop mining completely. Rebooting them remotely does nothing,
AND the normal way of restarting them physically fails as well: unplugging them and plugging them back in does not get them mining again. Right now, the only way to get them running again is to reinstall the firmware and hope for the best, but even then they simply stop mining and go into a loop of continuously restarting, or at least trying to restart.
This restart cycle can begin anywhere from 30 minutes to 9-10 hours in.
This happened to 6(!) of my 9 Antminers.
Another interesting thing: on one or two of them, the green light stays on continuously after they stop hashing.
Here is the kernel log that the miner produces after it stops hashing:
[ 0.000000] Booting Linux on physical CPU 0x0
[ 0.000000] Linux version 3.14.0-xilinx-ge8a2f71-dirty (lzq@armdev2) (gcc version 4.8.3 20140320 (prerelease) (Sourcery CodeBench Lite 2014.05-23) ) #82 SMP PREEMPT Tue May 16 19:49:53 CST 2017
[ 0.000000] CPU: ARMv7 Processor [413fc090] revision 0 (ARMv7), cr=18c5387d
[ 0.000000] CPU: PIPT / VIPT nonaliasing data cache, VIPT aliasing instruction cache
[ 0.000000] Machine model: Xilinx Zynq
[ 0.000000] cma: CMA: reserved 128 MiB at 27800000
[ 0.000000] Memory policy: Data cache writealloc
[ 0.000000] On node 0 totalpages: 258048
[ 0.000000] free_area_init_node: node 0, pgdat c0740a40, node_mem_map e6fd8000
[ 0.000000] Normal zone: 1520 pages used for memmap
[ 0.000000] Normal zone: 0 pages reserved
[ 0.000000] Normal zone: 194560 pages, LIFO batch:31
[ 0.000000] HighMem zone: 496 pages used for memmap
[ 0.000000] HighMem zone: 63488 pages, LIFO batch:15
[ 0.000000] PERCPU: Embedded 8 pages/cpu @e6fc0000 s9088 r8192 d15488 u32768
[ 0.000000] pcpu-alloc: s9088 r8192 d15488 u32768 alloc=8*4096
[ 0.000000] pcpu-alloc: [0] 0 [0] 1
[ 0.000000] Built 1 zonelists in Zone order, mobility grouping on. Total pages: 256528
[ 0.000000] Kernel command line: noinitrd mem=1008M console=ttyPS0,115200 root=ubi0:rootfs ubi.mtd=1 rootfstype=ubifs rw rootwait
[ 0.000000] PID hash table entries: 4096 (order: 2, 16384 bytes)
[ 0.000000] Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
[ 0.000000] Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)
[ 0.000000] Memory: 884148K/1032192K available (5032K kernel code, 283K rwdata, 1916K rodata, 204K init, 258K bss, 148044K reserved, 253952K highmem)
[ 0.000000] Virtual kernel memory layout:
[ 0.000000] vector : 0xffff0000 - 0xffff1000 ( 4 kB)
[ 0.000000] fixmap : 0xfff00000 - 0xfffe0000 ( 896 kB)
[ 0.000000] vmalloc : 0xf0000000 - 0xff000000 ( 240 MB)
[ 0.000000] lowmem : 0xc0000000 - 0xef800000 ( 760 MB)
[ 0.000000] pkmap : 0xbfe00000 - 0xc0000000 ( 2 MB)
[ 0.000000] modules : 0xbf000000 - 0xbfe00000 ( 14 MB)
[ 0.000000] .text : 0xc0008000 - 0xc06d1374 (6949 kB)
[ 0.000000] .init : 0xc06d2000 - 0xc0705380 ( 205 kB)
[ 0.000000] .data : 0xc0706000 - 0xc074cf78 ( 284 kB)
[ 0.000000] .bss : 0xc074cf84 - 0xc078d9fc ( 259 kB)
[ 0.000000] Preemptible hierarchical RCU implementation.
[ 0.000000] Dump stacks of tasks blocking RCU-preempt GP.
[ 0.000000] RCU restricting CPUs from NR_CPUS=4 to nr_cpu_ids=2.
[ 0.000000] RCU: Adjusting geometry for rcu_fanout_leaf=16, nr_cpu_ids=2
[ 0.000000] NR_IRQS:16 nr_irqs:16 16
[ 0.000000] ps7-slcr mapped to f0004000
[ 0.000000] zynq_clock_init: clkc starts at f0004100
[ 0.000000] Zynq clock init
[ 0.000016] sched_clock: 64 bits at 333MHz, resolution 3ns, wraps every 3298534883328ns
[ 0.000312] ps7-ttc #0 at f0006000, irq=43
[ 0.000623] Console: colour dummy device 80x30
[ 0.000663] Calibrating delay loop... 1325.46 BogoMIPS (lpj=6627328)
[ 0.040212] pid_max: default: 32768 minimum: 301
[ 0.040439] Mount-cache hash table entries: 2048 (order: 1, 8192 bytes)
[ 0.040462] Mountpoint-cache hash table entries: 2048 (order: 1, 8192 bytes)
[ 0.042614] CPU: Testing write buffer coherency: ok
[ 0.042978] CPU0: thread -1, cpu 0, socket 0, mpidr 80000000
[ 0.043040] Setting up static identity map for 0x4c4b00 - 0x4c4b58
[ 0.043264] L310 cache controller enabled
[ 0.043284] l2x0: 8 ways, CACHE_ID 0x410000c8, AUX_CTRL 0x72760000, Cache size: 512 kB
[ 0.121042] CPU1: Booted secondary processor
[ 0.210231] CPU1: thread -1, cpu 1, socket 0, mpidr 80000001
[ 0.210364] Brought up 2 CPUs
[ 0.210383] SMP: Total of 2 processors activated.
[ 0.210392] CPU: All CPU(s) started in SVC mode.
[ 0.211053] devtmpfs: initialized
[ 0.213480] VFP support v0.3: implementor 41 architecture 3 part 30 variant 9 rev 4
[ 0.214721] regulator-dummy: no parameters
[ 0.223296] NET: Registered protocol family 16
[ 0.225618] DMA: preallocated 256 KiB pool for atomic coherent allocations
[ 0.227909] cpuidle: using governor ladder
[ 0.227922] cpuidle: using governor menu
[ 0.235454] syscon f8000000.ps7-slcr: regmap [mem 0xf8000000-0xf8000fff] registered
[ 0.237000] hw-breakpoint: found 5 (+1 reserved) breakpoint and 1 watchpoint registers.
[ 0.237014] hw-breakpoint: maximum watchpoint size is 4 bytes.
[ 0.237139] zynq-ocm f800c000.ps7-ocmc: ZYNQ OCM pool: 256 KiB @ 0xf0080000
[ 0.259165] bio: create slab at 0
[ 0.260997] vgaarb: loaded
[ 0.262443] SCSI subsystem initialized
[ 0.263328] usbcore: registered new interface driver usbfs
[ 0.263501] usbcore: registered new interface driver hub
[ 0.263730] usbcore: registered new device driver usb
[ 0.264254] media: Linux media interface: v0.10
[ 0.264410] Linux video capture interface: v2.00
[ 0.264652] pps_core: LinuxPPS API ver. 1 registered
[ 0.264664] pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti
[ 0.264790] PTP clock support registered
[ 0.265167] EDAC MC: Ver: 3.0.0
[ 0.266218] Advanced Linux Sound Architecture Driver Initialized.
[ 0.269180] DMA-API: preallocated 4096 debug entries
[ 0.269194] DMA-API: debugging enabled by kernel config
[ 0.269288] Switched to clocksource arm_global_timer
[ 0.289263] NET: Registered protocol family 2
[ 0.290256] TCP established hash table entries: 8192 (order: 3, 32768 bytes)
[ 0.290353] TCP bind hash table entries: 8192 (order: 4, 65536 bytes)
[ 0.290513] TCP: Hash tables configured (established 8192 bind 8192)
[ 0.290576] TCP: reno registered
[ 0.290595] UDP hash table entries: 512 (order: 2, 16384 bytes)
[ 0.290653] UDP-Lite hash table entries: 512 (order: 2, 16384 bytes)
[ 0.290937] NET: Registered protocol family 1
[ 0.291321] RPC: Registered named UNIX socket transport module.
[ 0.291334] RPC: Registered udp transport module.
[ 0.291343] RPC: Registered tcp transport module.
[ 0.291351] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 0.291365] PCI: CLS 0 bytes, default 64
[ 0.291826] hw perfevents: enabled with ARMv7 Cortex-A9 PMU driver, 7 counters available
[ 0.293916] futex hash table entries: 512 (order: 3, 32768 bytes)
[ 0.295353] bounce pool size: 64 pages
[ 0.296294] jffs2: version 2.2. (NAND) © 2001-2006 Red Hat, Inc.
[ 0.296507] msgmni has been set to 1486
[ 0.297318] io scheduler noop registered
[ 0.297331] io scheduler deadline registered
[ 0.297373] io scheduler cfq registered (default)
[ 0.308459] dma-pl330 f8003000.ps7-dma: Loaded driver for PL330 DMAC-2364208
[ 0.308480] dma-pl330 f8003000.ps7-dma: DBUFF-128x8bytes Num_Chans-8 Num_Peri-4 Num_Events-16
[ 0.433543] e0001000.serial: ttyPS0 at MMIO 0xe0001000 (irq = 82, base_baud = 3124999) is a xuartps
[ 1.005644] console [ttyPS0] enabled
[ 1.009953] xdevcfg f8007000.ps7-dev-cfg: ioremap 0xf8007000 to f0068000
[ 1.017592] [drm] Initialized drm 1.1.0 20060810
[ 1.034622] brd: module loaded
[ 1.044079] loop: module loaded
[ 1.053803] e1000e: Intel(R) PRO/1000 Network Driver - 2.3.2-k
[ 1.059659] e1000e: Copyright(c) 1999 - 2013 Intel Corporation.
[ 1.067456] libphy: XEMACPS mii bus: probed
[ 1.071857] ------------- phy_id = 0x3625e62
[ 1.076596] xemacps e000b000.ps7-ethernet: pdev->id -1, baseaddr 0xe000b000, irq 54
[ 1.085323] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
[ 1.092119] ehci-pci: EHCI PCI platform driver
[ 1.099363] zynq-dr e0002000.ps7-usb: Unable to init USB phy, missing?
[ 1.106185] usbcore: registered new interface driver usb-storage
[ 1.113103] mousedev: PS/2 mouse device common for all mice
[ 1.119210] i2c /dev entries driver
[ 1.126175] zynq-edac f8006000.ps7-ddrc: ecc not enabled
[ 1.131788] cpufreq_cpu0: failed to get cpu0 regulator: -19
[ 1.137680] Xilinx Zynq CpuIdle Driver started
[ 1.142564] sdhci: Secure Digital Host Controller Interface driver
[ 1.148658] sdhci: Copyright(c) Pierre Ossman
[ 1.153075] sdhci-pltfm: SDHCI platform and OF driver helper
[ 1.158813] mmc0: no vqmmc regulator found
[ 1.162857] mmc0: no vmmc regulator found
[ 1.199305] mmc0: SDHCI controller on e0100000.ps7-sdio [e0100000.ps7-sdio] using ADMA
[ 1.207991] usbcore: registered new interface driver usbhid
[ 1.213505] usbhid: USB HID core driver
[ 1.218254] nand: device found, Manufacturer ID: 0x2c, Chip ID: 0xda
[ 1.224556] nand: Micron MT29F2G08ABAEAWP
[ 1.228518] nand: 256MiB, SLC, page size: 2048, OOB size: 64
[ 1.234463] Bad block table found at page 131008, version 0x01
[ 1.240684] Bad block table found at page 130944, version 0x01
[ 1.246734] 3 ofpart partitions found on MTD device pl353-nand
[ 1.252520] Creating 3 MTD partitions on "pl353-nand":
[ 1.257607] 0x000000000000-0x000002000000 : "BOOT.bin-env-dts-kernel"
[ 1.265714] 0x000002000000-0x00000b000000 : "angstram-rootfs"
[ 1.273048] 0x00000b000000-0x000010000000 : "upgrade-rootfs"
[ 1.282074] TCP: cubic registered
[ 1.285312] NET: Registered protocol family 17
[ 1.290035] Registering SWP/SWPB emulation handler
[ 1.295979] regulator-dummy: disabling
[ 1.300400] UBI: attaching mtd1 to ubi0
[ 1.825264] UBI: scanning is finished
[ 1.837034] UBI: attached mtd1 (name "angstram-rootfs", size 144 MiB) to ubi0
[ 1.844114] UBI: PEB size: 131072 bytes (128 KiB), LEB size: 126976 bytes
[ 1.850874] UBI: min./max. I/O unit sizes: 2048/2048, sub-page size 2048
[ 1.857538] UBI: VID header offset: 2048 (aligned 2048), data offset: 4096
[ 1.864416] UBI: good PEBs: 1152, bad PEBs: 0, corrupted PEBs: 0
[ 1.870397] UBI: user volume: 1, internal volumes: 1, max. volumes count: 128
[ 1.877504] UBI: max/mean erase counter: 54/23, WL threshold: 4096, image sequence number: 1134783803
[ 1.886721] UBI: available PEBs: 0, total reserved PEBs: 1152, PEBs reserved for bad PEB handling: 40
[ 1.895938] UBI: background thread "ubi_bgt0d" started, PID 1080
[ 1.895943] drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
[ 1.900027] ALSA device list:
[ 1.900031] No soundcards found.
[ 1.916406] UBIFS: background thread "ubifs_bgt0_0" started, PID 1082
[ 1.945403] UBIFS: recovery needed
[ 2.061126] UBIFS: recovery completed
[ 2.064796] UBIFS: mounted UBI device 0, volume 0, name "rootfs"
[ 2.070745] UBIFS: LEB size: 126976 bytes (124 KiB), min./max. I/O unit sizes: 2048 bytes/2048 bytes
[ 2.079852] UBIFS: FS size: 128626688 bytes (122 MiB, 1013 LEBs), journal size 9023488 bytes (8 MiB, 72 LEBs)
[ 2.089749] UBIFS: reserved for root: 0 bytes (0 KiB)
[ 2.094772] UBIFS: media format: w4/r0 (latest is w4/r0), UUID B079DD56-06BB-4E31-8F5E-A6604F480DB2, small LPT model
[ 2.106300] VFS: Mounted root (ubifs filesystem) on device 0:11.
[ 2.113724] devtmpfs: mounted
[ 2.116838] Freeing unused kernel memory: 204K (c06d2000 - c0705000)
[ 2.960506] random: dd urandom read with 0 bits of entropy available
[ 3.359312]
[ 3.359312] bcm54xx_config_init
[ 3.969339]
[ 3.969339] bcm54xx_config_init
[ 6.970148] xemacps e000b000.ps7-ethernet: Set clk to 24999999 Hz
[ 6.976243] xemacps e000b000.ps7-ethernet: link up (100/FULL)
This is XILINX board. Totalram: 1039794176
Detect 1GB control board of XILINX
DETECT HW version=0008c510
miner ID : 8048d52e123b885c
Miner Type = S9
AsicType = 1387
real AsicNum = 63
use critical mode to search freq...
get PLUG ON=0x000000e0
Find hashboard on Chain[5]
Find hashboard on Chain[6]
Find hashboard on Chain[7]
set_reset_allhashboard = 0x0000ffff
Check chain[5] PIC fw version=0x03
Check chain[6] PIC fw version=0x03
Check chain[7] PIC fw version=0x03
chain[5]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
has freq in PIC, will disable freq setting.
chain[5] has freq in PIC and will jump over...
Chain[5] has core num in PIC
Chain[5] ASIC[12] has core num=15
Chain[5] ASIC[22] has core num=5
Chain[5] ASIC[34] has core num=1
Chain[5] ASIC[38] has core num=9
Chain[5] ASIC[45] has core num=2
Check chain[5] PIC fw version=0x03
chain[6]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
has freq in PIC, will disable freq setting.
chain[6] has freq in PIC and will jump over...
Chain[6] has core num in PIC
Chain[6] ASIC[40] has core num=1
Chain[6] ASIC[42] has core num=1
Chain[6] ASIC[61] has core num=3
Check chain[6] PIC fw version=0x03
chain[7]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
has freq in PIC, will disable freq setting.
chain[7] has freq in PIC and will jump over...
Chain[7] has core num in PIC
Chain[7] ASIC[11] has core num=15
Chain[7] ASIC[14] has core num=15
Chain[7] ASIC[45] has core num=1
Check chain[7] PIC fw version=0x03
get PIC voltage=6 on chain[5], value=940
get PIC voltage=6 on chain[6], value=940
get PIC voltage=6 on chain[7], value=940
set_reset_allhashboard = 0x00000000
chain[5] temp offset record: 62,0,0,0,0,0,38,28
chain[5] temp chip I2C addr=0x9a
chain[5] has no middle temp, use special fix mode.
chain[6] temp offset record: 62,0,0,0,0,0,35,28
chain[6] temp chip I2C addr=0x98
chain[6] has no middle temp, use special fix mode.
chain[7] temp offset record: 62,0,0,0,0,0,38,28
chain[7] temp chip I2C addr=0x9a
chain[7] has no middle temp, use special fix mode.
set_reset_allhashboard = 0x0000ffff
set_reset_allhashboard = 0x00000000
CRC error counter=0
set command mode to VIL
--- check asic number
After Get ASIC NUM CRC error counter=0
set_baud=0
The min freq=700
set real timeout 52, need sleep=379392
After TEST CRC error counter=0
set_reset_allhashboard = 0x0000ffff
set_reset_allhashboard = 0x00000000
search freq for 1 times, completed chain = 3, total chain num = 3
set_reset_allhashboard = 0x0000ffff
set_reset_allhashboard = 0x00000000
restart Miner chance num=2
waiting for receive_func to exit!
waiting for pic heart to exit!
bmminer not found= 1448 root 0:00 grep bmminer
bmminer not found, restart bmminer ...
This is user mode for mining
Detect 1GB control board of XILINX
Miner Type = S9
Miner compile time: Fri Nov 17 17:57:49 CST 2017 type: Antminer S9set_reset_allhashboard = 0x0000ffff
set_reset_allhashboard = 0x00000000
set_reset_allhashboard = 0x0000ffff
miner ID : 8048d52e123b885c
set_reset_allhashboard = 0x0000ffff
Checking fans!get fan[2] speed=6120
get fan[2] speed=6120
get fan[2] speed=6120
get fan[2] speed=6120
get fan[2] speed=6120
get fan[2] speed=6120
get fan[2] speed=6120
get fan[2] speed=6120
get fan[2] speed=6120
get fan[5] speed=6120
get fan[2] speed=6120
get fan[5] speed=6120
get fan[2] speed=6120
get fan[5] speed=6120
chain[5]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
Detect: use voltage limit rules on single board!
Detect: S9_63 use voltage level=3 : 0
Chain[J6] has backup chain_voltage=870
Check chain[5] PIC fw version=0x03
chain[6]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
Detect: use voltage limit rules on single board!
Detect: S9_63 use voltage level=2 : 2
Chain[J7] has backup chain_voltage=880
Check chain[6] PIC fw version=0x03
chain[7]: [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255] [63:255]
Detect: use voltage limit rules on single board!
Detect: S9_63 use voltage level=3 : 0
Chain[J8] has backup chain_voltage=890
Check chain[7] PIC fw version=0x03
Chain[J6] orignal chain_voltage_pic=6 value=940
Chain[J6] will use backup chain_voltage_pic=870 [125]
Chain[J6] get working chain_voltage_pic=125
Chain[J7] orignal chain_voltage_pic=6 value=940
Chain[J7] will use backup chain_voltage_pic=880 [108]
Chain[J7] get working chain_voltage_pic=108
Chain[J8] orignal chain_voltage_pic=6 value=940
Chain[J8] will use backup chain_voltage_pic=890 [91]
Chain[J8] get working chain_voltage_pic=91
set_reset_allhashboard = 0x0000ffff
set_reset_allhashboard = 0x00000000
Chain[J6] has 63 asic
Chain[J7] has 63 asic
Chain[J8] has 63 asic
Since then, I have tried running only one machine, bought a new router and a new switch, switched to a new internet connection, used new Ethernet cables, and tried compatible
OFFICIAL firmware, but to no avail.
I would really appreciate a fresh perspective; I might be missing something.
Thank you in advance,
Csavar.
Edit: As was pointed out, I may have made the mistake of installing the 2019 software, and that may be causing this problem. If that is the case, what are my options? Since then I've tried "the package to fix upgrade failure" and installed the 2018 version, but that still doesn't seem to help.