Update: initial statistical evaluation
Having an almost statistically significant amount of units at hand and fought almost two weeks to get the most out of them, I have collected some numbers that might be interesting for you.
Setup details:
- 50 boards, serial numbers 55-100 and 126-129
- all boards SPI programmed to twin_test.bit
- all DIP switches set as documented for twin test mode, plus SW6-1 off for 115kBaud
- powered 7-port USB2.0 hubs
- 1 x Enermax Revo 1.5kW + 2 x CoolerMaster 1.2kW gold PSUs
- programming with xc3sprog natively under Linux
- mining with native cgminer-2.4.4 for Icarus under Linux
What made testing hard for me was the non deterministic failure patterns, i.e. one unit might fail today and work tomorrow or vice versa. After successively narrowing the problematic units out, I classified my batch like follows:
1. OKAround 23 boards work almost stable, meaning after 24h there are only 2-5 FPGAs that stopped operation, while the other ~40 settle down to a utility of ~2.5/min. This corresponds to ~360MHps / unit, pool is reporting around 8GH for the whole array. Having to power-cycle the whole setup every 24h is far from being maintenance-free, but I get already 20% of the potential total hashing power when the Quads operated as proposed.
2. UNSTABLEUnstable units are those that
- start to mine at full speed but stop after some hours
- mine with a significantly lower hashing rate: U < 1/min
- start to mine with only one FPGA
- or an arbitrary combination of the named issues
Those devices are the most problematic ones, since you need several rounds to identify an unstable unit. From my batch 15 belong to this category.
3. FAILINGI classify failing units those that are detected but do not mine, including
- those that fail the Icarus detection (i.e. fail to find the golden nonce)
- those that are detected as Icarus but do not generate any valid shares
My batch has 9 of those units, which are easy to identify, since the failure is reproducible, i.e. no matter if you reprogram them before you start or not, they always fail to generate valid shares.
4. BROKENThose are the units that either
- are not detected as ttyUSB serial ports
- get disconnected immediately after detection (Linux reports bus errors)
- fail to initialize communication (Linux reports errors setting com parameters or flow control)
- just freeze the system until you pull the plug
I have 3 of those boards that I assume need to be sent back to Enterpoint for repair, since the problem seems to be located at the FTDI side.
Bottom line: your chances to setup your Cairnsmore1 as Icarus and mine continuously are less than 45%. Your odds to have some unstable device are 30% and that device will trick and cheat you until you start to disbelieve in your skills. 18% will find their board not generating valid hashes in Icarus mode (maybe only working in shipping-test mode, not tested). And finally 6% might be the unlucky ones that buy new USB hubs and/or switch between Linux and Windows to just see that the board is not detected and/or makes the system hang.
This is a rough estimate based on my batch, YMMV. The only advice I can give: do not kick the board too hard and don't get frustrated about your skills - chances are 50%+ that yours is not the OK one.
Yohan, your team did a great job, the HW is really top-notch (Swiss made fans, do I need say more?
). But now, priority 1 is to unfold the potential of this piece of HW as soon as possible. While I am glad to see the PDB, stacking kits and up/down links and realize how they will significantly improve the mess I currently have in my setup - more important than that is to see the Quads operating continuously somewhere around 800MHps. I therefore hope that your own bitstream is progressing well and you also consider to support EldenTyrell developing a CM1 TriCore bitstream.
Thanks and good night.