Ok we don't know if this will fully fixes all problems because as yet we don't setups here that show the same problems some of you have and that is why we need work on each problem individually. We do need each board that you have a problem with to be reported to the bitcoin support email with the full circumstances. Not everyone on the team has the time to wade through the forum and they don't unless they stop work on new features or support work so that means problems can be missed. I will try to patch the gaps but I also have a limit in what I can find time to do. It's much better if the information arrives at the correct place and several people get to see it. It also acts as a log we can go back through then also.
Yohan,
I'd really like to provide you with valuable error reports to help your team sorting out the problems, but after the last incident I need to start from the beginning
As I reported, I had a setup with 26 almost stable working CM1 boards in dual Icarus mode. Over the weekend I disassembled the setup to stack the boards and after re-assembling it I found the non deterministic behavior I already saw with the defunct units I sorted out. Some boards failed the golden nonce test, others caused Linux to hang while trying to set COM parameters, or others that start mining with a very low hashrate.
Being aware that the units worked before, I started again monkey testing: varying USB cables, hubs, ports, etc. in a fully random manner. For me it turned out that some boards work with a very specific setup, like: only with one specific USB cable connected to a passive hub that is connected to a powered hub at a given port
As soon as I change cable or plug into a different USB port, the golden nonce test fails or other errors occur.
If I had a clue on HW design I would check whether the FTDI chip is correctly assembled, but since I don't I can only speculate that there is some systematic problem with this component. It is hard to believe that you did not encounter this issue during your testing, when from my batch every second board is fragile.
I spent so much time now getting my boards to work, but since it turned out to be such fragile and non deterministic, I lost motivation to dig deeper. Hope this will be solved once the up/down functionality is supported.
Have a nice day.