Ok, let's get started. Our latest board of the hour came in with the usual it doesn't work problem. There are a couple of types of it doesn't work:
It blows up the power supply and shorts out.
It comes up but does not hash
It doesn't come up.
First thing to check is to see if anything is unusual about the board. In this case, not really other than the fact that the heat sinks look like they were put on by a 5 year old. I spend a lot of time aligning heat sinks on S7's to be perfectly straight to minimize airflow turbulence since these things are air cooled in small boxes.
However I did notice that one was a little bit loose, could wiggle it a bit like a loose tooth. That's bad. Using the nose I could smell a bit of burned smell at that point on the board, greeeeeat......
So put the board on the preheater, warmed up the board, then used the air tools to warm up the heat sink. Not too difficult, and it came right off leaving this:
Now for a close up of the chip...
And a side view of the chip.
And a view of the heat sink itself.
Using these we can see the problem: This chips is one of the ones that does not have a heat sink on the bottom, and more importantly has only about 50% of the chip covered with the heat sink glue.
From what I can see here it looks like a sloppy Bitmain job when building it led to a board that would run somewhat warm anyway. The chip itself only had 50% contact with the sink, and judging how thick that compound is it looks like the compound made a poor connection between the chip and sink. Over time the chip got warm, had a lower resistance because it is in a series string, pulled more current, more heat, and the usual failure.
Bitmain should warranty this board, but it's probably out of warranty. The lack of compound on the whole chip points to a manufacturing fail.
Solution: We could pull the chip, but that gets complex and I need to talk to the owner before doing that. Likewise if the SCL signal goes through every chip in the board in series, removing the chip breaks the chain and the board does not work. Drat. Wonder if bitmain will sell me a bunch of chips....