My K16 is hashing with 1 chip.
It's not giving a correct result nonce yet but I'm almost 99% sure that's due to errors in precalc code or maybe I'm shifting in word order backwards.
I soldered an ASIC on the board this morning and spent all day twiddling this and that, mostly getting the clock config right. Then bamm I saw something that looked like nonce bits coming out but they were stunted little bits trying desperately to reach high. But the twinkle was enough and I went through the Avalon reference design again to check and realized I'd not used the correct pull-up value for the result lines. Somehow I f'd up and spec'd 100k resistors instead of 470R. So I quickly removed them and soldered in some 1K that I had here. And the bits sprung forth to their full heights. I was totally excited and hopping around my work bench. Even my wife was in to see what the big commotion was about.
So... next step, test the UART receiver, and muck with the send data until the right nonce comes back.
BTW I'm running at half clock b/c no heat sink attached yet. The chip gets slightly warmish to the finger. With only one chip and half clock you have to set the scope trigger and hold the probes for a while until a nonce is found. I counted about 4-8 seconds but of course it depends on what point it is in it's cycle. One chip at half clock (128MHz) would be about 33 seconds for a full sweep.
Brilliant!