I'm not an FPGA expert, but I dabble.
Is that right? SO I guessed correctly, there is at least one member of the bitcoin community with access to at least one FPGA. Are you going to post your code now that you have been outed?
I asked some folks...
And I buy Rogaine for a friend of mine...
The upshot was that a modern desktop machine has such a huge clockspeed advantage (8 cores at 3 GHz?),
Holy crap! Are you serious? This is an average desktop these days? I'm still using a single core P3 running Ubuntu. Well, at least I was until my power supply crapped out and killed my motherboard. Can you believe they want $200 to upgrade!?
so only a massively parallel implementation would have a chance of competing.
My root point was that, those who already owned these chips, don't really need to "compete". The FPGA would simply add to what their CPU(s) can already do. They don't have to *replace* the cpu.
There really isn't much of an I/O constraint, so it's just grinding on the nonce and testing for success. Most of the commercial SHA256 cores focus on I/O bandwidth, for the typical application of passing a lot of data into the hash.
So do you think that a coproccessor using local memory on a FPGA could go faster than the hardware on a VIA 7 due to I/O limitations?
This is something much different, with self-generated inputs and a test on the output of each cycle.
So.. How many rounds can you fit in a million gates?
I'm guessing at least four, maybe as many as 10. Harder to guess if one is using local memory, that stuff can be wicked fast on some things.
A modern GPU and some crafty OpenCL/CUDA code seems like a better avenue for research, rapid turnaround, and scalable speeds. You'd have both high clock speeds and parallelism at work.
Sure, for those who have a GPU and not a FPGA. Sure 55mhz doesn't seem like crap compared to 8 cores at 3 ghz, but a sha-256 setup in software can't process a hash in a single cycle, whereas a well made 'solid state machine' (which, as I understand it, what a FPGA program actually is) can process one hash in a single cycle, if the I/O isn't a limitation anymore. Multiply that by however many cores can be programmed onto a single chip, and there is serious potiental here.