Author

Topic: [About CPU coins] Intel unveils 1 teraflop chip with 50-plus cores (Read 1473 times)

legendary
Activity: 1666
Merit: 1057
Marketing manager - GO MP
50 Cores is nothing.

There are dirt cheap chips with more than twice of that. And as said fast memory is the issue with scrypt. Time will come were parallel architectures will be usable for this but not yet.

Memory takes alot of silicon estate, too much to pack a decent amount next to many small cores. But once we have sandwich '3D chips' this will be feasible.
legendary
Activity: 1148
Merit: 1008
If you want to walk on water, get out of the boat
Quote
Many AMD GPUs are already faster than this, for instance 5970 and 6990 can pull 1 TFLOPs.
Yes but, as DeathAndTaxes said, i think that this Knights Corner will have more cache than a GPU. Sure, it will be slower, but by having more cache it will be useful for more things
donator
Activity: 1218
Merit: 1079
Gerald Davis
I would hope less than that.  It is 22nm process which should be roughly 4x the performance per watt as a 45nm chip for the same architecture.

Granted we have nothing to compared it to directly but AMD 45nm GPU get ~350W per TFLOP and that includes a lot of non-computational components like highspeed video ram and render units so one would hope a dedicated chip would have higher performance per watt all things being equal.

If it is above 100W that is disapointing.  AMD 7000 series (32nm) chips will likely be in the ballpark of 150W per TFLOP (DP) and that isn't a "pure computing" optimized design.
legendary
Activity: 1484
Merit: 1005

So I wonder what wattage that chips needs.  


From the cooling on the card (high speed intake fan, heat pipes) I would guess that it's probably in the range of 150-300w.
donator
Activity: 1218
Merit: 1079
Gerald Davis


This is pretty cool.  Just 15 years.  From 5KW for 1 TFLOP to 20W.

This is one area it will destroy GPU (which are just starting to make their way into SuperComputers).

A 6970 gets 0.675 TFLOPS (double precision).  That works out to 370W per TFLOP.  While a lot better than 5000W it can't even compete w/ 20W.  Lower wattage not only means less electrical cost it means higher densities.  You can put more chips in the same rack given the same cooling capacity.

In this case almost 15x as many computing power in the same rack space.  Grin

On edit:  looks like I was wrong. The 20W in article isn't the consumption of Intel chip it is the goal for a exascale super computer by 2018 (~100x current fastest supercomputer).  20W per TFLOP is needed to avoid requiring a nuclear reactor to run your exascale computer.  Just for fun.  AMD top of the line GPU get about 370W per TFLOP.  That's 0.37 MW per PFLOP or 370MW per Exaflop.  Ok so a small nuclear reactor Grin

So I wonder what wattage that chips needs.  
donator
Activity: 1218
Merit: 1079
Gerald Davis
PCIe express interface --> slow memory transfer

Well depends on how they configure it and how many lanes. Could be multiple chips & memory on a single card with internal PCIE switch.  While not ultra fast it would be still pretty fast.  PCIe 3.0 w/ 16 lanes is 16GB/s.  The chip could support more than 16 lanes.  

Quote
The only real advantage to GPUs is x86 instruction sets in the cores which may allow for easy porting of code to the offboard "CPUs", but really, this is basically a GPU with slightly different architecture.  

I agree.  The line between CPU & GPU is betting blurred.  CPU becoming more GPU like, the rise of APU (hybrid chips), GPU gaining general purpose computing and moving to more complex "shaders" (more like processing engines).

This would have an advantage over GPU as all of the die is dedicated to computing.  The average GPU "wastes" about 30% of its die for non-shader related functionality.  Of course GPU are so powerful that they overcome this inefficiency w/ pure brute force but a compute engine could be very interesting.  

Quote
Many AMD GPUs are already faster than this, for instance 5970 and 6990 can pull 1 TFLOPs.

Well 5970 is only 0.98 TFLOPs in DP.  The 6990 is 1.278TFLOPS but that is w/ 2 chips.
legendary
Activity: 1484
Merit: 1005
PCIe express interface --> slow memory transfer

The only real advantage to GPUs is x86 instruction sets in the cores which may allow for easy porting of code to the offboard "CPUs", but really, this is basically a GPU with slightly different architecture.  Many AMD GPUs are already faster than this, for instance 5970 and 6990 can pull 1 TFLOPs.
donator
Activity: 1218
Merit: 1079
Gerald Davis
It's architecture looks more "GPU like".  It isn't designed to run operating system but rather co-exist w/ general purpose CPU and acts a a "math accelerator".  Very parallel architecture.

Remember there is no such thing as a "CPU coin" that is a misnomer.  There are simply some coins that require a lot of L1 cache and others that don't.  A CPU w/ insufficient L1 cache will perform poorly. A GPU w/ sufficient L1 cache will perform well.

Lastly there is nothing on its integer performance.  Hell it might not even have integer ALU.  Instead being a dedicated double floating point engine.  In that case it would be worthless for most crytpo work.

Still very cool technology.  Insane power densities. Awesome tech porn.  I loved the watercooled supercomputing chassis.  Interesting it looks like the internal piping is rigid copper not flexible tubing.  
Jump to: