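Judging from the test data, I'm in no position to question how powerful Cell is. I skimmed the sm2 test section of that PDF, where they explain their choices: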
Only 96*96 block sizes provide enough computational intensity to overcome the additional block loads and stores, and thus achieving near-peak performance— over 200Gflop/s.
Although the time to load a DP 64*64 block is twice that of the SP version, the time required to compute on a 64*64 DP block is about 14x as long as the SP counterpart (due to the limitations of the DP issue logic). Thus it is far easier for DP to reach its peak performance. — a mere 14.6Gflop/s.
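To put rough numbers on the two quoted passages, here is a back-of-the-envelope sketch in Python. The traffic model (three n x n blocks moved per block multiply: load A, load B, update C) and the name `block_intensity` are my own assumptions for illustration, not something taken from the PDF. The 2x and 14x factors are the ones in the quote above.

```python
def block_intensity(n, bytes_per_elem, blocks_moved=3):
    """Flops per byte for one n x n block update C += A * B.

    Assumed traffic model (not from the PDF): `blocks_moved` blocks of
    n x n elements cross the memory interface per block multiply.
    """
    flops = 2 * n ** 3                                # n^3 multiplies + n^3 adds
    bytes_moved = blocks_moved * n * n * bytes_per_elem
    return flops / bytes_moved


def dp_vs_sp_compute_to_load(load_factor=2.0, compute_factor=14.0):
    """How much larger the compute/load time ratio is in DP than in SP.

    Defaults are the figures quoted for a 64*64 block: DP load time is
    2x the SP load time, DP compute time is about 14x the SP compute time.
    """
    return compute_factor / load_factor


if __name__ == "__main__":
    for n in (32, 64, 96):
        sp = block_intensity(n, 4)   # single precision, 4 bytes per element
        dp = block_intensity(n, 8)   # double precision, 8 bytes per element
        print(f"{n}x{n}: SP {sp:.2f} flop/byte, DP {dp:.2f} flop/byte")
    print("DP compute/load ratio relative to SP:", dp_vs_sp_compute_to_load())
```

Under this simplified model the intensity grows linearly with the block edge n, which fits the claim that SP needs the larger 96*96 blocks. In DP the compute time grows 14x while the load time only doubles, so the loads are hidden much more easily, but only because the DP peak itself is so low.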
But http://www.pcper.com/article.php ... pe=expert&pid=3 makes no explicit mention of this point, so I still have reservations about real-world performance.
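Someone also mentioned last time that on the PS3 only 4 SPEs can be used effectively at present, and that performance drops if you use more. Add to that the rumor that IBM originally proposed two PPEs, and there may really be quite a few obstacles to getting the full performance out of Cell.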