|
Advice Beginners BIOS Guide CPUs Links Mainboards Memory Network Storage Video/Sound Cards Contact Forum SiteMap Sponsors WebNews Home |
. | . |
Prices: Mainboards ABIT ASUS Chaintech Shuttle Soyo Tyan CPU Intel P4 2.4C-800 P4 2.6C-800 P4 2.8C-800 P4 3.0-800 P4 3.2-800 AMD AthlonXP XP 1700+ XP 2000+ XP 2400+ XP 2500+ XP 2700+ XP 3000+ XP 3200+ Athlon64 Athlon64 3200+ Athlon64 FX-51 Opteron Opteron 240 Opteron 242 Opteron 244 Opteron 246 Memory Corsair Crucial Kingston Mushkin OCZ |
LOSTCIRCUITS
|
|
| AMD's AM2 Platform DDR2 ... Moving On | |
|
(Review by MS May 23, 2006) |
Memory Subsystem
With a new memory architecture, the obvious questions are naturally about the theoretical performance and then the extrapolation of the results into real world applications. As generally accepted, even theoretical memory performance is based on two separate components, that is, raw bandwidty and latency. The first can be calculated rather easily from the bus width and frequency, in a dual channel DDR2 800 configuration, the theoretical peak bandwidth equals 128bit x 800 MHz or 12.8 GB/sec. In the latest iterations of the older DDR architecture, the peak bandwidth at 400 MHz was 6.4 GB and a good system could always reach real values in the order of 6 GB/sec - equivalent to roughly 90-95% bus utiliazation.

In the case of the new AM2 architecture, DDR2 results in a slight increase in streaming memory bandwidth, however, compared to the theoretical maximum of 12.8 GB/s, the real world performance of ~7.1 GB/sec only amounts to some 56 % bus utilization. Still, the overall memory bandwidth is higher than even the theoretical maximum of the DR400 standard in the first generation DDR architecture.
Latency
In the introduction, we mentioned that, because of the higher frequency and lower clock cycle time, the actual latencies of a DDR2-800 system should be close to that of a DDR-400 system. On the other hand, at DDR2-800 we did not succeed to maintain a CMD rate of 1T, which adds latency particularly in a page miss situation where another open row can be accessed through bank interleaving. This scenario is likely to happen in predictable page boundary crossing as it would happen in large block transfers at a predefined long stride.
Since we had two Windsor cores with both full and half caches, we compared them to the Toledo and the Manchester core, respectively. For simplicity reason, we only show the data at DDR2-800 vs. DDR400
FX60 vs. FX62 (both 2.8GHz)
FX-60 (solid) vs. FX-62 (transparent): lower is better
X2-3800+ vs X2-5000+

X2-3800+ (solid) vs. X2-5000+ (transparent): lower is better
As long as the stride length is kept below the page size (4KB), the DDR2 platform holds up admirably well with only a 10% increase in access latency of the main memory compared to the DDR400 configuration. However, whenever each access is a page miss, the latency increase of the DDR2 platform reach 25% (FX series) and 30% (X2 series). Whether this is a realistic scenario for real world performance is a question, though.
|
Athlon64 X2-3800+ (dual core) |
next page: => Power Consumption =>
All advice and educational articles on LostCircuits are free, but if you feel you can, please make a small donation to us!
Thank you!