| Intel's Sandy Bridge II: HD Gfx and AVX |
|
|
| Written by Michael Schuette | ||
| Jan 16, 2011 at 06:38 PM | ||
|
Page 7 of 18
Advanced Vector Extensions are an extension to the x 86 instruction set architecture geared at increasing floating point performance that were originally proposed by Intel. Briefly, AVX is part of the same family of extensions as 3DNow and SSE in its various generations and falls under the umbrella of single instruction/multiple data extensions. AVX increases the width of the SIMD register file from 128-bits to 256-bits and the register file set is further renamed from XMM0-15 to YMM0-15. Legacy SSE instructions operate on the lower half of the YMM register file, that is bits 0-127. AVX is supposed to dramatically increase floating point performance by essentially doubling the floating point throughput by doubling the width. So far, no final release of any Microsoft OS is supporting AVX the instruction set, howewer, there is a silver lining on the horizon in that the latest beta and RCs of Windows7 SP1 finally implement AVX support. Given the essentially non-existent support for AVX (which will, of course change) it is not surprising that currently, there are no applications out there that take advantage of the doubled floating point throughput. Notable exceptions are Sisoft Sandra and AIDA 64, both of which provide at least theoretical benchmarks. We used SiSoft Sandra's multimedia processor benchmark to see whether there was any truth to the claims:
![]()
In real life applications, the situation is somewhat different. We were lucky enough to hook up with Inartis to run the latest Kribibench 3D version, using Kribiplayer, a browser-based 3D-modeling/rendering program based on the Kribi 3.0 engine. As a word of caution, the benchmark was developed completely blind and is at this moment not optimized for AVX or tuned to make the best usage of AVX, so please, regard these benchmarks as work in progress. We ran all scenes but the relative results were essentially the same regardless of which scene we used. Needless to say that "legacy processors" such as the Phenom II or the Gulftown cannot take advantage of AVX instructions, nor can the Shanghai-based Opterons (all of which were run as control and for reference).
![]()
Given all these limitations and restrictions, it is reasonable to assume a 25-30% performance increase in AVX-enabled applications, rather than a doubling in performance as suggested by theoretical benchmarks. Discuss this article in our forums
|
||
| Last Updated ( Feb 21, 2011 at 05:08 AM ) | ||
| <Previous Article | Next Article> |
|---|

