|
Directory |
| Caching In: P4 Extreme Edition |
|
|
|
Page 4 of 6
Fundamental Differences between Intel and AMD
AMD uses exclusive cache, where cache is separated at each level. So
if one has 1024k of L2, 64k of L1 data, and 64k of L1 instruction, they
actually have a total of 1152k. Does this mean that Intel’s cache is
inferior? It can go either way. You’re still getting more total cache
with the Intel approach, and there are many other pros and cons that
factor into the equation. Inclusive cache is a lot easier to implement
than exclusive, but exclusive is significantly faster. In fact, if
there’s a cache miss on an AMD chip, and the data must be found in
memory, the data will go directly into L1 and only go to L2 if is
retired from L1. There is a slight increase in logic overhead though;
this comes from mapping channels to all levels of cache instead of just
to the highest level, and the need to figure out where everything is.
With inclusive, if something is in L1, you know it’s everywhere (so L3
or L2 doesn’t need to probe a lower level to figure out what goes
where) and synchronization is a breeze. |
||||||||