I built four copies of ATLAS 3.9.1 simultaneously on an
Opteron 2350.
The CacheEdge value obtained was 384K or 512K (depending on the
build/tune thread). OK, 512K is what I want: 512K = size(L3)/4.
I expected to see differences in gemm kernel performance, at least
in the large-matrix tests. But "make time" shows practically no
difference between the results for different CacheEdge values
(2 MB, 512K or 384K).
So I have a few questions.
1) How does (d)gemm performance (for large matrices) depend on the
CacheEdge value?
2) Does ATLAS 3.9.x "know" that the Opteron K10 has a 512K L2 cache
*in addition* to the L3 cache?
I saw that 3.8.2 used the *L2* cache size for the CacheEdge value.
3) Do the gemm kernels use software prefetch? IMHO, on K10 prefetch
brings data directly into the L1 cache, whereas on K8 it goes into
the L2 cache.
Yours
Mikhail Kuzminsky
Computer Assistance to Chemical Research Center
Zelinsky Institute of Organic Chemistry
Moscow