On Wed, 16 Jul 2008, Clint Whaley wrote:
> Guys,
>
> >Which Linux distro do you use ? You should kill all the
> >"power-sensual" daemons (like powersaved in SuSE) and remove the
> >corresponding kernel daemons like cpufreq.
>
> I tried two linux distros: kubuntu hardy heron & Fedora Core 9. FC9 does
> it a *lot* less than kubuntu, but it still does it. I have turned off
> "cool & quiet" in the bios, and cpuinfo shows full speed even as my
> timings drop by half. I verified that cpufreq doesn't work after the BIOS
> turnoff (the scaling directiries are missing from ACPI).
>
> I more & more suspect the problem is in the motherboard. Dean (I think it was)
> mentioned that thermal throttling is broken in the Phenom; I wonder if
> the mobo assumes it works and does some voltage things it can't handle
> in response to OS calls.
hmm that could be possible ... a few choices for figuring this out --
for reference, fam10h BKDG:
http://www.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/31116.PDFand fam10h revision guide:
http://www.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/41322.PDFtry this:
setpci -d 1022:1204 64.l
that should print out the "F3x64 Hardware Thermal Control (HTC) Register"
... if bit 0 is non-zero then HTC is enabled. try disabling it like so:
setpci -d 1022:1204 64.l=0
> > Whatever it is, it is affected by OS, so it is not pure hardware. But,
> I wonder if the OS sends some signal that the mobo should ignore,
> but instead attempts something the k10h can't do . . .
>
> Anyway, if anyone can tell me OS & mobo combinations that they have seen
> work for the Phenom, I'd appreciate it.
it's been a while since i've built atlas -- but i'll give it a spin on my
phenom and report back. 3.9.0 is good enough?
> >BIOS patch was declared as leading to some performance degradation.
>
> Yeah, but I did not expect that massive die-off for a cache-dominated
> algorithm like GEMM. For HPC, the slowdown is massive and pervasive,
> but the TLB bug is triggered daily.
are you sure it's the TLB bug? in lots of testing i've never tripped the
erratum 298 problem.
if you want to experiment with the workarounds, build
http://code.google.com/p/iotools/ and put it into your PATH.
then execute a script something like this:
for cpu in `awk '/^processor/ {print $3}' /proc/cpuinfo`; do
# disable erratum 298 workaround
wrmsr $cpu 0xc0010015 $(and $(rdmsr $cpu 0xc0010015) $(not $(shl 1 3)))
wrmsr $cpu 0xc0011023 $(and $(rdmsr $cpu 0xc0011023) $(not $(shl 1 1)))
# disable erratum 309 workaround
wrmsr $cpu 0xc0011023 $(and $(rdmsr $cpu 0xc0011023) $(not $(shl 1 23)))
done
you can get more info on both workarounds from the revision guide above.
-dean
-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/_______________________________________________
Math-atlas-devel mailing list
Math-atlas-devel@...
https://lists.sourceforge.net/lists/listinfo/math-atlas-devel