[wplug] Another OT buggy-hardware post
Brandon Kuczenski
brandon at 301south.net
Fri Mar 17 12:23:38 EST 2006
On Fri, 17 Mar 2006, Drew from Zhrodague wrote:
>> Thanks for everyone's suggestions, and thanks especially to those people
>> who can accomodate for the fact that I don't seem to be able to read
>> (Chet) or Google (Jonathan) properly..
>>
>> The power supply is a new premium low-noise one which is more recent than
>> the crashing started happening, so I think it's okay.
>>
>> I dd'ed memtest onto a floppy and ran it overnight at my underclocked speed
>> (1250 MHz). It ran for almost 8 hours and reported no errors. I rebooted
>> at the proper clockspeed (1666 MHz) and was about to leave it to run all
>> day when my computer spontaneously shutdown between 20 and 25 minutes in.
>>
>> Incidentally, memtest reports the processor as a Duron, not an Athlon XP.
>> /proc/cpuinfo reports:
>> processor : 0
>> vendor_id : AuthenticAMD
>> cpu family : 6
>> model : 8
>> model name : AMD Athlon(TM) XP 2000+
>> stepping : 0
>> cpu MHz : 1222.637
>> ...
>>
>> My conclusions: heat and the video card are clearly not the problems. I
>> think the memory itself is ok. There are no jumper settings on the mobo
>> for either CPU or Memory voltage -- just about the only user-configrable
>> settings are the clock multipliers -- so the problem must be the mobo or
>> the cpu. Based on comments in this thread, I'm going to buy myself a new
>> processor.
>
> Are you *sure* that heat is not a problem? CPUs generally get hotter
> under use. As an example, my Athlon 1.4Ghz workstation here has a clogged
> heatsink, and doesn't cool very well. I can sorta lightl use the machine (nfs
> host), but if I fire-up my distributed.net client, the thing will shit itself
> pretty quickly. Ditto if I do anything that is more processor intensive than
> just sitting there sharing its disk.
>
> I have a brand new heatsink/fan combo ready to go, and I'll bet that
> problem will be solved by swapping them. I'll post my results.
>
I thought it was thermal for a long time. But the more I learn, the more
I suspect a defect.
It seems like there are "crash-prone" cpu-intensive tasks and "safe"
cpu-intensive tasks. sensors(1) never reports a CPU temperature higher
than about 51.5 C, even when the computer is working fairly hard... but
like I said, there are particular running conditions (which I assume must
exercise different parts of the processor) which are basically guaranteed
to cause a crash. Running lame(1) will do it. Certain XScreensaver hacks
will do it (XLyap comes to mind), but most won't, even CPU-heavy ones.
I just ran fireworkx(1), one of the XScreensaver hacks that relies on
hardware graphics acceleration that my AGP card doesn't support, for 12
minutes:
b at plaza:/tmp$ uptime
11:38:22 up 1:43, 9 users, load average: 0.98, 1.05, 0.72
and sensors reports the CPU temperature is 50C and the M/B is 43C. Those
seem pretty comfortable based on what I've read and seen.
Of course, then I allowed the computer to rest (CPU temp came down to 47C
and stayed there) and ran 'lame' for about 5 minutes straight, which has
reliably caused crashes in the past, and the computer was fine. So who
knows? THe only thing that is absolutely reliable in generating crashes
is running at the rated clockspeed... and my training has always suggested
that you "fix the obvious problems and see if the more pernicious ones
'just go away,' because often they do."
If this hasn't convinced you that temperature is not an issue, let me
know. I'd love to hear your thoughts. Nobody is selling Athlon XP 2000+s
anymore...
Can I run a processor with a 333MHz FSB on a motherboard with a 266 MHz
FSB?
-Brandon
More information about the wplug
mailing list