public inbox for gentoo-amd64@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-amd64] Gentoo crashing?
@ 2007-05-14  5:15 Peter Davoust
  2007-05-14  5:25 ` Dustin C. Hatch
                   ` (2 more replies)
  0 siblings, 3 replies; 53+ messages in thread
From: Peter Davoust @ 2007-05-14  5:15 UTC (permalink / raw
  To: gentoo-amd64

[-- Attachment #1: Type: text/plain, Size: 1072 bytes --]

Ok, I just conquered my gdm issues in a previous thread, and now I'm finding
that gentoo is crashing on me! Twice so far, gentoo has crashed at fairly
inappropriate times. I've used gentoo before, and not once has it crashed
meaninglessly. First, I think I was listing a directory, and it totally
froze, and then the second time I was unzipping a bzipped archive of the
latest kernel sources and it came to a dead halt. I had to hard reboot the
machine, it wasn't fun. I'm starting to get worried, especially since if it
can't handle a little tar.bz2 file, then it certainly can't emerge anything.
I'm sure you'll be wanting some logs, and I'll get them to you next time I
boot gentoo. I've got 2 gig of RAM, and a dual core processor, so those
aren't the problem, and my hard drive has plenty of free space (talking gigs
here), so that's not it either. It sounds like it's something really obvious
that I'm overlooking, but I don't understand how it could just stop. Even
the normal clicking associated with the processor "thinking" just halts.
Isn't that weird?

-Peter

[-- Attachment #2: Type: text/html, Size: 1159 bytes --]

^ permalink raw reply	[flat|nested] 53+ messages in thread
* Re: [gentoo-amd64] Gentoo crashing?
@ 2007-05-15 12:41 Peter Hoff
  2007-05-15 12:57 ` Peter Davoust
  2007-05-15 13:19 ` Bernhard Auzinger
  0 siblings, 2 replies; 53+ messages in thread
From: Peter Hoff @ 2007-05-15 12:41 UTC (permalink / raw
  To: gentoo-amd64

[-- Attachment #1: Type: text/plain, Size: 1724 bytes --]

I can't even get 2.6.21 to compile. I was getting unhappy about that, but perhaps I've been looking at it the wrong way?


----- Original Message ----
From: Dustin C. Hatch <admiralnemo@dchweb.com>
To: gentoo-amd64@lists.gentoo.org
Sent: Sunday, May 13, 2007 10:25:46 PM
Subject: Re: [gentoo-amd64] Gentoo crashing?




  
  

I have been having that problem for quite some
time now.  It was really bad when I upgraded to 2.6.21.  I downgraded
back to 2.6.20-r4 and I haven't had near the problems.  I am not sure
what it is, but you aren't the only one with that problem.



Peter Davoust wrote:
Ok, I just conquered my gdm issues in a previous thread,
and now I'm finding that gentoo is crashing on me! Twice so far, gentoo
has crashed at fairly inappropriate times. I've used gentoo before, and
not once has it crashed meaninglessly. First, I think I was listing a
directory, and it totally froze, and then the second time I was
unzipping a bzipped archive of the latest kernel sources and it came to
a dead halt. I had to hard reboot the machine, it wasn't fun. I'm
starting to get worried, especially since if it can't handle a little
tar.bz2 file, then it certainly can't emerge anything. I'm sure you'll
be wanting some logs, and I'll get them to you next time I boot gentoo.
I've got 2 gig of RAM, and a dual core processor, so those aren't the
problem, and my hard drive has plenty of free space (talking gigs
here), so that's not it either. It sounds like it's something really
obvious that I'm overlooking, but I don't understand how it could just
stop. Even the normal clicking associated with the processor "thinking"
just halts. Isn't that weird? 

  

-Peter




-- 
Dustin C. Hatch
http://www.dchweb.com





[-- Attachment #2: Type: text/html, Size: 2366 bytes --]

^ permalink raw reply	[flat|nested] 53+ messages in thread
* Re: [gentoo-amd64] Gentoo crashing?
@ 2007-05-15 12:58 Peter Hoff
  0 siblings, 0 replies; 53+ messages in thread
From: Peter Hoff @ 2007-05-15 12:58 UTC (permalink / raw
  To: gentoo-amd64

[-- Attachment #1: Type: text/plain, Size: 2953 bytes --]

Just because Memtest86 went 30% without failure does not mean your RAM is 30% OK. It should run for at least a full loop, and I typically run it for at least 2 loops (though only once have I seen a system suddenly start spewing errors it didn't have during the first loop, and I've run it on a lot of systems). It does take a while, so you probably don't want to sit around watching it. I usually let it run overnight while I sleep.


----- Original Message ----
From: Peter Davoust <worldgnat@gmail.com>
To: gentoo-amd64@lists.gentoo.org
Sent: Monday, May 14, 2007 2:08:42 PM
Subject: Re: [gentoo-amd64] Gentoo crashing?

Ok, so I compiled a new kernel, and it seemed to work. I booted the new kernel, I was able to unzip and install dhcpcd, ndiswrapper, wireless-tools, and cab extract. Then, once the wireless was working, I tried emerge --search dhcpcd, because gentoo apparently doesn't like my manually configured dhcpcd.... CRASH! I ran memtest86+, as suggested, and it got to at least 30% without a failure. I'll try it again, but at least 30% of my memory is in tact. I'll try to emerge some other things, and see how it goes. 


-Peter

On 5/14/07, dustin@v.igoro.us <dustin@v.igoro.us> wrote:
On Mon, May 14, 2007 at 02:04:30AM -0400, Peter Davoust wrote:
>    Ok, first, while I appreciate your advice, this is a brand new laptop

>    and there's no way I'm running bonnie++ (that's prime95, right?), or
>    anything with the words "cpu" and "burn" in the same sentence on this
>    thing. Memtest86 might be an option as long as it has no potential to

>    kill anything. I agree, it could be the heat, and that was the first
>    thing that came to my mind, but Vista boots and runs for long periods
>    of time with no issues. I'll check it out with the new kernel in the

>    morning and see what it does.

Any new laptop should have the hardware smarts not to smoke itself, or
something really is broken.  It may shut down "unexpectedly" (which I
also consider a design bug), but actually causing damage is unlikely.


That said, this really sounds like a RAM problem, so I would run
memtest86 first.  Memtest86 has zero chance of smoking any system that
has passed a factory QA check.

I had a Gentoo system (a server) that pretty much ran (to be honest, it

was a heavily used database server that stayed up for a good 3 months in
this state).  However, its clock was skewed something like 10m/hour (I
now think this was due to lost ticks during processing of memory

faults).

I tried all the various kernel flags, largemem, etc., only to find out
that the problem was (as others on this thread have posted) incompatible
RAM.  I point this out only to say that bad RAM can cause *very* unusual

problems (not just the segfaults you'd expect), and to say that lots of
complex operations (like Vista, for example) can continue to run just
fine in such a broken environment.

Dustin
--

gentoo-amd64@gentoo.org mailing list








[-- Attachment #2: Type: text/html, Size: 4127 bytes --]

^ permalink raw reply	[flat|nested] 53+ messages in thread
* Re: [gentoo-amd64] Gentoo crashing?
@ 2007-05-15 13:24 Peter Hoff
  2007-05-15 15:48 ` Peter Davoust
  0 siblings, 1 reply; 53+ messages in thread
From: Peter Hoff @ 2007-05-15 13:24 UTC (permalink / raw
  To: gentoo-amd64

[-- Attachment #1: Type: text/plain, Size: 2593 bytes --]



----- Original Message ----
From: Peter Davoust <worldgnat@gmail.com>
To: gentoo-amd64@lists.gentoo.org
Sent: Monday, May 14, 2007 7:11:20 PM
Subject: Re: [gentoo-amd64] Gentoo crashing?

I know it doesn't actually burn the cpu, but I'd rather not cook any components if I don't have to. From what I know of torture tests, they run the cpu so hot it starts making computational errors, am I right? It still makes me nervous. I was hoping to be able to fix the issue just by recompiling my kernel, but no such luck. I'll mess with it some more and see what I can do. Can you give me any advice as to what I should to to a) not violate my warrantee and b) avoid killing my computer as much as possible? Could it just be something with my Gentoo install? I guess that's a stupid question; I've had this problem on an older computer, but it was a Desktop and it was much easier to swap components without messing up my warrantee. So if it were a hardware problem, wouldn't you think that suse 
10.2 would have run into it as well? I used to run 10.2 (used to as in 3 days ago) for hours on end without any problems at all. I agree that Gentoo can run the computer harder, but that doesn't quite click. 

-Peter




You're being silly. Software torture tests are not going to kill your hardware. Just run them and see what you get. Memtest will give you the address where the error occured, and I've always been able to determine which stick was bad from that, using a little deductive reasoning (I usually verify by testing the sticks alone, but so far I've not been wrong).

As for voiding your warranty, memory and the hard drive are typically considered user-servicable parts. In fact, most of the time if either of those are the problem they'll just send you the parts and you'll have to replace them yourself anyway.

More on torturing hardware: really, the only component that's at all vulnerable to this is the hard drive, simply because it's a mechanical device, but it will take an absurdly long time to do any actual damage. I used to test hard drives for video servers (think Tivo, but starting at $100k). We tried a wide variety of drive testing suites, but it turned out none of them ran the drives harder than our normal application. A surprising number of the oldest version of our product are still running, on the original drives, after over 10 years, in situations that are very demanding (like serving multiple channels for DirecTV). So, really, stop being so paranoid about software torture tests. It is a complete myth that you can ruin your hardware by running them.







[-- Attachment #2: Type: text/html, Size: 3061 bytes --]

^ permalink raw reply	[flat|nested] 53+ messages in thread
* Re: [gentoo-amd64] Gentoo crashing?
@ 2007-05-15 19:22 Peter Hoff
  0 siblings, 0 replies; 53+ messages in thread
From: Peter Hoff @ 2007-05-15 19:22 UTC (permalink / raw
  To: gentoo-amd64

[-- Attachment #1: Type: text/plain, Size: 6129 bytes --]

Yes, thinking that running a software based test is going to damage your hardware is silly. Maybe 20 years ago that was possible, but on modern hardware the very idea is absurd. You really should run all of the tests recommended earlier in this thread.

I don't think you understand what I mean by a "loop" in Memtest. Unless you have run all of the tests at least once, you have not done a loop (there are 8, IIRC). The amount of time you ran it for is irrelevant, as that's totally system dependent. I've had systems with 32M take 3 hours to complete a loop, and I've had systems with 1G do it in less than 45 minutes. Processor type and speed, RAM type and speed, memory controller, system bus... all of these are factors in how long it takes to complete a Memtest loop. BTW, Memtest tells you how many loops it has completed, and it automatically starts over on the first test once it completes the last one.

As for not wanting to put your fingers on the hardware, that's justifiable, especially if you aren't prepared to protect your equipment from electrostatic discharge. But, warranty questions can't be answered here, you'd have to talk to the vendor you got it from. I'm just telling you what my experience has been. Dell, for example, really doesn't want to pay someone to swap parts for you, and will do everything they can to get you to do it yourself. If you're really that worried about it, though, you could take it to the local computer repair shop. Most of the good ones can handle Linux these days.

I am also a student, and I have never had $100k to spend on a system, I was just trying to give you an idea of the level of experience I have with hardware testing. Those are systems that I used to test and repair, not systems that I owned.


----- Original Message ----
From: Peter Davoust <worldgnat@gmail.com>
To: gentoo-amd64@lists.gentoo.org
Sent: Tuesday, May 15, 2007 8:48:41 AM
Subject: Re: [gentoo-amd64] Gentoo crashing?


Now wait a minute, not everyone has $100k to spend on a brand new
laptop. I'm a student, and I have a single computer to last me through
two years of highschool and and at least a few years of college, and
there's no way I'm going to screw up my computer without some
insurance, ok? Before I run anything on this machine, I'm going to
make sure that I'm still under warrantee, whether the parts are user
servicable or not. Now if you call that being silly, then that's your
choice, but it's my choice if I want to be cautious, even overly so.

On that note, I did buck up and run memtest86+ from a Ubuntu livecd,
and after several loops (about 1h 30 min of straight testing) I didn't
get a single error. It was on Test #6 when I stopped, so I think the
memory's chill. Besides, as I said before, when I run anything GUI
(enlightement, right now), it's fine. I just have to jump in and out
of terminal really quickly. The fact that it likes to crash after
starting x server twice makes me think I might have a few damaged
portions on my harddrive. Does that sound about right? Of course, that
sounds like it could be a kernel issue too. If I can figure out how to
"downgrade" my kernel, maybe that will solve it.

I just clicked the "<<plain text" button and the setting has held for
this entire thread. Come to think of it, I may have actually converted
it back to Rich Text a few weeks back.

-Peter

On 5/15/07, Peter Hoff <petehoff@pacbell.net> wrote:
>
>
>
> ----- Original Message ----
> From: Peter Davoust <worldgnat@gmail.com>
> To: gentoo-amd64@lists.gentoo.org
> Sent: Monday, May 14, 2007 7:11:20 PM
> Subject: Re: [gentoo-amd64] Gentoo crashing?
>
> I know it doesn't actually burn the cpu, but I'd rather not cook any
> components if I don't have to. From what I know of torture tests, they run
> the cpu so hot it starts making computational errors, am I right? It still
> makes me nervous. I was hoping to be able to fix the issue just by
> recompiling my kernel, but no such luck. I'll mess with it some more and see
> what I can do. Can you give me any advice as to what I should to to a) not
> violate my warrantee and b) avoid killing my computer as much as possible?
> Could it just be something with my Gentoo install? I guess that's a stupid
> question; I've had this problem on an older computer, but it was a Desktop
> and it was much easier to swap components without messing up my warrantee.
> So if it were a hardware problem, wouldn't you think that suse 10.2 would
> have run into it as well? I used to run 10.2 (used to as in 3 days ago) for
> hours on end without any problems at all. I agree that Gentoo can run the
> computer harder, but that doesn't quite click.
>
> -Peter
>
>
>
> You're being silly. Software torture tests are not going to kill your
> hardware. Just run them and see what you get. Memtest will give you the
> address where the error occured, and I've always been able to determine
> which stick was bad from that, using a little deductive reasoning (I usually
> verify by testing the sticks alone, but so far I've not been wrong).
>
> As for voiding your warranty, memory and the hard drive are typically
> considered user-servicable parts. In fact, most of the time if either of
> those are the problem they'll just send you the parts and you'll have to
> replace them yourself anyway.
>
> More on torturing hardware: really, the only component that's at all
> vulnerable to this is the hard drive, simply because it's a mechanical
> device, but it will take an absurdly long time to do any actual damage. I
> used to test hard drives for video servers (think Tivo, but starting at
> $100k). We tried a wide variety of drive testing suites, but it turned out
> none of them ran the drives harder than our normal application. A surprising
> number of the oldest version of our product are still running, on the
> original drives, after over 10 years, in situations that are very demanding
> (like serving multiple channels for DirecTV). So, really, stop being so
> paranoid about software torture tests. It is a complete myth that you can
> ruin your hardware by running them.
>
>
>
-- 
gentoo-amd64@gentoo.org mailing list

[-- Attachment #2: Type: text/html, Size: 7605 bytes --]

^ permalink raw reply	[flat|nested] 53+ messages in thread

end of thread, other threads:[~2007-05-19 19:41 UTC | newest]

Thread overview: 53+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-05-14  5:15 [gentoo-amd64] Gentoo crashing? Peter Davoust
2007-05-14  5:25 ` Dustin C. Hatch
2007-05-14  5:44   ` Barry.SCHWARTZ
2007-05-14  5:53     ` Peter Davoust
2007-05-14  5:50   ` Naga
2007-05-14  5:56     ` Peter Davoust
2007-05-14  5:53 ` Wil Reichert
2007-05-14  6:04   ` Peter Davoust
2007-05-14 10:50     ` [gentoo-amd64] " Duncan
2007-05-14  9:36       ` Isidore Ducasse
2007-05-14 11:44         ` Sebastian Redl
2007-05-14 13:27         ` Florian Philipp
2007-05-14 13:57         ` Wil Reichert
2007-05-14 14:55         ` Duncan
2007-05-14 16:02     ` [gentoo-amd64] " dustin
2007-05-14 21:08       ` Peter Davoust
2007-05-15  1:41         ` Antoine Martin
2007-05-15 11:06         ` [gentoo-amd64] " Duncan
2007-05-15 12:51           ` Isidore Ducasse
2007-05-15 16:47             ` Duncan
2007-05-15  1:45     ` [gentoo-amd64] " Antoine Martin
2007-05-15  2:11       ` Peter Davoust
2007-05-15  2:17         ` Peter Davoust
2007-05-15  2:44           ` Barry.SCHWARTZ
2007-05-15  3:53             ` Peter Davoust
2007-05-15  4:51               ` Barry.SCHWARTZ
2007-05-15 11:39             ` Antoine Martin
2007-05-15 11:41           ` Antoine Martin
2007-05-15 12:57             ` Peter Davoust
2007-05-15 12:37               ` Isidore Ducasse
2007-05-15 19:36               ` Barry.SCHWARTZ
2007-05-14 11:43 ` Florian D.
  -- strict thread matches above, loose matches on Subject: below --
2007-05-15 12:41 Peter Hoff
2007-05-15 12:57 ` Peter Davoust
2007-05-15 13:19 ` Bernhard Auzinger
2007-05-15 12:58 Peter Hoff
2007-05-15 13:24 Peter Hoff
2007-05-15 15:48 ` Peter Davoust
2007-05-15 16:08   ` Bob Sanders
2007-05-15 18:58     ` Antoine Martin
2007-05-15 22:41       ` Isidore Ducasse
2007-05-15 23:03         ` Hemmann, Volker Armin
2007-05-19 17:35           ` Isidore Ducasse
2007-05-19 19:39             ` Peter Davoust
2007-05-15 23:13         ` david
2007-05-19  4:11           ` Peter Davoust
2007-05-19  4:45             ` Peter Davoust
2007-05-19  7:31               ` Boyd Stephen Smith Jr.
2007-05-19 16:51             ` Hemmann, Volker Armin
2007-05-15 16:11   ` Wil Reichert
2007-05-15 22:08     ` Peter Davoust
2007-05-15 22:22       ` Bob Sanders
2007-05-15 19:22 Peter Hoff

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox