* [gentoo-amd64] System crashes when idle @ 2007-05-29 6:59 Michael Ulm 2007-05-29 7:42 ` Peter Humphrey 2007-06-08 6:34 ` [gentoo-amd64] System crashes when idle [SOLUTION(?)] Michael Ulm 0 siblings, 2 replies; 9+ messages in thread From: Michael Ulm @ 2007-05-29 6:59 UTC (permalink / raw To: gentoo-amd64 Hi, I seem to get system crashes on my machine when it is idle for some time. Here is a typical crashing session: I leave the computer on idle for an hour or so. When I return, one or more applications have crashed. dmesg will show a segfault on some apparently random location, and the system will remain highly unstable. Usually, even halting the system will not be possible. The next reboot will produce several errors on the hard disk, which usually need manual intervention. After some repairing, the system then runs stable again. Can this be a software problem? I tried to disable all sleep/hibernate functionality in KDE, but maybe I'm missing something there. Maybe it's the hardware, but what kind of hardware issue could cause these symptoms? Any ideas anyone? Thanks, Michael -- Michael Ulm R&D Team ISIS Information Systems Austria tel: +43 2236 27551-542, fax: +43 2236 21081 e-mail: michael.ulm@isis-papyrus.com Visit our Website: www.isis-papyrus.com --------------------------------------------------------------- This e-mail is only intended for the recipient and not legally binding. Unauthorised use, publication, reproduction or disclosure of the content of this e-mail is not permitted. This email has been checked for known viruses, but ISIS accepts no responsibility for malicious or inappropriate content. --------------------------------------------------------------- -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-amd64] System crashes when idle 2007-05-29 6:59 [gentoo-amd64] System crashes when idle Michael Ulm @ 2007-05-29 7:42 ` Peter Humphrey 2007-05-29 9:02 ` Michael Ulm ` (2 more replies) 2007-06-08 6:34 ` [gentoo-amd64] System crashes when idle [SOLUTION(?)] Michael Ulm 1 sibling, 3 replies; 9+ messages in thread From: Peter Humphrey @ 2007-05-29 7:42 UTC (permalink / raw To: gentoo-amd64 On Tuesday 29 May 2007 07:59:42 Michael Ulm wrote: > Hi, > > I seem to get system crashes on my machine when it is idle for > some time. Here is a typical crashing session: [...] > Maybe it's the hardware, but what kind of hardware issue could cause > these symptoms? > > Any ideas anyone? A couple. Either RAM or hard disk would be my first suspect. You can check the RAM best by compiling any large package, such as gcc itself. Memtest and its like can only find the most obvious faults and in my opinion aren't worth their salt - if they can find a fault, it will already be obvious to you. Secondly, with the advent of SATA I'm suspecting a new class of fault in hard disks. Not only do we have the traditional transfer errors, but now I suspect it's possible for some faults to allow voltage spikes to be imposed on the line driver and cause random errors in other subsystems. I may be wrong, but I've had one experience of this already, and I'm currently working to eliminate another disk as the cause of weird problems I'm having. Oh, and there's always the power supply. If that goes wonky it can cause all manner of problems. You could well suspect it if problems don't occur until after the system's been running for quite a while. -- Rgds Peter Humphrey Linux Counter 5290, Aug 93 -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-amd64] System crashes when idle 2007-05-29 7:42 ` Peter Humphrey @ 2007-05-29 9:02 ` Michael Ulm 2007-05-29 16:10 ` Dustin J. Mitchell 2007-05-29 22:37 ` Joshua Hoblitt 2 siblings, 0 replies; 9+ messages in thread From: Michael Ulm @ 2007-05-29 9:02 UTC (permalink / raw To: gentoo-amd64 Peter Humphrey wrote: > On Tuesday 29 May 2007 07:59:42 Michael Ulm wrote: >> Hi, >> >> I seem to get system crashes on my machine when it is idle for >> some time. Here is a typical crashing session: > [...] >> Maybe it's the hardware, but what kind of hardware issue could cause >> these symptoms? >> >> Any ideas anyone? > > A couple. Either RAM or hard disk would be my first suspect. You can check > the RAM best by compiling any large package, such as gcc itself. Memtest > and its like can only find the most obvious faults and in my opinion aren't > worth their salt - if they can find a fault, it will already be obvious to > you. > > Secondly, with the advent of SATA I'm suspecting a new class of fault in > hard disks. Not only do we have the traditional transfer errors, but now I > suspect it's possible for some faults to allow voltage spikes to be imposed > on the line driver and cause random errors in other subsystems. I may be > wrong, but I've had one experience of this already, and I'm currently > working to eliminate another disk as the cause of weird problems I'm > having. > > Oh, and there's always the power supply. If that goes wonky it can cause all > manner of problems. You could well suspect it if problems don't occur until > after the system's been running for quite a while. > Thank you for your suggestions. I have had no problems so far compiling my Gentoo packages, and I often stressed my system with lengthy computations without any issues. So I tend to think the RAM is good. I thought that power supply issues would tend to occur under stress, not under rest. But you are right of course, the problems may occur after some delay. Unfortunately I've no idea how to test this. I'll check the disk to see if something comes up there. I also plan to install Ubuntu on some separate disk and see if the problem occurs there as well. If these are indeed hardware related problems, I feel like abusing this group (which is software related after all) with my problems. So, I'd also appreciate suggestions which online community would be better suited for help with debugging hardware issues. Thanks, Michael -- Michael Ulm R&D Team ISIS Information Systems Austria tel: +43 2236 27551-542, fax: +43 2236 21081 e-mail: michael.ulm@isis-papyrus.com Visit our Website: www.isis-papyrus.com --------------------------------------------------------------- This e-mail is only intended for the recipient and not legally binding. Unauthorised use, publication, reproduction or disclosure of the content of this e-mail is not permitted. This email has been checked for known viruses, but ISIS accepts no responsibility for malicious or inappropriate content. --------------------------------------------------------------- -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-amd64] System crashes when idle 2007-05-29 7:42 ` Peter Humphrey 2007-05-29 9:02 ` Michael Ulm @ 2007-05-29 16:10 ` Dustin J. Mitchell 2007-05-29 22:37 ` Joshua Hoblitt 2 siblings, 0 replies; 9+ messages in thread From: Dustin J. Mitchell @ 2007-05-29 16:10 UTC (permalink / raw To: gentoo-amd64 On Tue, May 29, 2007 at 08:42:10AM +0100, Peter Humphrey wrote: > A couple. Either RAM or hard disk would be my first suspect. You can check > the RAM best by compiling any large package, such as gcc itself. Memtest > and its like can only find the most obvious faults and in my opinion aren't > worth their salt - if they can find a fault, it will already be obvious to > you. FWIW, their value is in finding that the fault is with RAM, not with anything else. Compiling isn't going to narrow down the problem beyond "yep, the computer's broken" :) That said, that this happens when the system is quiet suggests it's not the HD. Dustin -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-amd64] System crashes when idle 2007-05-29 7:42 ` Peter Humphrey 2007-05-29 9:02 ` Michael Ulm 2007-05-29 16:10 ` Dustin J. Mitchell @ 2007-05-29 22:37 ` Joshua Hoblitt 2 siblings, 0 replies; 9+ messages in thread From: Joshua Hoblitt @ 2007-05-29 22:37 UTC (permalink / raw To: gentoo-amd64 [-- Attachment #1: Type: text/plain, Size: 822 bytes --] On Tue, May 29, 2007 at 08:42:10AM +0100, Peter Humphrey wrote: > A couple. Either RAM or hard disk would be my first suspect. You can check > the RAM best by compiling any large package, such as gcc itself. Memtest > and its like can only find the most obvious faults and in my opinion aren't > worth their salt - if they can find a fault, it will already be obvious to > you. I've also found GCC to be a fairly effective system torture test. Late last year I had an issue where compiling GCC was the only relibable way to reproduce an issue what we were seeing. The symptoms were random system hangs and believe it or not occasional segfaults. It turned out be bad L2 cache on the CPU. Repeatedly emerging gcc eventually generated a MCE log message allowing us to diagnose the problem. -J -- [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-amd64] System crashes when idle [SOLUTION(?)] 2007-05-29 6:59 [gentoo-amd64] System crashes when idle Michael Ulm 2007-05-29 7:42 ` Peter Humphrey @ 2007-06-08 6:34 ` Michael Ulm 2007-06-08 9:19 ` Sebastian Redl 1 sibling, 1 reply; 9+ messages in thread From: Michael Ulm @ 2007-06-08 6:34 UTC (permalink / raw To: gentoo-amd64 Michael Ulm wrote: > Hi, > > I seem to get system crashes on my machine when it is idle for > some time. Here is a typical crashing session: > > I leave the computer on idle for an hour or so. When I return, > one or more applications have crashed. dmesg will show a segfault > on some apparently random location, and the system will remain > highly unstable. Usually, even halting the system will not be > possible. The next reboot will produce several errors on the hard > disk, which usually need manual intervention. After some repairing, > the system then runs stable again. > > Can this be a software problem? I tried to disable all sleep/hibernate > functionality in KDE, but maybe I'm missing something there. > > Maybe it's the hardware, but what kind of hardware issue could cause > these symptoms? I installed Ubuntu on a second disk and it didn't crash. After a lengthy process of comparing setups, I think I found the problem. In xorg.conf, in the monitor section, I was missing the line Options="DPMS" I'll still have to research how leaving out this option could lead to crashes, so I appreciate any insight this group may give. Regards, Michael -- Michael Ulm R&D Team ISIS Information Systems Austria tel: +43 2236 27551-542, fax: +43 2236 21081 e-mail: michael.ulm@isis-papyrus.com Visit our Website: www.isis-papyrus.com --------------------------------------------------------------- This e-mail is only intended for the recipient and not legally binding. Unauthorised use, publication, reproduction or disclosure of the content of this e-mail is not permitted. This email has been checked for known viruses, but ISIS accepts no responsibility for malicious or inappropriate content. --------------------------------------------------------------- -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-amd64] System crashes when idle [SOLUTION(?)] 2007-06-08 6:34 ` [gentoo-amd64] System crashes when idle [SOLUTION(?)] Michael Ulm @ 2007-06-08 9:19 ` Sebastian Redl 2007-06-08 9:55 ` Peter Humphrey 0 siblings, 1 reply; 9+ messages in thread From: Sebastian Redl @ 2007-06-08 9:19 UTC (permalink / raw To: gentoo-amd64 Michael Ulm wrote: > I installed Ubuntu on a second disk and it didn't crash. After a lengthy > process of comparing setups, I think I found the problem. > > In xorg.conf, in the monitor section, I was missing the line > Options="DPMS" > > I'll still have to research how leaving out this option could lead to > crashes, so I appreciate any insight this group may give. http://en.wikipedia.org/wiki/VESA_Display_Power_Management_Signaling Since practically every monitor supports this, and since it's practically always enabled, it may be that other code paths of X.org are insufficiently tested, causing the server to crash when it wants to put the monitor into a power saver mode. That's my best guess. Sebastian Redl -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-amd64] System crashes when idle [SOLUTION(?)] 2007-06-08 9:19 ` Sebastian Redl @ 2007-06-08 9:55 ` Peter Humphrey 2007-06-08 12:06 ` Dustin C. Hatch 0 siblings, 1 reply; 9+ messages in thread From: Peter Humphrey @ 2007-06-08 9:55 UTC (permalink / raw To: gentoo-amd64 On Friday 08 June 2007 10:19:08 Sebastian Redl wrote: > Michael Ulm wrote: > > In xorg.conf, in the monitor section, I was missing the line > > Options="DPMS" > > > > I'll still have to research how leaving out this option could lead to > > crashes, so I appreciate any insight this group may give. > > Since practically every monitor supports this, and since it's > practically always enabled, it may be that other code paths of X.org are > insufficiently tested, causing the server to crash when it wants to put > the monitor into a power saver mode. That's my best guess. My experience doesn't match this. I've had to remove the DPMS option from my xorg.conf to prevent lockups. Those only happened if I left the KDM login screen up until the screen blanker cut in, but then the only way out was a hard reset. Apart from wanting my machine to do only what I tell it to, and I'm quite capable of switching my monitor off when I'm not using it, I think it's dangerous for any system to make assumptions about what facilities will be required in any particular case. My hardware is an nVidia GeForce 7300 GS driving an Iiyama AU4831D screen, and the X nv driver comes from x11-base/xorg-x11-7.2. Or sometimes I play with the nVidia closed-source drivers, but as far as I know that doesn't affect the lockups. -- Rgds Peter Humphrey Linux Counter 5290, Aug 93 -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-amd64] System crashes when idle [SOLUTION(?)] 2007-06-08 9:55 ` Peter Humphrey @ 2007-06-08 12:06 ` Dustin C. Hatch 0 siblings, 0 replies; 9+ messages in thread From: Dustin C. Hatch @ 2007-06-08 12:06 UTC (permalink / raw To: gentoo-amd64 [-- Attachment #1: Type: text/plain, Size: 2550 bytes --] I have been experiencing this problem for quite some time now, but I have found that when I use gentoo-sources-2.6.20-r4 instead of other kernels, the problem seems to go away. Last night, I was trying 2.6.21-r2 to see if I still experienced the problem. I had started a lengthy copy process before going to bed. When I woke this morning, the computer was in fact locked up. Like always, the cursor still moved around the screen when I moved the mouse, but button clicks and keyboard keystrokes had no effect. Again, I had to use the Magic SysRq sequence to reboot the machine. When it had rebooted, back to 2.6.20-r4, I restarted rsync to finish the copy process. To my surprise, the copy was already finished. I should point out that every time I experience this, I try unsuccessfully to ssh into my computer. The computer does not respond to any network traffic I initiate, SSH, ICMP, HTTP, etc. I have an nVidia GeForce 6600 running the proprietary "nvidia" driver. I h ave no DPMS option in my xorg.conf, and xscreensaver is running, though set to never display a screen saver. Dustin C. Hatch http://www.dchweb.com Peter Humphrey wrote: > On Friday 08 June 2007 10:19:08 Sebastian Redl wrote: > >> Michael Ulm wrote: >> >>> In xorg.conf, in the monitor section, I was missing the line >>> Options="DPMS" >>> >>> I'll still have to research how leaving out this option could lead to >>> crashes, so I appreciate any insight this group may give. >>> >> Since practically every monitor supports this, and since it's >> practically always enabled, it may be that other code paths of X.org are >> insufficiently tested, causing the server to crash when it wants to put >> the monitor into a power saver mode. That's my best guess. >> > > My experience doesn't match this. I've had to remove the DPMS option from my > xorg.conf to prevent lockups. Those only happened if I left the KDM login > screen up until the screen blanker cut in, but then the only way out was a > hard reset. > > Apart from wanting my machine to do only what I tell it to, and I'm quite > capable of switching my monitor off when I'm not using it, I think it's > dangerous for any system to make assumptions about what facilities will be > required in any particular case. > > My hardware is an nVidia GeForce 7300 GS driving an Iiyama AU4831D screen, > and the X nv driver comes from x11-base/xorg-x11-7.2. Or sometimes I play > with the nVidia closed-source drivers, but as far as I know that doesn't > affect the lockups. > > [-- Attachment #2: Type: text/html, Size: 3219 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2007-06-08 12:09 UTC | newest] Thread overview: 9+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2007-05-29 6:59 [gentoo-amd64] System crashes when idle Michael Ulm 2007-05-29 7:42 ` Peter Humphrey 2007-05-29 9:02 ` Michael Ulm 2007-05-29 16:10 ` Dustin J. Mitchell 2007-05-29 22:37 ` Joshua Hoblitt 2007-06-08 6:34 ` [gentoo-amd64] System crashes when idle [SOLUTION(?)] Michael Ulm 2007-06-08 9:19 ` Sebastian Redl 2007-06-08 9:55 ` Peter Humphrey 2007-06-08 12:06 ` Dustin C. Hatch
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox