From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from pigeon.gentoo.org ([69.77.167.62] helo=lists.gentoo.org) by finch.gentoo.org with esmtp (Exim 4.60) (envelope-from ) id 1LYrKw-0001J9-D4 for garchives@archives.gentoo.org; Mon, 16 Feb 2009 00:27:50 +0000 Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id D0716E039F; Mon, 16 Feb 2009 00:27:44 +0000 (UTC) Received: from wf-out-1314.google.com (wf-out-1314.google.com [209.85.200.172]) by pigeon.gentoo.org (Postfix) with ESMTP id 91717E039F for ; Mon, 16 Feb 2009 00:27:44 +0000 (UTC) Received: by wf-out-1314.google.com with SMTP id 29so2243586wff.10 for ; Sun, 15 Feb 2009 16:27:43 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from :user-agent:mime-version:to:subject:references:in-reply-to :content-type:content-transfer-encoding; bh=auLjGVJZ6tZp+My+wfoFizPoEgR0r83eP76rBb3bFpY=; b=xMAUjKmKzxCOI51Rp4+8bYuzcZYjkBtMgLF2s415U9dYILH2tDFcwhwJjXcd/iw+s7 96+lBxTMwlTA74gnq6BgQ0uIovgPAwAjB5GS1t1VH2d+ym4uFUt7TIIbbRnIy+nCQ4u1 UqtNJrBG+YgkOuZrNHq4FsSVSyCfwSa7KD61I= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; b=RUDwDQEyoTthTFqWF+kQ/2pbuRqb+UGNuaOpoJc4Irj6CYPYG+p9uieILsqfv6W6Cr AwMCCF2abf8RRRzOrO7IPERwm+kYpWYosLguzK697cs3ozfdHOoJpxl0LTee8s4euQOD C2Ch1V1MagFR7mE1wuiX7kk3NHSp+mFVF/hSs= Received: by 10.142.185.21 with SMTP id i21mr2083175wff.220.1234744062500; Sun, 15 Feb 2009 16:27:42 -0800 (PST) Received: from ?4.230.105.102? (dialup-4.230.105.102.Dial1.Houston1.Level3.net [4.230.105.102]) by mx.google.com with ESMTPS id 24sm8206547wfc.37.2009.02.15.16.27.15 (version=SSLv3 cipher=RC4-MD5); Sun, 15 Feb 2009 16:27:41 -0800 (PST) Message-ID: <4998B2AF.4070509@gmail.com> Date: Sun, 15 Feb 2009 18:26:23 -0600 From: Dale User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.19) Gecko/20081227 SeaMonkey/1.1.14 Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-user@lists.gentoo.org Reply-to: gentoo-user@lists.gentoo.org MIME-Version: 1.0 To: gentoo-user@lists.gentoo.org Subject: Re: [gentoo-user] spontaneous reboots.. what to look for References: <87wsbrs1vv.fsf@newsguy.com> <5bdc1c8b0902151556n4e066b97ob7762fb1918da717@mail.gmail.com> In-Reply-To: <5bdc1c8b0902151556n4e066b97ob7762fb1918da717@mail.gmail.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Archives-Salt: 9ec2e380-e9b5-4f29-9469-8b6fc6f899e8 X-Archives-Hash: f7f67d19f886d51c29dfb93470c27c98 Mark Knecht wrote: > On Sun, Feb 15, 2009 at 3:42 PM, Harry Putnam wrote: > >> I've been experiencing spontaneous reboots on one gentoo machine >> lately. Looking thru /var/log/messages... I see the restarts but >> looking above that... I'm not seeing anything I recognize as being a >> culprit. >> >> Its been happening for a few weeks... but I've been busy and only now >> digging into it ( The machine is no kind of server ). >> >> It appears to only happen in X (I'm using xfce4) and I've only noticed >> it since I started running 2.6.28 kernels. Although I couldn't say >> that it seemed to be directly related. >> >> I mean I didn't boot into 2.6.28 and suddenly notice spontaneous >> rebooting. >> >> It does not appear to be heat realated... but I am only now using >> lm_sensors to keep an accurate record and see if there appears to be a >> relationship. >> >> I've had two today so either its happening more often or I'm just >> spending more time on that machine. >> >> It may also be on the first or second time its happened while I as >> actually right at the keyboard. >> >> I'm sorry to be so vague about it, but in truth, I've been pretty lazy >> about it... since no real harm comes of an unexpected reboot on that >> machine (so far anyway). But clearly something that has to be figured >> out. >> >> The only things I've checked so far... >> 1) browsing thru /var/log/messages (Having trouble recognizing any >> thing that looks suspicious. >> >> I have noticed what appears to be a time/date anomaly where the >> progression of time is suddenly irregular. That is, an earlier >> time shows up amongst some later times. >> >> It appears to have been me sudoing to visudo. And apparently >> having /etc/sudoers open long enough for the closing of it to be >> earlier than other events taking place. >> >> Again ... I'm not real sure exactly what happened there but it >> does not appear to coincide with a reboot anyway. >> >> 2) checking how hot the cpu is getting (Doesn't appear to be a >> problem) But now running a cron job recording temperatures every 10 >> minutes. So that may turn up something. >> >> 3) checking for overfilled disks. (none show in df -h) >> >> > > Reseat memory and PCI cards, etc. Consider removing for a period of > time any hardware not absolutely necessary to debug the problem. (I.e. > - second video card, extra disk drives, extra network adapters, etc.) > Run memtest86 for a few days if you can spare the machine. Run > spinrite, etc., to look for drive problems. Open the box up and place > a fan blowing extra air for additional cooling. > > good luck, > Mark > > > To add another test. I had this issue once before and it was a faulty driver for my hard drives. I ran a command like this to test mine: hdparm -Tt /dev/hda && hdparm -Tt /dev/hda && hdparm -Tt /dev/hda && hdparm -Tt /dev/hda && hdparm -Tt /dev/hda If it can pass that then it should be all right and you can look elsewhere. Mine would only fail when the drives were very busy and that test should do that pretty good. Hope that helps. Dale :-) :-)