Date: Fri, 25 Feb 2011 09:14:08 -0800
Subject: Re: [gentoo-user] Random reboots. Where to start?
From: Mark Knecht
To: gentoo-user@lists.gentoo.org
Cc: Dale

On Fri, Feb 25, 2011 at 7:33 AM, Dale wrote:
> Well, I think my machine is possessed or something. I'm getting random
> reboots here. When it does this, it is like hitting the reset button. It
> is sitting on the grub screen when it does this. I noticed the first time
> the other day and this was before adding the extra memory. I seemed to be
> stable at 4Gbs but I seem to be rebooting at random. I ran memtest
> yesterday, it checked fine. It didn't find a error but it looked like it
> was only testing part of it. Memtest recognizes all 16Gbs on the last run
> but it didn't seem to be testing it all. Is there a trick to getting it to
> test the whole thing?
>
> This is the last few lines from messages before the reboot:
>
> Feb 25 05:10:01 localhost cron[5697]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 05:14:47 localhost smartd[3902]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 113 to 112
> Feb 25 05:14:47 localhost smartd[3902]: Device: /dev/sdc [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 80 to 78
> Feb 25 05:14:47 localhost smartd[3902]: Device: /dev/sdc [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 75 to 74
> Feb 25 05:20:01 localhost cron[5850]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 05:30:01 localhost cron[5994]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 05:40:01 localhost cron[6136]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 05:41:49 localhost uptimed: moving up to position 20: 0 days, 01:27:23
> Feb 25 05:44:47 localhost smartd[3902]: Device: /dev/sdc [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 78 to 77
> Feb 25 05:50:01 localhost cron[6284]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 05:59:01 localhost cron[6413]: (root) CMD (rm -f /var/spool/cron/lastrun/cron.hourly)
> Feb 25 06:00:01 localhost cron[6429]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 06:10:01 localhost cron[6573]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 06:14:47 localhost smartd[3902]: Device: /dev/sdc [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 77 to 76
> Feb 25 06:20:01 localhost cron[6722]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 06:30:01 localhost cron[6865]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 06:40:01 localhost cron[7008]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 06:50:01 localhost cron[7156]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 06:59:01 localhost cron[7286]: (root) CMD (rm -f /var/spool/cron/lastrun/cron.hourly)
> Feb 25 07:00:01 localhost cron[7301]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 07:10:01 localhost cron[7444]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 07:20:01 localhost cron[7592]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 07:30:01 localhost cron[7741]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 07:40:01 localhost cron[7884]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
> Feb 25 07:42:49 localhost uptimed: moving up to position 19: 0 days, 03:28:23
> Feb 25 07:50:01 localhost cron[8032]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
>
> I don't see anything out of the norm, do you? What else should I check? I
> have a Gigabyte mobo, anything in the BIOS I should check? After I added
> the last two sticks of ram, I loaded the optimized settings. No
> overclocking or anything here.
>
> It does this while logged into KDE and after running a while. I have shut
> down folding and the CPU is running below 85F and all the fans are running
> fine. I don't think this could be a heat issue. It's a Cooler Master HAF
> 932 case with lots of cooling.
>
> I'm going to reboot and let memtest run a while and see exactly what it was
> that makes me think it is not testing ALL the memory.
>
> Thanks.
>
> Dale
>
> :-) :-)

Is folding pretty CPU intensive? If it is, then possibly shut it off
completely until you find the root cause. Additional CPU heating can cause
higher temps all through the machine. If you have, say, a broken trace
somewhere that only comes apart when the motherboard heats up, that extra
heat could be what triggers the reboots.

The order I walk through this sort of problem is:

1) Google, Google, Google for your exact hardware, looking for similar
   problems (and hopefully solutions...). The main culprits are generally:
   - Motherboard
   - Power supply
   - VGA

2) Unlikely if this is your new machine, but use some canned air to blow
   out all heat sinks if they have collected dust.

3) Remove _ALL_ adapter cards and any external devices that you don't
   absolutely need for testing. Run for a number of hours or days.

If you are still rebooting, then consider changing your power supply
first. What sort of supply are you using now? Does it have _more_ than
enough power for your machine?

I hope you find it soon. This can be very frustrating. (From experience...)

Good luck,
Mark
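
For anyone who wants to sift a longer stretch of the log than the excerpt quoted above, here is a rough Python sketch. It assumes syslog lines shaped exactly like the ones Dale pasted; the function names (smart_changes, longest_gap) and the regular expressions are made up for illustration and are not part of smartd or any other tool. It pulls out the smartd attribute changes and finds the longest silence between entries, which after a hard reset is usually where the last line before the reboot sits. As far as I know, smartd reports normalized attribute values by default rather than raw degrees, so treat the numbers as trends only.

```python
import re
from datetime import datetime

# Patterns are guesses based only on the log excerpt quoted in this thread.
SMART_RE = re.compile(
    r"^\w{3}\s+\d+\s+\d\d:\d\d:\d\d\s+\S+\s+smartd\[\d+\]:\s+"
    r"Device:\s+(?P<dev>\S+).*?Attribute:\s+(?P<id>\d+)\s+(?P<name>\S+)\s+"
    r"changed from\s+(?P<old>\d+)\s+to\s+(?P<new>\d+)"
)
STAMP_RE = re.compile(r"^(\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s")


def smart_changes(lines):
    """Return (device, attribute id, name, old, new) for each smartd change line."""
    found = []
    for line in lines:
        m = SMART_RE.search(line)
        if m:
            found.append(
                (m["dev"], int(m["id"]), m["name"], int(m["old"]), int(m["new"]))
            )
    return found


def longest_gap(lines, year=2011):
    """Return (seconds, line before the gap) for the longest silence in the log.

    Syslog timestamps carry no year, so one is supplied by the caller.
    Month names are parsed with %b, which assumes an English locale.
    """
    best = (0.0, None)
    prev_time = prev_line = None
    for line in lines:
        m = STAMP_RE.match(line)
        if not m:
            continue  # skip lines that do not start with a timestamp
        now = datetime.strptime(f"{year} {m.group(1)}", "%Y %b %d %H:%M:%S")
        if prev_time is not None:
            gap = (now - prev_time).total_seconds()
            if gap > best[0]:
                best = (gap, prev_line)
        prev_time, prev_line = now, line
    return best


if __name__ == "__main__":
    sample = [
        "Feb 25 05:14:47 localhost smartd[3902]: Device: /dev/sdb [SAT], "
        "SMART Usage Attribute: 194 Temperature_Celsius changed from 113 to 112",
        "Feb 25 07:42:49 localhost uptimed: moving up to position 19: 0 days, 03:28:23",
        "Feb 25 07:50:01 localhost cron[8032]: (root) CMD (test -x "
        "/usr/sbin/run-crons && /usr/sbin/run-crons )",
    ]
    for change in smart_changes(sample):
        print(change)
    print(longest_gap(sample))
```

To use it on a real log, read the file into a list of lines and pass that list to both functions. A gap much longer than the ten-minute cron interval, followed by kernel boot messages, marks the reset; the line returned with it is the last thing the machine managed to log.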