public inbox for gentoo-user@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-user] Any way of tracing kernel freezes?
@ 2013-06-13 13:47 Frank Steinmetzger
  2013-06-13 14:05 ` [gentoo-user] " Nikos Chantziaras
  0 siblings, 1 reply; 8+ messages in thread
From: Frank Steinmetzger @ 2013-06-13 13:47 UTC (permalink / raw
  To: gentoo-user

[-- Attachment #1: Type: text/plain, Size: 1872 bytes --]

Hello list,

recently my netbook got the habit of freezing sporadically. It happens
apparently out of the blue. Until now I thought it might have something
to do with audio playback, because I remember always having music
playing when it happens. But just now it happened without music:

One second I was surfing along, the other second -- dead. While I was
already 3 hours in of a 4-hour emerge. :-(

If music is playing at a freeze, the last bit of it (less than a
second's worth) is looped indefinitely. The screen output is still
there, but there is no reaction to any input, not even sysrequests.
I also tried SSHing in without success.

I have no experience with Kernel debugging. Is there any possibility at
all of finding out what the problem is? Perhaps enable some log output?
Being a netbook, there is no serial port (which I don't have the means
to connect to anyway). In the hopes that the problem started with a 3.9
kernel, I am building a 3.8.13 again (using kernel.org archives).


I ran a memtest a few months back and stopped it after 8 successful
passes.  I might try that again as soon as I find out how¹, but I'd
think that a corrupt memory would cause something different than a full
freeze.

I'm running a 64 bit kernel and 32 bit userland with march=atom. I'd
really hate to find out the machine has a hardware problem, it's a very
neat piece of silicon with a great screen. Who cares that the 6.5 Watt
CPU is a tad slow. It's perfect for a console environment. :-)


¹ I have the in-kernel memtest enabled, so I tried booting it with the
  "memtest" option right after the last freeze happened. But all I got
  was a black screen with blinking cursor.
-- 
Gruß | Greetings | Qapla’
Please do not share anything from, with or about me with any Facebook service.

The circle is the parallel to the dot.

[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 8+ messages in thread

* [gentoo-user] Re: Any way of tracing kernel freezes?
  2013-06-13 13:47 [gentoo-user] Any way of tracing kernel freezes? Frank Steinmetzger
@ 2013-06-13 14:05 ` Nikos Chantziaras
  2013-06-13 15:18   ` Neil Bothwick
                     ` (2 more replies)
  0 siblings, 3 replies; 8+ messages in thread
From: Nikos Chantziaras @ 2013-06-13 14:05 UTC (permalink / raw
  To: gentoo-user

On 13/06/13 16:47, Frank Steinmetzger wrote:
> recently my netbook got the habit of freezing sporadically. [...]
>[...]
> I ran a memtest a few months back and stopped it after 8 successful
> passes.  I might try that again as soon as I find out how¹, but I'd
> think that a corrupt memory would cause something different than a full
> freeze.

It usually manifests in segfaults that seem to come at random.  But it's 
still worth a shot.  It's very easy.  Emerge "sys-apps/memtest86+" and 
add this grub entry:

   title=Memtest86+
   root (hd0,0)
   kernel /boot/memtest86plus/memtest.bin

(Adapt the "hd0,0" of course to be the same disk as the one you're using 
to boot your kernel.)

That's it.  Now your boot menu will include a "Memtest86+" option.  This 
is for grub 1 though.  If you migrated to grub 2 by now, then I don't 
know how that boot entry would look like.  I suspect it will be some 
300-line monstrosity or something :-|



^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [gentoo-user] Re: Any way of tracing kernel freezes?
  2013-06-13 14:05 ` [gentoo-user] " Nikos Chantziaras
@ 2013-06-13 15:18   ` Neil Bothwick
  2013-06-13 21:04   ` William Kenworthy
  2013-06-13 21:14   ` Paul Hartman
  2 siblings, 0 replies; 8+ messages in thread
From: Neil Bothwick @ 2013-06-13 15:18 UTC (permalink / raw
  To: gentoo-user

[-- Attachment #1: Type: text/plain, Size: 1366 bytes --]

On Thu, 13 Jun 2013 17:05:15 +0300, Nikos Chantziaras wrote:

> add this grub entry:
> 
>    title=Memtest86+
>    root (hd0,0)
>    kernel /boot/memtest86plus/memtest.bin
> 
> (Adapt the "hd0,0" of course to be the same disk as the one you're
> using to boot your kernel.)
> 
> That's it.  Now your boot menu will include a "Memtest86+" option.
> This is for grub 1 though.  If you migrated to grub 2 by now, then I
> don't know how that boot entry would look like.  I suspect it will be
> some 300-line monstrosity or something :-|

With GRUB2 you don't have to add anything. /etc/grub.d/39_memtest86+ is
created by the ebuild, containing slightly less than 300 lines

#!/bin/sh
exec tail -n +3 $0

menuentry "memtest86+ 4.20" {
        linux16 /boot/memtest86plus/memtest
}

menuentry "memtest86+ 4.20 (netbsd)" {
        insmod bsd
        knetbsd /boot/memtest86plus/memtest.netbsd
}

This is added to your menu automatically by grub2-mkconfig.

This is one of the great things about GRUB2, it allows install scripts,
whether they are package installers or complete distro installers, to add
to your GRUB menu without disturbing, or even needing to know about, your
existing configuration.


-- 
Neil Bothwick

Give a man a fish and you feed him for a day; teach him
to use the Net and he won't bother you for weeks.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [gentoo-user] Re: Any way of tracing kernel freezes?
  2013-06-13 14:05 ` [gentoo-user] " Nikos Chantziaras
  2013-06-13 15:18   ` Neil Bothwick
@ 2013-06-13 21:04   ` William Kenworthy
  2013-06-14 14:42     ` Frank Steinmetzger
  2013-06-13 21:14   ` Paul Hartman
  2 siblings, 1 reply; 8+ messages in thread
From: William Kenworthy @ 2013-06-13 21:04 UTC (permalink / raw
  To: gentoo-user

On 13/06/13 22:05, Nikos Chantziaras wrote:
> On 13/06/13 16:47, Frank Steinmetzger wrote:
>> recently my netbook got the habit of freezing sporadically. [...]
>> [...]
>> I ran a memtest a few months back and stopped it after 8 successful
>> passes.  I might try that again as soon as I find out how¹, but I'd
>> think that a corrupt memory would cause something different than a full
>> freeze.
> 
> It usually manifests in segfaults that seem to come at random.  But it's
> still worth a shot.  It's very easy.  Emerge "sys-apps/memtest86+" and
> add this grub entry:
> 
>   title=Memtest86+
>   root (hd0,0)
>   kernel /boot/memtest86plus/memtest.bin
> 
> (Adapt the "hd0,0" of course to be the same disk as the one you're using
> to boot your kernel.)
> 
> That's it.  Now your boot menu will include a "Memtest86+" option.  This
> is for grub 1 though.  If you migrated to grub 2 by now, then I don't
> know how that boot entry would look like.  I suspect it will be some
> 300-line monstrosity or something :-|
> 
> 

Temperature? - check for dust puppies clogging the heatsinks, cooling etc.

BillK




^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [gentoo-user] Re: Any way of tracing kernel freezes?
  2013-06-13 14:05 ` [gentoo-user] " Nikos Chantziaras
  2013-06-13 15:18   ` Neil Bothwick
  2013-06-13 21:04   ` William Kenworthy
@ 2013-06-13 21:14   ` Paul Hartman
  2 siblings, 0 replies; 8+ messages in thread
From: Paul Hartman @ 2013-06-13 21:14 UTC (permalink / raw
  To: gentoo-user

On Thu, Jun 13, 2013 at 9:05 AM, Nikos Chantziaras <realnc@gmail.com> wrote:
> On 13/06/13 16:47, Frank Steinmetzger wrote:
>>
>> recently my netbook got the habit of freezing sporadically. [...]
>> [...]
>>
>> I ran a memtest a few months back and stopped it after 8 successful
>> passes.  I might try that again as soon as I find out how¹, but I'd
>> think that a corrupt memory would cause something different than a full
>> freeze.
>
>
> It usually manifests in segfaults that seem to come at random.  But it's
> still worth a shot.  It's very easy.  Emerge "sys-apps/memtest86+" and add
> this grub entry:
>
>   title=Memtest86+
>   root (hd0,0)
>   kernel /boot/memtest86plus/memtest.bin
>
> (Adapt the "hd0,0" of course to be the same disk as the one you're using to
> boot your kernel.)
>
> That's it.  Now your boot menu will include a "Memtest86+" option.  This is
> for grub 1 though.  If you migrated to grub 2 by now, then I don't know how
> that boot entry would look like.  I suspect it will be some 300-line
> monstrosity or something :-|

Actually it is -- dare I say it -- even more simple in grub2 :)

menuentry "Memtest86+ 4.20" {
        linux16 /memtest86plus/memtest.bin
}

That's from my grub.cfg... I don't use the grub auto-configuration
tools. I just made a manual grub.cfg like in the grub1 days. It is
quite similar to the old grub syntax, but more can do more stuff.

BTW there is a new version 5.00 of memtest86+ which is on rc1 release
right now, it supports multi-core processors (and tests/uses them),
shows system temperature while testing, is much faster than the old
one, and has a few new tests. There is a link to it on the memtest
forums.


^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [gentoo-user] Re: Any way of tracing kernel freezes?
  2013-06-13 21:04   ` William Kenworthy
@ 2013-06-14 14:42     ` Frank Steinmetzger
  0 siblings, 0 replies; 8+ messages in thread
From: Frank Steinmetzger @ 2013-06-14 14:42 UTC (permalink / raw
  To: gentoo-user

[-- Attachment #1: Type: text/plain, Size: 1419 bytes --]

On Fri, Jun 14, 2013 at 05:04:15AM +0800, William Kenworthy wrote:

> On 13/06/13 22:05, Nikos Chantziaras wrote:

> >> but I'd think that a corrupt memory would cause something different
> >> than a full freeze.
> > 
> > It usually manifests in segfaults that seem to come at random.  But it's
> > still worth a shot.  It's very easy.  Emerge "sys-apps/memtest86+" and
> > add this grub entry:

I found my old USB Gentoo which has memtest installed (and which I used
for my first test also). I let it run for two passes, both successful.
So the RAM seems fine.

> Temperature? - check for dust puppies clogging the heatsinks, cooling etc.

It’s a netbook with a 6.5 Watt CPU. I’m going through the big emerge
again right now (3.5 hours in), and it never goes above 66°. Besides, the
freezes also happened when I didn’t do anything heavy. Just surfing
(though Firefox can also be heave for an Atom ^^).

Anyhoo, I’m running 3.8.13 again now. I didn’t have my config anymore,
so I oldconfig'ed it from 3.9. Let’s see whether it’s more stable. If
yes, hm... I can’t really report this to the kernel devs: “My netbook
freezes since 3.9, that’s all I know. Here, have my configs.” :-I
-- 
Gruß | Greetings | Qapla’
Please do not share anything from, with or about me with any Facebook service.

The problem with FORTRAN jokes is that they never fit into a single li

[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [gentoo-user] Any way of tracing kernel freezes?
@ 2013-06-15  5:56 Thomas Mueller
  2013-06-20  1:21 ` Frank Steinmetzger
  0 siblings, 1 reply; 8+ messages in thread
From: Thomas Mueller @ 2013-06-15  5:56 UTC (permalink / raw
  To: gentoo-user

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain, Size: 1615 bytes --]

Excerpt from Frank Steinmetzger:

> I found my old USB Gentoo which has memtest installed (and which I used
> for my first test also). I let it run for two passes, both successful.
> So the RAM seems fine.

> > Temperature? - check for dust puppies clogging the heatsinks, cooling etc.

> It's a netbook with a 6.5 Watt CPU. I'm going through the big emerge
> again right now (3.5 hours in), and it never goes above 66°. Besides, the
> freezes also happened when I didn't do anything heavy. Just surfing
> (though Firefox can also be heave for an Atom ^^).

> Anyhoo, I'm running 3.8.13 again now. I didn't have my config anymore,
> so I oldconfig'ed it from 3.9. Let's see whether it's more stable. If
> yes, hm... I can't really report this to the kernel devs: "My netbook
> freezes since 3.9, that's all I know. Here, have my configs." :-I

Is your temperature of 66° F or C? 

System temperature or surrounding room temperature?

I have an old computer whose fan has quit as happened once before.

CPUs generate considerable heat, I see system temperature and realize the fan is much more critical than whether the room temperature is a chilly 20 C or sweaty (for humans) 35 C.

I don't use that old 2001 computer much, am getting ready to put together a new computer from parts to run FreeBSD and Linux, likely Gentoo; otherwise I'd order a Socket A fan.

When I do use that old computer, I open the case and prop a hair dryer to run at low, ambient-temperature air pointed at the CPU.  This keeps the CPU down to 47 C according to the BIOS/CMOS screen.  This is cool for the CPU if not for us humans.


Tom



^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [gentoo-user] Any way of tracing kernel freezes?
  2013-06-15  5:56 [gentoo-user] " Thomas Mueller
@ 2013-06-20  1:21 ` Frank Steinmetzger
  0 siblings, 0 replies; 8+ messages in thread
From: Frank Steinmetzger @ 2013-06-20  1:21 UTC (permalink / raw
  To: gentoo-user

[-- Attachment #1: Type: text/plain, Size: 1865 bytes --]

On Fri, Jun 14, 2013 at 10:56:51PM -0700, Thomas Mueller wrote:

> Excerpt from Frank Steinmetzger:
> > > Temperature? - check for dust puppies clogging the heatsinks, cooling etc.
> 
> > It's a netbook with a 6.5 Watt CPU. I'm going through the big emerge
> > again right now (3.5 hours in), and it never goes above 66°. Besides, the
> > freezes also happened when I didn't do anything heavy. Just surfing
> > (though Firefox can also be heave for an Atom ^^).


> 
> > Anyhoo, I'm running 3.8.13 again now. I didn't have my config anymore,
> > so I oldconfig'ed it from 3.9. Let's see whether it's more stable. If
> > yes, hm... I can't really report this to the kernel devs: "My netbook
> > freezes since 3.9, that's all I know. Here, have my configs." :-I
> 
> Is your temperature of 66° F or C? 

The proper degrees of course ;-) i.e. Celsius.

> System temperature or surrounding room temperature?

CPU Temp. It’s the only sensor I’ve got.

> I have an old computer whose fan has quit as happened once before.

The fan is fine.

> CPUs generate considerable heat […]
> When I do use that old computer, I open the case and prop a hair dryer
> to run at low, ambient-temperature air pointed at the CPU.

Please re-read the first paragraph you quoted from me in your reply.


On an on-topic sidenote: I had a freeze the other day with the 3.8
kernel. So it could likely be a hardware issue.

After I accidentially cut the WiFi cable where it goes through the
hinge, I installed another netbook's antenna and connected it to the
metal foil in the back of the lid. That can't possibly be the culprit,
can it? WiFi is working fine, as far as I can tell.
-- 
Gruß | Greetings | Qapla’
Please do not share anything from, with or about me with any Facebook service.

I hate being bi-polar.  It’s fantastic!

[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2013-06-20  1:22 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-06-13 13:47 [gentoo-user] Any way of tracing kernel freezes? Frank Steinmetzger
2013-06-13 14:05 ` [gentoo-user] " Nikos Chantziaras
2013-06-13 15:18   ` Neil Bothwick
2013-06-13 21:04   ` William Kenworthy
2013-06-14 14:42     ` Frank Steinmetzger
2013-06-13 21:14   ` Paul Hartman
  -- strict thread matches above, loose matches on Subject: below --
2013-06-15  5:56 [gentoo-user] " Thomas Mueller
2013-06-20  1:21 ` Frank Steinmetzger

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox