From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from lists.gentoo.org (pigeon.gentoo.org [208.92.234.80]) by finch.gentoo.org (Postfix) with ESMTP id 1E7AA139897 for ; Tue, 25 Aug 2015 17:54:57 +0000 (UTC) Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id 15F9A14239; Tue, 25 Aug 2015 17:54:43 +0000 (UTC) Received: from mail-la0-f41.google.com (mail-la0-f41.google.com [209.85.215.41]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by pigeon.gentoo.org (Postfix) with ESMTPS id C06AC141E9 for ; Tue, 25 Aug 2015 17:54:41 +0000 (UTC) Received: by lalv9 with SMTP id v9so103349163lal.0 for ; Tue, 25 Aug 2015 10:54:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=kPK+WFE9hZUYjB7hz1FiBNyv7Pm+DGXJRzWq9ylvswc=; b=D8voMNcz3P97Rkx6/VgIG6wjOaAzz/puk3mOlz1i2NQ2IW8hyaH9Op5U8599n/N8AA LQmTWiaGGdfNR/LCcA6gS4cf9r71RVvZNN3Q2iUH5zJ232alG7gtrXwEq23zy888wwEf /fdNDlmNfo6xJO7s55zw98i1uuGmk5wOg8eruF7X2BC+ji2+l00sozRv0y05D11nYLCb +MfGoqW3izIjubH01z/sErUd3d6MLD2inwptyeB730Uh3Mg/NnFsvEayOkavnyd7iTGW o+2Kl/8hJR1acCDlyMue6lIhWlIm6wunN3pyJeCgdiu6MSLlShsF5RPYiBQCBHBNYShd s/mw== X-Received: by 10.152.42.244 with SMTP id r20mr26810425lal.90.1440525280269; Tue, 25 Aug 2015 10:54:40 -0700 (PDT) Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-user@lists.gentoo.org Reply-to: gentoo-user@lists.gentoo.org MIME-Version: 1.0 Received: by 10.25.163.75 with HTTP; Tue, 25 Aug 2015 10:54:00 -0700 (PDT) In-Reply-To: References: From: Alexander Kapshuk Date: Tue, 25 Aug 2015 20:54:00 +0300 Message-ID: Subject: Re: [gentoo-user] Problems booting vanilla kernel 4.1.x To: Gentoo mailing list Content-Type: text/plain; charset=UTF-8 X-Archives-Salt: eae3774d-2160-45a1-84bf-4d87877ddf83 X-Archives-Hash: f4e47b139a517e55fc54a44c06527363 On Tue, Aug 25, 2015 at 7:40 PM, Peter Weilbacher wrote: > Hi Alexander, > > On Sun, 23 Aug 2015, Alexander Kapshuk wrote: > >> On Sun, Aug 23, 2015 at 4:08 PM, Peter Weilbacher wrote: >> > >> > after successfully using kernel 4.0.5 (vanilla-sources) for a while, I >> > upgraded to 4.1.5 last week and 4.1.6 today. I cannot boot either of >> > them. On the screen I see >> > >> > Decompressing Linux... Parsing ELF... done. >> > Booting the kernel. >> > >> > as the last thing, then it just sits there. >> >> I am running vanilla-sources 4.1.6, and so far I have not had any >> trouble booting it. >> >> Are you able to boot some of your previous kernels? If so, what does >> your '/boot/grub/grub.cfg' look like? >> What is the output of 'cat /etc/fstab' and 'ls -1 /boot'? > > I can still boot 4.0.5 fine, with the same setup. I use lilo, and I > checked that I changed the two/four digits correctly in /etc/lilo.conf. > > By chance I left the boot sit there for more than the typical minute, > and got multiple messages like > > INFO: rcu_sched self-detected stall on CPU { 3} (t=60000 jiffies g=-256 c=-257 q=193) > rcu_sched kthread starved for 50027 jiffies! > > right after the above "Booting the kernel." line. > > Do I need to activate a different kind of clocking or a CPU feature in > 4.1.x? > > Peter. > I've never experienced this particular kernel trouble myself, so I'm not sure if my input would be of much help. Here's what the kernel documentation has to say about this kind of issue: /usr/src/linux/Documentation/RCU/stallwarn.txt:29,33 CONFIG_RCU_CPU_STALL_INFO This kernel configuration parameter causes the stall warning to print out additional per-CPU diagnostic information, including information on scheduling-clock ticks and RCU's idle-CPU tracking. /usr/src/linux/Documentation/RCU/stallwarn.txt:104,109 If the CONFIG_RCU_CPU_STALL_INFO kernel configuration parameter is set, more information is printed with the stall-warning message, for example: INFO: rcu_preempt detected stall on CPU 0: (63959 ticks this GP) idle=241/3fffffffffffffff/0 softirq=82/543 (t=65000 jiffies) /usr/src/linux/Documentation/RCU/stallwarn.txt:240,250 To diagnose the cause of the stall, inspect the stack traces. The offending function will usually be near the top of the stack. If you have a series of stall warnings from a single extended stall, comparing the stack traces can often help determine where the stall is occurring, which will usually be in the function nearest the top of that portion of the stack which remains the same from trace to trace. If you can reliably trigger the stall, ftrace can be quite helpful. RCU bugs can often be debugged with the help of CONFIG_RCU_TRACE and with RCU's event tracing. For information on RCU's event tracing, see include/trace/events/rcu.h. Have a look for possibly stack traces in these log files: /var/log/{messages,dmesg}. Hopefully, someone else with more kernel debugging experience will have something more substantial to say about this.