From: Alexander Puchmayr
To: gentoo-user@lists.gentoo.org
Subject: [gentoo-user] Kernel freezes
Date: Sat, 6 Jun 2009 10:33:52 +0200
Message-Id: <200906061033.53197.alexander.puchmayr@linznet.at>

Hi there!

This week I've tried to set up a home server, but the system is highly unstable. The first symptoms were lots of page allocation errors, which disappeared after switching the kernel's memory allocator from SLUB to SLAB and increasing min_free_kbytes in /proc/sys/vm from 8MB to 20MB.

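For reference, min_free_kbytes is expressed in kilobytes, so the change described above would correspond to roughly 8192 and 20480, assuming the 8MB and 20MB figures are exact. A minimal sketch of that conversion follows; the helper names are hypothetical, and the `vm.min_free_kbytes = N` line is the usual sysctl.conf form for making such a setting persistent:

```python
def mb_to_kbytes(megabytes: int) -> int:
    """Convert MB to kB (1 MB = 1024 kB), the unit used by min_free_kbytes."""
    return megabytes * 1024


def sysctl_line(megabytes: int) -> str:
    """Build a sysctl.conf-style line for vm.min_free_kbytes (hypothetical helper)."""
    return f"vm.min_free_kbytes = {mb_to_kbytes(megabytes)}"


if __name__ == "__main__":
    # Old and new values mentioned in the post (assumed to be exact MB figures).
    print(sysctl_line(8))
    print(sysctl_line(20))
```
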
The machine is an AMD Athlon64 X2 5050e on an Asus M3A78-Pro board with 2x2GB RAM. I'm using kernel 2.6.29.4 (vanilla, but the result is the same with 2.6.29-gentoo-r5), and I also upgraded the board's BIOS to the latest version (0902).

But the system still freezes after some hours. It just freezes: the console is dead, there is no entry in the logs, no network connectivity, and even sysrq doesn't seem to do anything. The worst thing is that I don't even have an idea what the error could be. In the rare cases when it crashed and the console was not blanked, I only saw the end of a stack trace; the interesting parts had scrolled off, and I can't scroll back as the console is absolutely dead :-( The only button that still works is the reset button, and after rebooting the log doesn't tell me anything (it just ends without any message).

I inspected my dmesg output right after booting more closely and found some strange entries which could indicate a problem. What do you think about them?

[ 0.000000] ACPI Warning (tbfadt-0568): 32/64X length mismatch in Gpe0Block: 64/32 [20081204]
[ 0.000000] FADT: X_PM1a_EVT_BLK.bit_width (16) does not match PM1_EVT_LEN (4)
...
[ 0.000000] 4 Processors exceeds NR_CPUS limit of 2
[ 0.000000] SMP: Allowing 2 CPUs, 0 hotplug CPUs
...
[ 0.000999] Aperture pointing to e820 RAM. Ignoring.
[ 0.000999] Your BIOS doesn't leave a aperture memory hole
[ 0.000999] Please enable the IOMMU option in the BIOS setup
[ 0.000999] This costs you 64 MB of RAM
[ 0.000999] Mapping aperture over 65536 KB of RAM @ 20000000
[ 0.000999] PM: Registered nosave memory: 0000000020000000 - 0000000024000000
...
[ 0.099055] mtrr: your CPUs had inconsistent fixed MTRR settings
[ 0.099059] mtrr: probably your BIOS does not setup all CPUs.
[ 0.099116] mtrr: corrected configuration.
...
[ 0.151260] PCI-DMA: Disabling AGP.
[ 0.151260] PCI-DMA: aperture base @ 20000000 size 65536 KB
[ 0.151260] PCI-DMA: using GART IOMMU.
[ 0.151260] PCI-DMA: Reserving 64MB of IOMMU area in the AGP aperture
...
[ 0.163241] system 00:09: iomem range 0xfec00000-0xfec00fff has been reserved
[ 0.163305] system 00:09: iomem range 0xfee00000-0xfee00fff has been reserved
[ 0.163365] system 00:0a: ioport range 0x4d0-0x4d1 has been reserved
[ 0.163422] system 00:0a: ioport range 0x40b-0x40b has been reserved
[ 0.163480] system 00:0a: ioport range 0x4d6-0x4d6 has been reserved
[ 0.163537] system 00:0a: ioport range 0xc00-0xc01 has been reserved
[ 0.163595] system 00:0a: ioport range 0xc14-0xc14 has been reserved
[ 0.163653] system 00:0a: ioport range 0xc50-0xc51 has been reserved
[ 0.163711] system 00:0a: ioport range 0xc52-0xc52 has been reserved
[ 0.163769] system 00:0a: ioport range 0xc6c-0xc6c has been reserved
[ 0.163827] system 00:0a: ioport range 0xc6f-0xc6f has been reserved
[ 0.163885] system 00:0a: ioport range 0xcd0-0xcd1 has been reserved
[ 0.163942] system 00:0a: ioport range 0xcd2-0xcd3 has been reserved
[ 0.163999] system 00:0a: ioport range 0xcd4-0xcd5 has been reserved
[ 0.164070] system 00:0a: ioport range 0xcd6-0xcd7 has been reserved
[ 0.164127] system 00:0a: ioport range 0xcd8-0xcdf has been reserved
[ 0.164184] system 00:0a: ioport range 0x800-0x89f has been reserved
[ 0.164241] system 00:0a: ioport range 0xb00-0xb3f has been reserved
[ 0.164305] system 00:0a: ioport range 0x900-0x90f has been reserved
[ 0.164363] system 00:0a: ioport range 0x910-0x91f has been reserved
[ 0.164421] system 00:0a: ioport range 0xfe00-0xfefe has been reserved
[ 0.164480] system 00:0a: iomem range 0xffb80000-0xffbfffff has been reserved
[ 0.164538] system 00:0a: iomem range 0xfec10000-0xfec1001f has been reserved
[ 0.164598] system 00:0c: ioport range 0xe00-0xe0f has been reserved
[ 0.164656] system 00:0c: ioport range 0xe80-0xe8f has been reserved
[ 0.164713] system 00:0c: ioport range 0xf40-0xf4f has been reserved
[ 0.164771] system 00:0c: ioport range 0xa30-0xa3f has been reserved
[ 0.164830] system 00:0d: iomem range 0xe0000000-0xefffffff has been reserved
[ 0.164890] system 00:0e: iomem range 0x0-0x9ffff could not be reserved
[ 0.164947] system 00:0e: iomem range 0xc0000-0xcffff has been reserved
[ 0.165018] system 00:0e: iomem range 0xe0000-0xfffff could not be reserved
[ 0.165076] system 00:0e: iomem range 0x100000-0xdfffffff could not be reserved
[ 0.165158] system 00:0e: iomem range 0xfec00000-0xffffffff could not be reserved
...
[ 21.298450] ACPI: I/O resource piix4_smbus [0xb00-0xb07] conflicts with ACPI region SOR1 [0xb00-0xb0f]
[ 21.298454] ACPI: Device needs an ACPI driver
[ 21.298461] piix4_smbus 0000:00:14.0: SMBus Host Controller at 0xb00, revision 0
...
[ 73.861479] ACPI: I/O resource it87 [0xe85-0xe86] conflicts with ACPI region HWRE [0xe85-0xe86]
[ 73.861483] ACPI: Device needs an ACPI driver

What does the message "4 Processors exceeds NR_CPUS" mean? The system is a dual-core AMD Athlon64 5050e; AFAIK it has two cores and nothing more. The mtrr messages later also suggest that there could be more than 2 CPUs available. Wondering...

The next thing that seems somewhat strange to me is the AGP aperture and the IOMMU. The mainboard does not have an AGP port, nor does the BIOS have any such option to enable. The only thing I can set is the size of the memory reserved for the onboard video card, which I set to the smallest value of 32MB, as the machine will usually not even have a display.

And the iomem range reservation errors at the end: harmful or not?

The last messages appear after loading the hw-sensors modules it87.ko and i2c_piix4.

Thanks in advance for suggestions
Alex
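
The manual scan of the boot log described above could be approximated with a short script. This is only a sketch under stated assumptions: the keyword list is an arbitrary choice for illustration (not a kernel-defined severity scheme), the function name is hypothetical, and the input is assumed to be dmesg lines in the `[ timestamp] message` form quoted in this post.

```python
import re

# Substrings treated as suspicious; an assumption chosen for illustration only.
SUSPICIOUS = ("warning", "mismatch", "conflicts", "could not be reserved",
              "inconsistent", "exceeds")

# Matches "[ 0.164890] message text" and captures timestamp and message.
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(.*)$")


def suspicious_entries(lines):
    """Return (timestamp, message) pairs whose message contains a keyword."""
    hits = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip "..." elisions and anything without a timestamp
        timestamp, message = float(match.group(1)), match.group(2)
        if any(word in message.lower() for word in SUSPICIOUS):
            hits.append((timestamp, message))
    return hits


if __name__ == "__main__":
    sample = [
        "[ 0.000000] 4 Processors exceeds NR_CPUS limit of 2",
        "[ 0.164830] system 00:0d: iomem range 0xe0000000-0xefffffff has been reserved",
        "[ 0.164890] system 00:0e: iomem range 0x0-0x9ffff could not be reserved",
        "...",
        "[ 73.861479] ACPI: I/O resource it87 [0xe85-0xe86] conflicts with ACPI region HWRE [0xe85-0xe86]",
    ]
    for ts, msg in suspicious_entries(sample):
        print(f"{ts:12.6f}  {msg}")
```

Feeding it the output of a saved dmesg dump (for example, lines read from a file) would list only the entries matching the keywords, which makes it easier to compare boots; it says nothing about whether a flagged line is actually harmful.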