From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <gentoo-dev-return-12428-arch-gentoo-dev=gentoo.org@lists.gentoo.org>
Received: (qmail 5297 invoked from network); 13 May 2004 13:03:18 +0000
Received: from smtp.gentoo.org (128.193.0.39)
  by eagle.gentoo.oregonstate.edu with DES-CBC3-SHA encrypted SMTP; 13 May 2004 13:03:18 +0000
Received: from lists.gentoo.org ([128.193.0.34] helo=eagle.gentoo.org)
	by smtp.gentoo.org with esmtp (Exim 4.24)
	id 1BOFrp-0005C3-SF
	for arch-gentoo-dev@lists.gentoo.org; Thu, 13 May 2004 13:03:17 +0000
Received: (qmail 15712 invoked by uid 50004); 13 May 2004 13:03:15 +0000
Mailing-List: contact gentoo-dev-help@gentoo.org; run by ezmlm
Precedence: bulk
List-Post: <mailto:gentoo-dev@gentoo.org>
List-Help: <mailto:gentoo-dev-help@gentoo.org>
List-Unsubscribe: <mailto:gentoo-dev-unsubscribe@gentoo.org>
List-Subscribe: <mailto:gentoo-dev-subscribe@gentoo.org>
List-Id: Gentoo Linux mail <gentoo-dev.gentoo.org>
X-BeenThere: gentoo-dev@gentoo.org
Received: (qmail 15565 invoked from network); 13 May 2004 13:03:14 +0000
From: Chris Gianelloni <wolf31o2@gentoo.org>
Reply-To: wolf31o2@gentoo.org
To: Kevin <gentoo-dev@gnosys.biz>
Cc: Gentoo Dev <gentoo-dev@lists.gentoo.org>
In-Reply-To: <200405130706.12534.gentoo-dev@gnosys.biz>
References: <793F9D20-A427-11D8-AC04-0003939E069A@mac.com>
	 <200405130706.12534.gentoo-dev@gnosys.biz>
Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="=-5dhJAK1k/pdjjpnH3zLG"
Organization: Gentoo Linux
Message-Id: <1084453494.19614.29.camel@localhost>
Mime-Version: 1.0
X-Mailer: Ximian Evolution 1.4.6 
Date: Thu, 13 May 2004 09:04:55 -0400
Subject: Re: [gentoo-dev] Major MCE problem with SMP on Gentoo kernels
X-Archives-Salt: 68ffcfa7-d08b-4bf5-996c-8c5fd99b1f0b
X-Archives-Hash: 12a7087829145c79a5ddc57ae9124478

--=-5dhJAK1k/pdjjpnH3zLG
Content-Type: text/plain
Content-Transfer-Encoding: quoted-printable

On Thu, 2004-05-13 at 07:06, Kevin wrote:
> I've now tried a stage 3 installation booting the 2.6.1 SMP kernel from a=
=20
> 2004.0 LiveCD (the SMP configs on 2004.1 LiveCDs are all broken---see bug=
=20
> #49382).

I know this isn't exactly what you're looking for, but I have a CD
(actually, a GameCD beta) available at
http://dev.gentoo.org/~wolf31o2/x86-ut2004demo-20040420.iso that you
could grab.  It has only one kernel, and it is SMP.  It has booted and
worked successfully on every machine I have tried it on, and even has
X+fluxbox on it.

> I had no lockup problems while running that kernel, but after rebooting=20
> with my kernel (gentoo-sources, built with Chris's=20
> CFLAGS=3D"-march=3Dpentium4 -O2 -pipe -fomit-frame-pointer") I've already=
=20
> suffered two lockups.  I set MAKEOPS=3D"-j1" for safety.

Copy the kernel from my CD and the /lib/modules/2.6.5-gentoo-r1 and see
if that kernel works fine on your machine in your build environment.

> Something very weird here.  Next, I'm going to try booting from the cd an=
d=20
> chrooting into my system and then doing more extensive testing of the=20
> kernel on the cd, but I'm really running out of options here.  I'll=20
> probably also try building another kernel with CFLAGS=3D"-march=3Dpentium=
3=20
> -O2 -pipe".  Any other suggestions?

Try my CD... it works in SMP.  That will help test some of the problems,
especially since the kernel you are "testing" with is not SMP, so you're
not really testing anything.

> Does anyone think that my two CPUs having different stepping levels could=
=20
> have anything to do with this problem?  One is level 7 and the other 9.

It is possible that is causing the problem.  You never really know.  I
*doubt* it should be a problem, unless one CPU is running out of spec.

> Greg KH thinks it's bad memory, but I'm skeptical of that because the mai=
n=20
> address that fails (some 30 times in a row) is at 1023.8MB and the Dell=20
> Utilities only test up to 1022MB, and because I haven't seen the problem=20
> with the liveCD kernel.

It still could be bad memory.  I think I would trust memtest86 before
the Dell utilities.  You could also try finding another bootable system
checker.  I'm sure there are plenty available.

--=20
Chris Gianelloni
Developer, Gentoo Linux
Games Team

Is your power animal a penguin?

--=-5dhJAK1k/pdjjpnH3zLG
Content-Type: application/pgp-signature; name=signature.asc
Content-Description: This is a digitally signed message part

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)

iD8DBQBAo3J2kT4lNIS36YERAlJvAJ4nkgvX2hqyhoY6o67rz4NLNXAmHACfWNgL
BnKe1WBC3djGYe3MMHyS8VY=
=ojOq
-----END PGP SIGNATURE-----

--=-5dhJAK1k/pdjjpnH3zLG--