* [gentoo-amd64] Segfault in emerge @ 2006-05-11 23:04 Anders Thøgersen 2006-05-12 0:06 ` Robert Walter 2006-05-12 4:52 ` [gentoo-amd64] " Duncan 0 siblings, 2 replies; 10+ messages in thread From: Anders Thøgersen @ 2006-05-11 23:04 UTC (permalink / raw To: gentoo-amd64 Hi, I have been consistently getting a segfault when doing emerge sync. This has happened 3 times all while at 51%: [...] Total bytes read: 3377649 wrote 381 bytes read 3377649 bytes 14783.50 bytes/sec total size is 133702955 speedup is 39.58 >>> Updating Portage cache: 51%Segmentation fault The output of dmesg is: emerge[9290]: segfault at 000000aa015e53b8 rip 00002aaaaac0cfad rsp 00007fffffebb8c0 error 4 Emerge version: # emerge --version Portage 2.0.54 (default-linux/amd64/2005.1, gcc-3.4.5, glibc-2.3.5-r2, 2.6.14-gentoo-r2 x86_64) I am unsure of how to tackle this. Could it be a hardware problem? /Anders -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Segfault in emerge 2006-05-11 23:04 [gentoo-amd64] Segfault in emerge Anders Thøgersen @ 2006-05-12 0:06 ` Robert Walter 2006-05-20 22:35 ` Anders Thøgersen 2006-05-12 4:52 ` [gentoo-amd64] " Duncan 1 sibling, 1 reply; 10+ messages in thread From: Robert Walter @ 2006-05-12 0:06 UTC (permalink / raw To: gentoo-amd64 On Friday 12 May 2006 01:04, Anders Thøgersen wrote: > I am unsure of how to tackle this. Could it be a hardware problem? i guess so :-( maybe ram or net-hardware? is it just emerge sync having that problem? do other applications accessing the net behave strange? regards. robert -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Segfault in emerge 2006-05-12 0:06 ` Robert Walter @ 2006-05-20 22:35 ` Anders Thøgersen 0 siblings, 0 replies; 10+ messages in thread From: Anders Thøgersen @ 2006-05-20 22:35 UTC (permalink / raw To: gentoo-amd64 Hi! On 02:06 Fri 12 May 2006, Robert Walter wrote: > On Friday 12 May 2006 01:04, Anders Thøgersen wrote: > > I am unsure of how to tackle this. Could it be a hardware problem? > i guess so :-( maybe ram or net-hardware? > is it just emerge sync having that problem? Sorry for the late reply.. Yes only emerge sync, and emerge --metadata display this problem. I am able to compile firefox and other packages with no problem. > do other applications accessing the net behave strange? Interesting question. Last week firefox unexpectedly segfaulted, but otherwise I have only seen problems with rox and amarok but that was due to a problem of binary incompatibility between an older gcc and gcc 3.4. I have decided not to consider this problem as a hardware issue, until I am out of ideas and suggestions about how to fix my problem. Thanks, /Anders > regards. robert > > -- > gentoo-amd64@gentoo.org mailing list > > -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* [gentoo-amd64] Re: Segfault in emerge 2006-05-11 23:04 [gentoo-amd64] Segfault in emerge Anders Thøgersen 2006-05-12 0:06 ` Robert Walter @ 2006-05-12 4:52 ` Duncan 2006-05-20 22:30 ` Anders Thøgersen 1 sibling, 1 reply; 10+ messages in thread From: Duncan @ 2006-05-12 4:52 UTC (permalink / raw To: gentoo-amd64 Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted 20060511230451.GC8675@genaro.mydomain, excerpted below, on Fri, 12 May 2006 01:04:51 +0200: > I have been consistently getting a segfault when doing emerge sync. > This has happened 3 times all while at 51%: > > [...] > > Total bytes read: 3377649 > > wrote 381 bytes read 3377649 bytes 14783.50 bytes/sec > total size is 133702955 speedup is 39.58 > > >>> Updating Portage cache: 51%Segmentation fault > > Emerge version: > > # emerge --version > Portage 2.0.54 (default-linux/amd64/2005.1, gcc-3.4.5, glibc-2.3.5-r2, 2.6.14-gentoo-r2 x86_64) > > > I am unsure of how to tackle this. Could it be a hardware problem? That's almost certainly a portage cache corruption issue, not a hardware problem, since it always happens at the same place when updating the cache. Try emerge --metadata. (I think that command works with old/stable portage, I'm running the ~amd64 portage-2.1-preX versions here, and have been since they went ~amd64.) That should just update the cache without doing the sync part first. If it's really the cache, that too will probably fail. If it doesn't fail, try another emerge --sync, as it should then be fixed. If it did fail as it likely will, you'll need to delete the cache and then run emerge --metadata again, to rebuild it. The cache is /var/cache/edb. Don't directly delete it. Instead, move/rename it to /var/cache/edb.bak or some such, thereby keeping a backup, just in case. Run emerge --metadata, then emerge --sync, and see if the problem is fixed. If it is, you can safely delete edb.bak. If not, delete the new edb and move the backup version back into place. At that point, you'll probably need a bit more advance help to figure out exactly which file is corrupted. However, it's quite likely that removing edb and letting portage rebuild it will fix the problem. -- Duncan - List replies preferred. No HTML msgs. "Every nonfree program has a lord, a master -- and if you use the program, he is your master." Richard Stallman -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Re: Segfault in emerge 2006-05-12 4:52 ` [gentoo-amd64] " Duncan @ 2006-05-20 22:30 ` Anders Thøgersen 2006-05-21 14:31 ` [gentoo-amd64] " Duncan 2006-05-21 16:11 ` [gentoo-amd64] " Bob Slawson 0 siblings, 2 replies; 10+ messages in thread From: Anders Thøgersen @ 2006-05-20 22:30 UTC (permalink / raw To: gentoo-amd64 Hi, On 04:52 Fri 12 May 2006, Duncan wrote: > Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted > 20060511230451.GC8675@genaro.mydomain, excerpted below, on Fri, 12 May > 2006 01:04:51 +0200: > > > I have been consistently getting a segfault when doing emerge sync. > > This has happened 3 times all while at 51%: > > > > [...] > > > > Total bytes read: 3377649 > > > > wrote 381 bytes read 3377649 bytes 14783.50 bytes/sec > > total size is 133702955 speedup is 39.58 > > > > >>> Updating Portage cache: 51%Segmentation fault > > > That's almost certainly a portage cache corruption issue, not a hardware > problem, since it always happens at the same place when updating the cache. > > Try emerge --metadata. (I think that command works with old/stable > portage, I'm running the ~amd64 portage-2.1-preX versions here, and have > been since they went ~amd64.) That should just update the cache without > doing the sync part first. > > If it's really the cache, that too will probably fail. If it doesn't > fail, try another emerge --sync, as it should then be fixed. If it did > fail as it likely will, you'll need to delete the cache and then run > emerge --metadata again, to rebuild it. Sorry for the late reply,... I backed up /var/cache/edb as you suggested and began emerge --metadata, ... First segfault occurred at 31%. Feeling bold i restarted the command and this time it went all the way to the magic 51% where it segfaulted as before. From here every emerge --metadata results in a segfault at 51% :-/ If I understand you correctly the problem of this segfault is due to a specific file in the poretage tree. To correct this problem must I then locate this file? I am a little puzzled... Regards /Anders -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* [gentoo-amd64] Re: Re: Segfault in emerge 2006-05-20 22:30 ` Anders Thøgersen @ 2006-05-21 14:31 ` Duncan 2006-05-21 16:11 ` [gentoo-amd64] " Bob Slawson 1 sibling, 0 replies; 10+ messages in thread From: Duncan @ 2006-05-21 14:31 UTC (permalink / raw To: gentoo-amd64 Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted 20060520223006.GA9058@genaro.mydomain, excerpted below, on Sun, 21 May 2006 00:30:06 +0200: > On 04:52 Fri 12 May 2006, Duncan wrote: >> Anders posted as summarized on 12 May 2006: >> >> > [Repeatable segfault doing emerge sync at 51%. Portage-2.0.54] >> >> [That's almost certainly a portage cache corruption issue. Try emerge >> --metadata. That should just update the cache without doing the sync >> part first. If that fails, delete the cache and run emerge --metadata >> again, to rebuild it.] > > Sorry for the late reply,... Don't worry too much about the timeliness as the problem's yours, not mine, so your schedule. From the other side, that's one reason I prefer newsgroups or mailing lists to private help -- if one person doesn't get in a timely reply, someone else likely will. (The other big reason is that no single person always guesses the problem right or has the experience to fix it, and a list/newsgroup allows more folks a chance to look at it than private mail would.) > I backed up /var/cache/edb as you suggested and began emerge --metadata, > ... First segfault occurred at 31%. Feeling bold i restarted the > command and this time it went all the way to the magic 51% where it > segfaulted as before. From here every emerge --metadata results in a > segfault at 51% :-/ > > If I understand you correctly the problem of this segfault is due to a > specific file in the poretage tree. To correct this problem must I then > locate this file? Well, locating it would help, but it may be that it isn't necessary, as there are other ways to tackle the problem. A couple things to keep in mind: (1) Portage /can/ operate without that cache -- it's just /very/ slow. Thus, if it comes to being a problem with the portage you are running, you should still be able to merge a different version. (2) We now know the problem regenerates from a clear cache. At this point, with the problem regenerating from a clear cache, the next thing I'd want to establish is that it's not a file system problem. Delete the cache again. If you have /var or /var/cache on its own mount, umount it (depending on whether you have /var/log on the same mount, and on the services you are running, you may have to switch to single user mode or at least shut down your syslog and perhaps other services in order to umount /var) and do a full fsck on it. Remount and startup your services again or simply reboot, and try the emerge --metadata again. If the problem isn't yet gone, delete the cache again and continue... The next item on the checklist is the file system containing the portage tree itself. The tree can be redownloaded, so in general, it's safe to delete. If you run FEATURES=buildpkg, as I've often recommended on this list (different topic but something to look at once you get up and running again, if you haven't already), and your $PKGDIR is in the portage tree as it is by default (/usr/portage/packages, IIRC), you'll want to copy or move that elsewhere. Depending on your internet speed and whether you are charged per byte downloaded, you may wish to do the same thing with $DISTDIR (/usr/portage/distfiles by default), which contains all the source tarballs portage had downloaded. Then delete the portage tree, and if it's on a non-root filesystem, unmount and fsck it as well. See below for refetching, as there's an easier way than emerge --sync when you are fetching the entire thing. If either or both of the above are on your root filesystem, after the deletes, reboot or boot to your rescue solution (the liveCD or alternate boot volume or whatever) and do the fsck from there. The deletes aren't absolutely necessary, but are worthwhile since the data is redownloadable/rebuildable anyway, and if the problem /is/ a filesystem error, it's easier just renewing the data than it is trying to rebuild the file from incomplete data in lost&found. Additionally, if there happen to be other errors on the filesystem and thus other files end up in lost&found, it's easier to find the files you really /do/ need to recover there if there's less noise from files that would be easier simply refetched or recached. Now that you know it's not a problem with a bad filesystem, the next step is getting a new copy of the portage tree. Since we deleted the tree we had, emerge --sync isn't the most efficient option, tho it would normally do the job. Rather, and this kills two birds with one stone as it's the next thing to try as well, use emerge-webrsync. This fetches a verified snapshot tarball of the tree taken daily, so it's not quite as uptodate as a live sync would be (it could be up to 24 hours old), but it's more efficient if you aren't starting with a mostly uptodate tree with only a few changes needed, than emerge --sync would be. Doing it this way, we test another sync method and ensure that we get a complete copy of the tree, as well, bypassing the rsync and any possibly broken files that had been causing problems in your local copy of the tree. emerge-webrsync performs an emerge --metadata after completing the tree sync, so if it goes fine, you should be back in business. Try another emerge --sync and see. If you are still having problems at /that/ point, having verified that it's not a filesystem issue, and trying a completely new copy of the tree fetched with emerge-webrsync, /then/ things start getting interesting. There are still some things that can be tried, but better to wait until we know they are needed before getting worried. The output of emerge-webrsync or the next sync where the problem reoccurs would be interesting as well, so post it. Also, at this point, it may be useful to file a portage bug and get the opinion of the real experts. However, hopefully, that's not necessary, as a clean filesystem and copy of the tree will have eliminated the issue. -- Duncan - List replies preferred. No HTML msgs. "Every nonfree program has a lord, a master -- and if you use the program, he is your master." Richard Stallman -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Re: Segfault in emerge 2006-05-20 22:30 ` Anders Thøgersen 2006-05-21 14:31 ` [gentoo-amd64] " Duncan @ 2006-05-21 16:11 ` Bob Slawson 2006-07-09 3:28 ` Anders Thøgersen 1 sibling, 1 reply; 10+ messages in thread From: Bob Slawson @ 2006-05-21 16:11 UTC (permalink / raw To: gentoo-amd64 Anders � wrote: > Hi, > > On 04:52 Fri 12 May 2006, Duncan wrote: > >> Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted >> 20060511230451.GC8675@genaro.mydomain, excerpted below, on Fri, 12 May >> 2006 01:04:51 +0200: >> >> >>> I have been consistently getting a segfault when doing emerge sync. >>> This has happened 3 times all while at 51%: Have you run `fsck' on the filesystem containing `/usr/portage' lately? There might be a bad link at the spot where the segfault is consistently recurring. You may need to `shutdown -r -F now' to force a fsck of the root filesystem before it is mounted during boot up. BobS -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Re: Segfault in emerge 2006-05-21 16:11 ` [gentoo-amd64] " Bob Slawson @ 2006-07-09 3:28 ` Anders Thøgersen 2006-07-09 11:19 ` [gentoo-amd64] " Duncan 0 siblings, 1 reply; 10+ messages in thread From: Anders Thøgersen @ 2006-07-09 3:28 UTC (permalink / raw To: gentoo-amd64 Hi All, On 12:11 Sun 21 May 2006, Bob Slawson wrote: > Anders ??? wrote: > > Hi, > > > > On 04:52 Fri 12 May 2006, Duncan wrote: > > > >> Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted > >> 20060511230451.GC8675@genaro.mydomain, excerpted below, on Fri, 12 May > >> 2006 01:04:51 +0200: > >> > >> > >>> I have been consistently getting a segfault when doing emerge sync. > >>> This has happened 3 times all while at 51%: > Have you run `fsck' on the filesystem containing `/usr/portage' lately? There might be a bad link at the spot where the segfault is consistently recurring. > > You may need to `shutdown -r -F now' to force a fsck of the root > filesystem before it is mounted during boot up. This did the trick! After the fsck i did an emerge sync and an update of portage, and now all is well :-) Thanks, /Anders -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* [gentoo-amd64] Re: Re: Segfault in emerge 2006-07-09 3:28 ` Anders Thøgersen @ 2006-07-09 11:19 ` Duncan 2006-07-09 20:32 ` Anders Thøgersen 0 siblings, 1 reply; 10+ messages in thread From: Duncan @ 2006-07-09 11:19 UTC (permalink / raw To: gentoo-amd64 Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted 20060709032840.GA29449@genaro.mydomain, excerpted below, on Sun, 09 Jul 2006 05:28:40 +0200: > On 12:11 Sun 21 May 2006, Bob Slawson wrote: >> > >> >> Anders: >> >> >> >>> I have been consistently getting a segfault when doing emerge sync. >> >>> This has happened 3 times all while at 51%: >> Have you run `fsck' on the filesystem containing `/usr/portage' lately? > > This did the trick! After the fsck i did an emerge sync and an update > of portage, and now all is well :-) Cool! =8^) Thanks for the fix report! -- Duncan - List replies preferred. No HTML msgs. "Every nonfree program has a lord, a master -- and if you use the program, he is your master." Richard Stallman -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Re: Re: Segfault in emerge 2006-07-09 11:19 ` [gentoo-amd64] " Duncan @ 2006-07-09 20:32 ` Anders Thøgersen 0 siblings, 0 replies; 10+ messages in thread From: Anders Thøgersen @ 2006-07-09 20:32 UTC (permalink / raw To: gentoo-amd64 Hi On 11:19 Sun 09 Jul 2006, Duncan wrote: > Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted > 20060709032840.GA29449@genaro.mydomain, excerpted below, on Sun, 09 Jul > 2006 05:28:40 +0200: > > > On 12:11 Sun 21 May 2006, Bob Slawson wrote: > >> > > >> >> Anders: > >> >> > >> >>> I have been consistently getting a segfault when doing emerge sync. > >> >>> This has happened 3 times all while at 51%: > > >> Have you run `fsck' on the filesystem containing `/usr/portage' lately? > > > > This did the trick! After the fsck i did an emerge sync and an update > > of portage, and now all is well :-) > > Cool! =8^) Thanks for the fix report! And thank you! Reading your thorough replies are very valuable to me,I seem to learn something new each time :-) /Anders -- gentoo-amd64@gentoo.org mailing list ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2006-07-09 18:34 UTC | newest] Thread overview: 10+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2006-05-11 23:04 [gentoo-amd64] Segfault in emerge Anders Thøgersen 2006-05-12 0:06 ` Robert Walter 2006-05-20 22:35 ` Anders Thøgersen 2006-05-12 4:52 ` [gentoo-amd64] " Duncan 2006-05-20 22:30 ` Anders Thøgersen 2006-05-21 14:31 ` [gentoo-amd64] " Duncan 2006-05-21 16:11 ` [gentoo-amd64] " Bob Slawson 2006-07-09 3:28 ` Anders Thøgersen 2006-07-09 11:19 ` [gentoo-amd64] " Duncan 2006-07-09 20:32 ` Anders Thøgersen
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox