* [gentoo-amd64] Segfault in emerge
@ 2006-05-11 23:04 Anders Thøgersen
2006-05-12 0:06 ` Robert Walter
2006-05-12 4:52 ` [gentoo-amd64] " Duncan
0 siblings, 2 replies; 10+ messages in thread
From: Anders Thøgersen @ 2006-05-11 23:04 UTC (permalink / raw
To: gentoo-amd64
Hi,
I have been consistently getting a segfault when doing emerge sync.
This has happened 3 times all while at 51%:
[...]
Total bytes read: 3377649
wrote 381 bytes read 3377649 bytes 14783.50 bytes/sec
total size is 133702955 speedup is 39.58
>>> Updating Portage cache: 51%Segmentation fault
The output of dmesg is:
emerge[9290]: segfault at 000000aa015e53b8 rip 00002aaaaac0cfad rsp 00007fffffebb8c0 error 4
Emerge version:
# emerge --version
Portage 2.0.54 (default-linux/amd64/2005.1, gcc-3.4.5, glibc-2.3.5-r2, 2.6.14-gentoo-r2 x86_64)
I am unsure of how to tackle this. Could it be a hardware problem?
/Anders
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Segfault in emerge
2006-05-11 23:04 [gentoo-amd64] Segfault in emerge Anders Thøgersen
@ 2006-05-12 0:06 ` Robert Walter
2006-05-20 22:35 ` Anders Thøgersen
2006-05-12 4:52 ` [gentoo-amd64] " Duncan
1 sibling, 1 reply; 10+ messages in thread
From: Robert Walter @ 2006-05-12 0:06 UTC (permalink / raw
To: gentoo-amd64
On Friday 12 May 2006 01:04, Anders Thøgersen wrote:
> I am unsure of how to tackle this. Could it be a hardware problem?
i guess so :-( maybe ram or net-hardware?
is it just emerge sync having that problem?
do other applications accessing the net behave strange?
regards. robert
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* [gentoo-amd64] Re: Segfault in emerge
2006-05-11 23:04 [gentoo-amd64] Segfault in emerge Anders Thøgersen
2006-05-12 0:06 ` Robert Walter
@ 2006-05-12 4:52 ` Duncan
2006-05-20 22:30 ` Anders Thøgersen
1 sibling, 1 reply; 10+ messages in thread
From: Duncan @ 2006-05-12 4:52 UTC (permalink / raw
To: gentoo-amd64
Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted
20060511230451.GC8675@genaro.mydomain, excerpted below, on Fri, 12 May
2006 01:04:51 +0200:
> I have been consistently getting a segfault when doing emerge sync.
> This has happened 3 times all while at 51%:
>
> [...]
>
> Total bytes read: 3377649
>
> wrote 381 bytes read 3377649 bytes 14783.50 bytes/sec
> total size is 133702955 speedup is 39.58
>
> >>> Updating Portage cache: 51%Segmentation fault
>
> Emerge version:
>
> # emerge --version
> Portage 2.0.54 (default-linux/amd64/2005.1, gcc-3.4.5, glibc-2.3.5-r2, 2.6.14-gentoo-r2 x86_64)
>
>
> I am unsure of how to tackle this. Could it be a hardware problem?
That's almost certainly a portage cache corruption issue, not a hardware
problem, since it always happens at the same place when updating the cache.
Try emerge --metadata. (I think that command works with old/stable
portage, I'm running the ~amd64 portage-2.1-preX versions here, and have
been since they went ~amd64.) That should just update the cache without
doing the sync part first.
If it's really the cache, that too will probably fail. If it doesn't
fail, try another emerge --sync, as it should then be fixed. If it did
fail as it likely will, you'll need to delete the cache and then run
emerge --metadata again, to rebuild it.
The cache is /var/cache/edb. Don't directly delete it. Instead,
move/rename it to /var/cache/edb.bak or some such, thereby keeping a
backup, just in case. Run emerge --metadata, then emerge --sync, and see if
the problem is fixed. If it is, you can safely delete edb.bak. If not,
delete the new edb and move the backup version back into place. At that
point, you'll probably need a bit more advance help to figure out exactly
which file is corrupted. However, it's quite likely that removing edb and
letting portage rebuild it will fix the problem.
--
Duncan - List replies preferred. No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master." Richard Stallman
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Re: Segfault in emerge
2006-05-12 4:52 ` [gentoo-amd64] " Duncan
@ 2006-05-20 22:30 ` Anders Thøgersen
2006-05-21 14:31 ` [gentoo-amd64] " Duncan
2006-05-21 16:11 ` [gentoo-amd64] " Bob Slawson
0 siblings, 2 replies; 10+ messages in thread
From: Anders Thøgersen @ 2006-05-20 22:30 UTC (permalink / raw
To: gentoo-amd64
Hi,
On 04:52 Fri 12 May 2006, Duncan wrote:
> Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted
> 20060511230451.GC8675@genaro.mydomain, excerpted below, on Fri, 12 May
> 2006 01:04:51 +0200:
>
> > I have been consistently getting a segfault when doing emerge sync.
> > This has happened 3 times all while at 51%:
> >
> > [...]
> >
> > Total bytes read: 3377649
> >
> > wrote 381 bytes read 3377649 bytes 14783.50 bytes/sec
> > total size is 133702955 speedup is 39.58
> >
> > >>> Updating Portage cache: 51%Segmentation fault
> >
> That's almost certainly a portage cache corruption issue, not a hardware
> problem, since it always happens at the same place when updating the cache.
>
> Try emerge --metadata. (I think that command works with old/stable
> portage, I'm running the ~amd64 portage-2.1-preX versions here, and have
> been since they went ~amd64.) That should just update the cache without
> doing the sync part first.
>
> If it's really the cache, that too will probably fail. If it doesn't
> fail, try another emerge --sync, as it should then be fixed. If it did
> fail as it likely will, you'll need to delete the cache and then run
> emerge --metadata again, to rebuild it.
Sorry for the late reply,...
I backed up /var/cache/edb as you suggested and began emerge --metadata,
... First segfault occurred at 31%. Feeling bold i restarted the
command and this time it went all the way to the magic 51% where it
segfaulted as before. From here every emerge --metadata results in a
segfault at 51% :-/
If I understand you correctly the problem of this segfault is due to a
specific file in the poretage tree. To correct this problem must I then
locate this file?
I am a little puzzled...
Regards
/Anders
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Segfault in emerge
2006-05-12 0:06 ` Robert Walter
@ 2006-05-20 22:35 ` Anders Thøgersen
0 siblings, 0 replies; 10+ messages in thread
From: Anders Thøgersen @ 2006-05-20 22:35 UTC (permalink / raw
To: gentoo-amd64
Hi!
On 02:06 Fri 12 May 2006, Robert Walter wrote:
> On Friday 12 May 2006 01:04, Anders Thøgersen wrote:
> > I am unsure of how to tackle this. Could it be a hardware problem?
> i guess so :-( maybe ram or net-hardware?
> is it just emerge sync having that problem?
Sorry for the late reply..
Yes only emerge sync, and emerge --metadata display this problem. I am
able to compile firefox and other packages with no problem.
> do other applications accessing the net behave strange?
Interesting question. Last week firefox unexpectedly segfaulted, but
otherwise I have only seen problems with rox and amarok but that was due
to a problem of binary incompatibility between an older gcc and gcc 3.4.
I have decided not to consider this problem as a hardware issue, until I
am out of ideas and suggestions about how to fix my problem.
Thanks,
/Anders
> regards. robert
>
> --
> gentoo-amd64@gentoo.org mailing list
>
>
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* [gentoo-amd64] Re: Re: Segfault in emerge
2006-05-20 22:30 ` Anders Thøgersen
@ 2006-05-21 14:31 ` Duncan
2006-05-21 16:11 ` [gentoo-amd64] " Bob Slawson
1 sibling, 0 replies; 10+ messages in thread
From: Duncan @ 2006-05-21 14:31 UTC (permalink / raw
To: gentoo-amd64
Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted
20060520223006.GA9058@genaro.mydomain, excerpted below, on Sun, 21 May
2006 00:30:06 +0200:
> On 04:52 Fri 12 May 2006, Duncan wrote:
>> Anders posted as summarized on 12 May 2006:
>>
>> > [Repeatable segfault doing emerge sync at 51%. Portage-2.0.54]
>>
>> [That's almost certainly a portage cache corruption issue. Try emerge
>> --metadata. That should just update the cache without doing the sync
>> part first. If that fails, delete the cache and run emerge --metadata
>> again, to rebuild it.]
>
> Sorry for the late reply,...
Don't worry too much about the timeliness as the problem's yours, not
mine, so your schedule. From the other side, that's one reason I prefer
newsgroups or mailing lists to private help -- if one person doesn't get
in a timely reply, someone else likely will. (The other big reason is
that no single person always guesses the problem right or has the
experience to fix it, and a list/newsgroup allows more folks a chance to
look at it than private mail would.)
> I backed up /var/cache/edb as you suggested and began emerge --metadata,
> ... First segfault occurred at 31%. Feeling bold i restarted the
> command and this time it went all the way to the magic 51% where it
> segfaulted as before. From here every emerge --metadata results in a
> segfault at 51% :-/
>
> If I understand you correctly the problem of this segfault is due to a
> specific file in the poretage tree. To correct this problem must I then
> locate this file?
Well, locating it would help, but it may be that it isn't necessary, as
there are other ways to tackle the problem.
A couple things to keep in mind: (1) Portage /can/ operate without that
cache -- it's just /very/ slow. Thus, if it comes to being a problem with
the portage you are running, you should still be able to merge a different
version. (2) We now know the problem regenerates from a clear cache.
At this point, with the problem regenerating from a clear cache, the next
thing I'd want to establish is that it's not a file system problem.
Delete the cache again. If you have /var or /var/cache on its own mount,
umount it (depending on whether you have /var/log on the same mount, and
on the services you are running, you may have to switch to single user
mode or at least shut down your syslog and perhaps other services in order
to umount /var) and do a full fsck on it. Remount and startup your
services again or simply reboot, and try the emerge --metadata again. If
the problem isn't yet gone, delete the cache again and continue...
The next item on the checklist is the file system containing the portage
tree itself. The tree can be redownloaded, so in general, it's safe to
delete. If you run FEATURES=buildpkg, as I've often recommended on this
list (different topic but something to look at once you get up and running
again, if you haven't already), and your $PKGDIR is in the portage tree as
it is by default (/usr/portage/packages, IIRC), you'll want to copy or
move that elsewhere. Depending on your internet speed and whether you are
charged per byte downloaded, you may wish to do the same thing with
$DISTDIR (/usr/portage/distfiles by default), which contains all the
source tarballs portage had downloaded. Then delete the portage tree, and
if it's on a non-root filesystem, unmount and fsck it as well. See below
for refetching, as there's an easier way than emerge --sync when you are
fetching the entire thing.
If either or both of the above are on your root filesystem, after the
deletes, reboot or boot to your rescue solution (the liveCD or
alternate boot volume or whatever) and do the fsck from there. The
deletes aren't absolutely necessary, but are worthwhile since the data is
redownloadable/rebuildable anyway, and if the problem /is/ a filesystem
error, it's easier just renewing the data than it is trying to rebuild the
file from incomplete data in lost&found. Additionally, if there happen to
be other errors on the filesystem and thus other files end up in
lost&found, it's easier to find the files you really /do/ need to recover
there if there's less noise from files that would be easier simply
refetched or recached.
Now that you know it's not a problem with a bad filesystem, the next step
is getting a new copy of the portage tree. Since we deleted the tree we
had, emerge --sync isn't the most efficient option, tho it would normally
do the job. Rather, and this kills two birds with one stone as it's the
next thing to try as well, use emerge-webrsync. This fetches a verified
snapshot tarball of the tree taken daily, so it's not quite as uptodate as
a live sync would be (it could be up to 24 hours old), but it's more
efficient if you aren't starting with a mostly uptodate tree with only a
few changes needed, than emerge --sync would be. Doing it this way, we
test another sync method and ensure that we get a complete copy of the
tree, as well, bypassing the rsync and any possibly broken files that had
been causing problems in your local copy of the tree.
emerge-webrsync performs an emerge --metadata after completing the tree
sync, so if it goes fine, you should be back in business. Try another
emerge --sync and see.
If you are still having problems at /that/ point, having verified that
it's not a filesystem issue, and trying a completely new copy of the tree
fetched with emerge-webrsync, /then/ things start getting interesting.
There are still some things that can be tried, but better to wait until we
know they are needed before getting worried. The output of
emerge-webrsync or the next sync where the problem reoccurs would be
interesting as well, so post it. Also, at this point, it may be useful to
file a portage bug and get the opinion of the real experts. However,
hopefully, that's not necessary, as a clean filesystem and copy of the
tree will have eliminated the issue.
--
Duncan - List replies preferred. No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master." Richard Stallman
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Re: Segfault in emerge
2006-05-20 22:30 ` Anders Thøgersen
2006-05-21 14:31 ` [gentoo-amd64] " Duncan
@ 2006-05-21 16:11 ` Bob Slawson
2006-07-09 3:28 ` Anders Thøgersen
1 sibling, 1 reply; 10+ messages in thread
From: Bob Slawson @ 2006-05-21 16:11 UTC (permalink / raw
To: gentoo-amd64
Anders � wrote:
> Hi,
>
> On 04:52 Fri 12 May 2006, Duncan wrote:
>
>> Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted
>> 20060511230451.GC8675@genaro.mydomain, excerpted below, on Fri, 12 May
>> 2006 01:04:51 +0200:
>>
>>
>>> I have been consistently getting a segfault when doing emerge sync.
>>> This has happened 3 times all while at 51%:
Have you run `fsck' on the filesystem containing `/usr/portage' lately? There might be a bad link at the spot where the segfault is consistently recurring.
You may need to `shutdown -r -F now' to force a fsck of the root
filesystem before it is mounted during boot up.
BobS
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Re: Segfault in emerge
2006-05-21 16:11 ` [gentoo-amd64] " Bob Slawson
@ 2006-07-09 3:28 ` Anders Thøgersen
2006-07-09 11:19 ` [gentoo-amd64] " Duncan
0 siblings, 1 reply; 10+ messages in thread
From: Anders Thøgersen @ 2006-07-09 3:28 UTC (permalink / raw
To: gentoo-amd64
Hi All,
On 12:11 Sun 21 May 2006, Bob Slawson wrote:
> Anders ??? wrote:
> > Hi,
> >
> > On 04:52 Fri 12 May 2006, Duncan wrote:
> >
> >> Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted
> >> 20060511230451.GC8675@genaro.mydomain, excerpted below, on Fri, 12 May
> >> 2006 01:04:51 +0200:
> >>
> >>
> >>> I have been consistently getting a segfault when doing emerge sync.
> >>> This has happened 3 times all while at 51%:
> Have you run `fsck' on the filesystem containing `/usr/portage' lately? There might be a bad link at the spot where the segfault is consistently recurring.
>
> You may need to `shutdown -r -F now' to force a fsck of the root
> filesystem before it is mounted during boot up.
This did the trick! After the fsck i did an emerge sync and an update
of portage, and now all is well :-)
Thanks,
/Anders
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* [gentoo-amd64] Re: Re: Segfault in emerge
2006-07-09 3:28 ` Anders Thøgersen
@ 2006-07-09 11:19 ` Duncan
2006-07-09 20:32 ` Anders Thøgersen
0 siblings, 1 reply; 10+ messages in thread
From: Duncan @ 2006-07-09 11:19 UTC (permalink / raw
To: gentoo-amd64
Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted
20060709032840.GA29449@genaro.mydomain, excerpted below, on Sun, 09 Jul
2006 05:28:40 +0200:
> On 12:11 Sun 21 May 2006, Bob Slawson wrote:
>> >
>> >> Anders:
>> >>
>> >>> I have been consistently getting a segfault when doing emerge sync.
>> >>> This has happened 3 times all while at 51%:
>> Have you run `fsck' on the filesystem containing `/usr/portage' lately?
>
> This did the trick! After the fsck i did an emerge sync and an update
> of portage, and now all is well :-)
Cool! =8^) Thanks for the fix report!
--
Duncan - List replies preferred. No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master." Richard Stallman
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [gentoo-amd64] Re: Re: Segfault in emerge
2006-07-09 11:19 ` [gentoo-amd64] " Duncan
@ 2006-07-09 20:32 ` Anders Thøgersen
0 siblings, 0 replies; 10+ messages in thread
From: Anders Thøgersen @ 2006-07-09 20:32 UTC (permalink / raw
To: gentoo-amd64
Hi
On 11:19 Sun 09 Jul 2006, Duncan wrote:
> Anders =?iso-8859-1?Q?Th=F8gersen?= <anderslt@gmail.com> posted
> 20060709032840.GA29449@genaro.mydomain, excerpted below, on Sun, 09 Jul
> 2006 05:28:40 +0200:
>
> > On 12:11 Sun 21 May 2006, Bob Slawson wrote:
> >> >
> >> >> Anders:
> >> >>
> >> >>> I have been consistently getting a segfault when doing emerge sync.
> >> >>> This has happened 3 times all while at 51%:
>
> >> Have you run `fsck' on the filesystem containing `/usr/portage' lately?
> >
> > This did the trick! After the fsck i did an emerge sync and an update
> > of portage, and now all is well :-)
>
> Cool! =8^) Thanks for the fix report!
And thank you! Reading your thorough replies are very valuable to me,I
seem to learn something new each time :-)
/Anders
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2006-07-09 18:34 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-05-11 23:04 [gentoo-amd64] Segfault in emerge Anders Thøgersen
2006-05-12 0:06 ` Robert Walter
2006-05-20 22:35 ` Anders Thøgersen
2006-05-12 4:52 ` [gentoo-amd64] " Duncan
2006-05-20 22:30 ` Anders Thøgersen
2006-05-21 14:31 ` [gentoo-amd64] " Duncan
2006-05-21 16:11 ` [gentoo-amd64] " Bob Slawson
2006-07-09 3:28 ` Anders Thøgersen
2006-07-09 11:19 ` [gentoo-amd64] " Duncan
2006-07-09 20:32 ` Anders Thøgersen
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox