From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from pigeon.gentoo.org ([208.92.234.80] helo=lists.gentoo.org) by finch.gentoo.org with esmtp (Exim 4.60) (envelope-from ) id 1Nl4u2-0004tp-JA for garchives@archives.gentoo.org; Fri, 26 Feb 2010 18:27:06 +0000 Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id A905FE0888; Fri, 26 Feb 2010 18:26:57 +0000 (UTC) Received: from mail-pv0-f181.google.com (mail-pv0-f181.google.com [74.125.83.181]) by pigeon.gentoo.org (Postfix) with ESMTP id 69A26E0888 for ; Fri, 26 Feb 2010 18:26:57 +0000 (UTC) Received: by pvc30 with SMTP id 30so96977pvc.40 for ; Fri, 26 Feb 2010 10:26:57 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type; bh=4/2FC8O/MFK/rwAYhbstPCm8nCSL9sQgDjIyxg7nX+4=; b=GQMCx+bek0S8UNoz7m5QWHiCMD1sdO9tCz++bnLnceG8iSjlKqHacOlioI/JC/s6Yx UKIDQHRSzVWf0Td5vawxfJsDk7ythldMjNjzIzvLUoQTijTb01por+IFXUGTR2raKdNc BG85P6u8l8DHdzIpkVkC8G+A3FlIqeVGJANN8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=nmVEPGBleTSBJcw7v2GWLAzMAGtbHnOctMSejwWPrYLlLyLCEai//TbQrve98iuf0H w1JPpiDBEf5OeF/rQ8ydt1iaQz04OFPrdFQPNkJGrWeFsxVBOkgZHews9EZakBjZCYbX bNYGOvrcfSEKJ5DjlhLH76NNTdb72xO7ZIqv8= Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-user@lists.gentoo.org Reply-to: gentoo-user@lists.gentoo.org MIME-Version: 1.0 Received: by 10.142.249.29 with SMTP id w29mr391631wfh.163.1267208816851; Fri, 26 Feb 2010 10:26:56 -0800 (PST) In-Reply-To: <201002261859.00344.volkerarmin@googlemail.com> References: <5bdc1c8b1002251933s6a250b99v607c97e09f41d4fe@mail.gmail.com> <201002261827.27513.wonko@wonkology.org> <5bdc1c8b1002260951l7cf3fd77g24a966cf9b399b5c@mail.gmail.com> <201002261859.00344.volkerarmin@googlemail.com> Date: Fri, 26 Feb 2010 10:26:56 -0800 Message-ID: <5bdc1c8b1002261026p4e5d1c22ucc0725511ec400e6@mail.gmail.com> Subject: Re: [gentoo-user] recovery from /var corruption? From: Mark Knecht To: gentoo-user@lists.gentoo.org Content-Type: text/plain; charset=UTF-8 X-Archives-Salt: 1802802b-879e-463c-b6b1-07ae2a14df3a X-Archives-Hash: 66998b72b6d9ca885bf9165eb07d4174 On Fri, Feb 26, 2010 at 9:59 AM, Volker Armin Hemmann wrote: > On Freitag 26 Februar 2010, Mark Knecht wrote: > >> >> The machine _mostly_ crashed while running badblocks. I say mostly >> because the mouse is still alive but I can no longer ssh in and cannot >> open a terminal on my wife's desktop or get to the console. > > because it is not crashed but waiting for the ide timeouts. So if I let it continue running is it going to come back in the next hour or two? I am assuming the IDE timeouts are because the drive is having trouble, correct? That's the theory here? If so then unless the software can mark them bad and somehow create good files out of bad then I'm still left with a machine that is going to need serious work done before it's a happy box again, correct? On the other hand, because I have reasonably good user backups (although no real system backups) right now if I bite the bullet and build the machine then when my wife gets it back it's hopefully going to be more reliable, wouldn't it? I'm thinking that maybe I just copy a little stuff off the box - /etc and the like - and then boot the machine with the Gentoo install CD or System Resuce CD and see what the drive is doing? That doesn't cost me anything to look around, but if SMART won't turn on and badblocks is suggesting the drive is having trouble maybe running something like badblocks and actually __marking__ blocks as bad and then reloading Gentoo would work in the long run? (A lot of work though.) I'm really not interested in buying new drive because the machine is ATA100/133 and if it's not the drive then the money is wasted for a new machine. The cheapest at NewEgg is about $40. Why spend the buck for an old Intel Centrino machine? > >> >> I tried to Ctrl-C out out of badblocks here (this is running shelled >> in) before I figured out it was a total crash which messed up the >> terminal a bit but you can see what it was reporting before the crash >> >> dragonfly ~ # badblocks -sv /dev/hda >> Checking blocks 0 to 156290903 >> Checking for bad blocks (read-only test): 89360960done, 35:00 elapsed >> 89360961done, 35:09 elapsed >> 89360962 >> 89360963 >> ^C^C18% done, 35:27 elapsed >> >> So, there seem to be problems, possibly with the drive, or maybe it's >> some sort of overheating problem on the processor and this was just >> the way the processor failed before the crash? >> >> I ran memtest86 night before last for 8 hours and had no memory >> problems. I'll remove memory and PCI cards, reseat everything, and >> then see what happens. > > protip: if you are running badblocks (or ddrescue) on a probably damaged > device - attach it with an usb adapter. That way your box is still usable. > > /me hates linux kernel for making processes in D unkillable and sucking very > much on diskio. > > Good inputs. Thanks! Cheers, Mark