public inbox for gentoo-user@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-user]  Does this drive need a funeral?
@ 2011-11-01 18:58 Dale
  2011-11-01 19:07 ` Mark Knecht
                   ` (2 more replies)
  0 siblings, 3 replies; 17+ messages in thread
From: Dale @ 2011-11-01 18:58 UTC (permalink / raw
  To: gentoo-user

Hi,

For the first time in my life, I think I have a drive failing on me.  
Here is the info:

root@smoker / # smartctl -a /dev/sdc
smartctl 5.40 2010-10-16 r3189 [i686-pc-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar family
Device Model:     WDC WD800BB-00DKA0
Serial Number:    WD-WCAHL2497094
Firmware Version: 77.07W77
User Capacity:    80,026,361,856 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   6
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Tue Nov  1 13:52:49 2011 CDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.

General SMART Values:
Offline data collection status:  (0x85) Offline data collection activity
                                         was aborted by an interrupting 
command from host.
                                         Auto Offline Data Collection: 
Enabled.
Self-test execution status:      (  73) The previous self-test completed 
having
                                         a test element that failed and 
the test
                                         element that failed is not known.
Total time to complete Offline
data collection:                 (2478) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                         Auto Offline data collection 
on/off support.
                                         Suspend Offline collection upon new
                                         command.
                                         Offline surface scan supported.
                                         Self-test supported.
                                         Conveyance Self-test supported.
                                         Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                         power-saving mode.
                                         Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                         No General Purpose Logging support.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  38) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      
UPDATED  WHEN_FAILED RAW_VALUE
   1 Raw_Read_Error_Rate     0x000b   018   001   051    Pre-fail  
Always   FAILING_NOW 1904
   3 Spin_Up_Time            0x0007   087   084   021    Pre-fail  
Always       -       2166
   4 Start_Stop_Count        0x0032   099   099   040    Old_age   
Always       -       1288
   5 Reallocated_Sector_Ct   0x0033   199   199   140    Pre-fail  
Always       -       1
   7 Seek_Error_Rate         0x000b   200   200   051    Pre-fail  
Always       -       0
   9 Power_On_Hours          0x0032   023   023   000    Old_age   
Always       -       56466
  10 Spin_Retry_Count        0x0013   100   100   051    Pre-fail  
Always       -       0
  11 Calibration_Retry_Count 0x0013   100   100   051    Pre-fail  
Always       -       0
  12 Power_Cycle_Count       0x0032   099   099   000    Old_age   
Always       -       1039
194 Temperature_Celsius     0x0022   110   253   000    Old_age   
Always       -       33
196 Reallocated_Event_Count 0x0032   199   199   000    Old_age   
Always       -       1
197 Current_Pending_Sector  0x0012   199   199   000    Old_age   
Always       -       17
198 Offline_Uncorrectable   0x0012   200   200   000    Old_age   
Always       -       10
199 UDMA_CRC_Error_Count    0x000a   200   253   000    Old_age   
Always       -       1155
200 Multi_Zone_Error_Rate   0x0009   195   085   051    Pre-fail  
Offline      -       191

SMART Error Log Version: 1
ATA Error Count: 4449 (device log contains only the most recent five errors)
         CR = Command Register [HEX]
         FR = Features Register [HEX]
         SC = Sector Count Register [HEX]
         SN = Sector Number Register [HEX]
         CL = Cylinder Low Register [HEX]
         CH = Cylinder High Register [HEX]
         DH = Device/Head Register [HEX]
         DC = Device Command Register [HEX]
         ER = Error register [HEX]
         ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 4449 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:13.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:13.550  NOP [Abort queued commands]
   01 00 ef 00 00 45 00 00      01:47:13.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:13.550  NOP [Abort queued commands]
   01 00 c8 00 00 08 00 00      01:47:13.550  [RESERVED]

Error 4448 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:11.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:11.550  NOP [Abort queued commands]
   01 00 ef 00 00 45 00 00      01:47:11.550  [RESERVED]
   01 00 27 00 00 00 00 00      01:47:11.550  [RESERVED]
   04 00 a8 00 00 a7 73 00      01:47:11.550  [RESERVED]

Error 4447 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:09.500  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:09.500  NOP [Abort queued commands]
   01 00 ef 00 00 45 00 00      01:47:09.500  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:09.500  NOP [Abort queued commands]
   01 00 c8 00 00 08 00 00      01:47:09.500  [RESERVED]

Error 4446 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:07.550  [RESERVED]
   01 00 ec 00 00 00 00 00      01:47:07.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:07.550  NOP [Abort queued commands]
   01 00 ec 00 00 00 00 00      01:47:07.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:07.550  NOP [Abort queued commands]

Error 4445 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:05.600  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:05.600  NOP [Abort queued commands]
   01 00 ef 00 00 45 00 00      01:47:05.600  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:05.600  NOP [Abort queued commands]
   01 00 c8 00 00 08 00 00      01:47:05.600  [RESERVED]

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: unknown failure    90%       
760         4294705476
# 2  Extended offline    Completed: unknown failure    90%       
759         4294705476
# 3  Extended offline    Completed: unknown failure    90%       
759         4294705476
# 4  Extended offline    Completed without error       00%      
1038         -
# 5  Short offline       Completed without error       00%      
1037         -
# 6  Extended offline    Completed without error       00%      
1075         -
# 7  Extended offline    Completed without error       00%       
305         -
# 8  Extended offline    Completed without error       00%       
660         -
# 9  Extended offline    Completed without error       00%       
213         -
#10  Extended offline    Completed without error       00%       
687         -
#11  Extended offline    Completed without error       00%       
686         -
#12  Extended offline    Completed without error       00%       
629         -

SMART Selective self-test log data structure revision number 1
  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
     1        0        0  Not_testing
     2        0        0  Not_testing
     3        0        0  Not_testing
     4        0        0  Not_testing
     5        0        0  Not_testing
Selective self-test flags (0x0):
   After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

root@smoker / #

What you folks think?  Can I fix it somehow?  I got a good shovel handy 
just in case.

Dale

:-)  :-)



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-01 18:58 [gentoo-user] Does this drive need a funeral? Dale
@ 2011-11-01 19:07 ` Mark Knecht
  2011-11-01 19:47   ` Dale
  2011-11-02  1:17 ` Dale
  2011-11-02 21:01 ` Mick
  2 siblings, 1 reply; 17+ messages in thread
From: Mark Knecht @ 2011-11-01 19:07 UTC (permalink / raw
  To: gentoo-user

On Tue, Nov 1, 2011 at 11:58 AM, Dale <rdalek1967@gmail.com> wrote:
> Hi,
>
> For the first time in my life, I think I have a drive failing on me.  Here
> is the info:
>
<SNIP>
>
> What you folks think?  Can I fix it somehow?  I got a good shovel handy just
> in case.
>
> Dale

Start doing backups before you write even 1 more email! ;;-)

- Mark



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-01 19:07 ` Mark Knecht
@ 2011-11-01 19:47   ` Dale
  2011-11-01 19:58     ` Michael Mol
  2011-11-01 20:04     ` Joost Roeleveld
  0 siblings, 2 replies; 17+ messages in thread
From: Dale @ 2011-11-01 19:47 UTC (permalink / raw
  To: gentoo-user

Mark Knecht wrote:
> On Tue, Nov 1, 2011 at 11:58 AM, Dale<rdalek1967@gmail.com>  wrote:
>> Hi,
>>
>> For the first time in my life, I think I have a drive failing on me.  Here
>> is the info:
>>
> <SNIP>
>> What you folks think?  Can I fix it somehow?  I got a good shovel handy just
>> in case.
>>
>> Dale
> Start doing backups before you write even 1 more email! ;;-)
>
> - Mark
>
>


Well, it was in my brothers winders rig.  Winders couldn't do anything 
but puke on the keyboard so I brought it down here and put it in my old 
Linux rig.  I mounted it ro and got the data off it FIRST THING.  There 
was a boatload of pictures from their camera.  Anyway, the data seems to 
be safe tho a few may have gotten messed up.  I got to test that in a bit.

Is this terminal or can something be done to correct this?  I did run 
the dd command before I ran the selftest.  I don't think it matters but 
thought it wouldn't hurt either.  If it is terminal, I'll get my 
screwdriver out and see what these drives look like on the inside.  The 
last one I looked into was a old 14" thing many years ago.  Platters 
were about the size of 33 rpm records. lol  Dang I'm old.  O_O

Dale

:-)  :-)



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-01 19:47   ` Dale
@ 2011-11-01 19:58     ` Michael Mol
  2011-11-01 20:04     ` Joost Roeleveld
  1 sibling, 0 replies; 17+ messages in thread
From: Michael Mol @ 2011-11-01 19:58 UTC (permalink / raw
  To: gentoo-user

On Tue, Nov 1, 2011 at 3:47 PM, Dale <rdalek1967@gmail.com> wrote:
> Mark Knecht wrote:
>> On Tue, Nov 1, 2011 at 11:58 AM, Dale<rdalek1967@gmail.com>  wrote:
> Well, it was in my brothers winders rig.  Winders couldn't do anything but
> puke on the keyboard so I brought it down here and put it in my old Linux
> rig.  I mounted it ro and got the data off it FIRST THING.  There was a
> boatload of pictures from their camera.  Anyway, the data seems to be safe
> tho a few may have gotten messed up.  I got to test that in a bit.
>
> Is this terminal or can something be done to correct this?  I did run the dd
> command before I ran the selftest.

Normally, you can extract a little more life out of the drive, but
it's on its last legs. At work, I was once stuck using a drive that
failed on me three times. Each time, it happened to be in the
hibernate.sys file, which meant the machine didn't wake up properly
each time the drive glitched. I was on that drive a few months before
it was finally replaced.

The trouble with that approach, though, is that you don't know when
the drive is going to glitch out, and it's more difficult to diagnose
other issues when it might be a strangely-corrupted file somewhere.

>  I don't think it matters but thought it
> wouldn't hurt either.  If it is terminal, I'll get my screwdriver out and
> see what these drives look like on the inside.  The last one I looked into
> was a old 14" thing many years ago.  Platters were about the size of 33 rpm
> records. lol  Dang I'm old.  O_O

I'm partial to percussive data destruction, myself. The steel platters
will even stop .45 ACP, but they don't stop any rifle round I've
tried. If you want to see what they look like, just look online. If
you want the magnets, I highly recommend K&J Magnetics.[1] I've done a
variety of fun craft projects with them.[2] :)

[1] http://www.kjmagnetics.com/Default.asp
[2] http://www.flickr.com/photos/28208534@N07/sets/72157620071707617/

-- 
:wq



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-01 19:47   ` Dale
  2011-11-01 19:58     ` Michael Mol
@ 2011-11-01 20:04     ` Joost Roeleveld
  1 sibling, 0 replies; 17+ messages in thread
From: Joost Roeleveld @ 2011-11-01 20:04 UTC (permalink / raw
  To: gentoo-user

On Tuesday, November 01, 2011 02:47:27 PM Dale wrote:
> Mark Knecht wrote:
> > On Tue, Nov 1, 2011 at 11:58 AM, Dale<rdalek1967@gmail.com>  wrote:
> >> Hi,
> >> 
> >> For the first time in my life, I think I have a drive failing on me. 
> >> Here> 
> >> is the info:
> > <SNIP>
> > 
> >> What you folks think?  Can I fix it somehow?  I got a good shovel
> >> handy just in case.
> >> 
> >> Dale
> > 
> > Start doing backups before you write even 1 more email! ;;-)
> > 
> > - Mark
> 
> Well, it was in my brothers winders rig.  Winders couldn't do anything
> but puke on the keyboard so I brought it down here and put it in my old
> Linux rig.  I mounted it ro and got the data off it FIRST THING.

Good idea...
MS Windows has a tendency not to be able to handle failing disks...

> There
> was a boatload of pictures from their camera.  Anyway, the data seems to
> be safe tho a few may have gotten messed up.  I got to test that in a bit.

If there were no read errors, any damaged files were caused by ms windows, not 
the disk.

> Is this terminal or can something be done to correct this?  I did run
> the dd command before I ran the selftest.  I don't think it matters but
> thought it wouldn't hurt either.  If it is terminal, I'll get my
> screwdriver out and see what these drives look like on the inside.  The
> last one I looked into was a old 14" thing many years ago.  Platters
> were about the size of 33 rpm records. lol  Dang I'm old.  O_O

If SMART is saying it will die in 24 hours it will make a nice doorstop :)

I wouldn't use it for data (even throw away stuff) anymore.

--
Joost



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user]  Does this drive need a funeral?
  2011-11-01 18:58 [gentoo-user] Does this drive need a funeral? Dale
  2011-11-01 19:07 ` Mark Knecht
@ 2011-11-02  1:17 ` Dale
  2011-11-02  1:35   ` Dale
                     ` (2 more replies)
  2011-11-02 21:01 ` Mick
  2 siblings, 3 replies; 17+ messages in thread
From: Dale @ 2011-11-02  1:17 UTC (permalink / raw
  To: gentoo-user

Dale wrote:
> Hi,
>
> For the first time in my life, I think I have a drive failing on me.  
> Here is the info:
>
> <<<< SNIP >>>>
>
> What you folks think?  Can I fix it somehow?  I got a good shovel 
> handy just in case.
>
> Dale
>
> :-)  :-)
>


OK.  For those that have never seen this before, here is what messages 
looks like when a hard drive is going belly up:

Nov  1 19:08:09 localhost kernel: ata4.01: exception Emask 0x0 SAct 0x0 
SErr 0x0 action 0x6 frozen
Nov  1 19:08:09 localhost kernel: ata4.01: failed command: WRITE DMA EXT
Nov  1 19:08:09 localhost kernel: ata4.01: cmd 
35/00:8c:7f:78:a8/00:03:04:00:00/f0 tag 0 dma 464896 out
Nov  1 19:08:09 localhost kernel: res 
40/00:01:00:4f:c2/00:00:00:00:00/10 Emask 0x4 (timeout)
Nov  1 19:08:09 localhost kernel: ata4.01: status: { DRDY }
Nov  1 19:08:14 localhost kernel: ata4: link is slow to respond, please 
be patient (ready=0)
Nov  1 19:08:19 localhost kernel: ata4: device not ready (errno=-16), 
forcing hardreset
Nov  1 19:08:19 localhost kernel: ata4: soft resetting link
Nov  1 19:08:19 localhost kernel: ata4.01: failed to IDENTIFY (I/O 
error, err_mask=0x3)
Nov  1 19:08:19 localhost kernel: ata4.01: revalidation failed (errno=-5)
Nov  1 19:08:24 localhost kernel: ata4: soft resetting link
Nov  1 19:08:24 localhost kernel: ata4.01: failed to IDENTIFY (I/O 
error, err_mask=0x3)
Nov  1 19:08:24 localhost kernel: ata4.01: revalidation failed (errno=-5)
Nov  1 19:08:29 localhost kernel: ata4: soft resetting link
Nov  1 19:08:29 localhost kernel: ata4.01: failed to IDENTIFY (I/O 
error, err_mask=0x3)
Nov  1 19:08:29 localhost kernel: ata4.01: revalidation failed (errno=-5)
Nov  1 19:08:29 localhost kernel: ata4.01: disabled
Nov  1 19:08:29 localhost kernel: ata4.01: device reported invalid CHS 
sector 0
Nov  1 19:08:29 localhost kernel: ata4: soft resetting link
Nov  1 19:08:29 localhost kernel: ata4: EH complete
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] Unhandled error code
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc]  Result: 
hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] CDB: Write(10): 2a 
00 04 a8 78 7f 00 03 8c 00
Nov  1 19:08:29 localhost kernel: end_request: I/O error, dev sdc, 
sector 78149759
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] Unhandled error code
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc]  Result: 
hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] CDB: Write(10): 2a 
00 04 a8 7c 1f 00 03 13 00
Nov  1 19:08:29 localhost kernel: end_request: I/O error, dev sdc, 
sector 78150687
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] Unhandled error code
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc]  Result: 
hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] CDB: Read(10): 28 00 
00 00 4a f2 00 00 01 00
Nov  1 19:08:29 localhost kernel: end_request: I/O error, dev sdc, 
sector 19186
Nov  1 19:08:29 localhost kernel: FAT: FAT read failed (blocknr 19123)
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] Unhandled error code
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc]  Result: 
hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] CDB: Read(10): 28 00 
00 00 4a f2 00 00 01 00
Nov  1 19:08:29 localhost kernel: end_request: I/O error, dev sdc, 
sector 19186
Nov  1 19:08:29 localhost kernel: FAT: FAT read failed (blocknr 19123)
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] Unhandled error code
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc]  Result: 
hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Nov  1 19:08:29 localhost kernel: sd 3:0:1:0: [sdc] CDB: Read(10): 28 00 
00 00 4a f2 00 00 01 00
Nov  1 19:08:29 localhost kernel: end_request: I/O error, dev sdc, 
sector 19186

More info:

/dev/sdc1             78112544  39098000  39014544  51% /mnt/media

It won't accept any more files either.  That looks like this:

cp: cannot create directory `/mnt/media/home3/home/dale/.porthole': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.adobe': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.cache': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.gconf': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.fonts': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.gnupg': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.hplip': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.local': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.opera': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.tkdvd': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.fontconfig': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.macromedia': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.scribus': 
Input/output error
cp: cannot create directory `/mnt/media/home3/home/dale/.ooo3.old': 
Input/output error
cp: failed to preserve ownership for `/mnt/media/home3/home/dale': 
Operation not permitted
cp: cannot create directory `/mnt/media/home3/home/dale2': Input/output 
error
root@smoker / #


So, if you see things like this in your log file, rescue data real 
soon.  Make a note of this since this could happen on your rig one day.

Now I know this is a Western Digital drive.  Let's all face it, they all 
die eventually.  I just find it ironic that it is only a few months out 
of warranty.  :-@

Oh, funny how that drive ran fine on Linux then croaked when put in a 
windoze rig.  Maybe WD just hates M$ as much as I do.  LOL

Dale

:-)  :-)



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user]  Does this drive need a funeral?
  2011-11-02  1:17 ` Dale
@ 2011-11-02  1:35   ` Dale
  2011-11-02 14:02     ` Michael Mol
  2011-11-02 14:15   ` Lorenzo Bandieri
  2011-11-02 16:51   ` James Broadhead
  2 siblings, 1 reply; 17+ messages in thread
From: Dale @ 2011-11-02  1:35 UTC (permalink / raw
  To: gentoo-user

Dale wrote:
>
>
> So, if you see things like this in your log file, rescue data real 
> soon.  Make a note of this since this could happen on your rig one day.
>
> Now I know this is a Western Digital drive.  Let's all face it, they 
> all die eventually.  I just find it ironic that it is only a few 
> months out of warranty.  :-@
>
> Oh, funny how that drive ran fine on Linux then croaked when put in a 
> windoze rig.  Maybe WD just hates M$ as much as I do.  LOL
>
> Dale
>
> :-)  :-)
>

One last update.  When you zero the drive then cfdisk tells you this:

FATAL ERROR: Cannot read disk drive
Press any key to exit cfdisk

I think that means you have a nice new door stop.  My first drive to 
ever fail that I recall.  I still say windoze screwed up my drive.  It 
lasted about 3 days in the windoze rig.

Dale

:-)  :-)



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02  1:35   ` Dale
@ 2011-11-02 14:02     ` Michael Mol
  2011-11-02 14:07       ` Neil Bothwick
  0 siblings, 1 reply; 17+ messages in thread
From: Michael Mol @ 2011-11-02 14:02 UTC (permalink / raw
  To: gentoo-user

On Tue, Nov 1, 2011 at 9:35 PM, Dale <rdalek1967@gmail.com> wrote:
> Dale wrote:
> I think that means you have a nice new door stop.  My first drive to ever
> fail that I recall.  I still say windoze screwed up my drive.  It lasted
> about 3 days in the windoze rig.

It may also be that the Windows rig's PSU is unhealthy.

-- 
:wq



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02 14:02     ` Michael Mol
@ 2011-11-02 14:07       ` Neil Bothwick
  2011-11-02 15:29         ` Dale
  0 siblings, 1 reply; 17+ messages in thread
From: Neil Bothwick @ 2011-11-02 14:07 UTC (permalink / raw
  To: gentoo-user

[-- Attachment #1: Type: text/plain, Size: 598 bytes --]

On Wed, 2 Nov 2011 10:02:03 -0400, Michael Mol wrote:

> > I think that means you have a nice new door stop.  My first drive to
> > ever fail that I recall.  I still say windoze screwed up my drive.
> >  It lasted about 3 days in the windoze rig.  
> 
> It may also be that the Windows rig's PSU is unhealthy.

Or even the mains supply to that PSU if it is in a different location. I
had a lot of hardware failures after moving house, including several
expensive SCSI drives. They stopped after I bought a UPS.


-- 
Neil Bothwick

Pedestrians come in two types: Quick or Dead.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02  1:17 ` Dale
  2011-11-02  1:35   ` Dale
@ 2011-11-02 14:15   ` Lorenzo Bandieri
  2011-11-02 14:18     ` Lorenzo Bandieri
  2011-11-02 15:33     ` Dale
  2011-11-02 16:51   ` James Broadhead
  2 siblings, 2 replies; 17+ messages in thread
From: Lorenzo Bandieri @ 2011-11-02 14:15 UTC (permalink / raw
  To: gentoo-user

> OK.  For those that have never seen this before, here is what messages looks
> like when a hard drive is going belly up:
 (...)
> So, if you see things like this in your log file, rescue data real soon.
>  Make a note of this since this could happen on your rig one day.
>

Thanks, Dale. Actually I've never experienced an hard drive failure,
but your thread made me want to do one as soon as possible :D

(Besides that, "is going belly up" made me burst out laughing)

Lorenzo



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02 14:15   ` Lorenzo Bandieri
@ 2011-11-02 14:18     ` Lorenzo Bandieri
  2011-11-02 14:52       ` Pandu Poluan
  2011-11-02 15:33     ` Dale
  1 sibling, 1 reply; 17+ messages in thread
From: Lorenzo Bandieri @ 2011-11-02 14:18 UTC (permalink / raw
  To: gentoo-user

> Thanks, Dale. Actually I've never experienced an hard drive failure,
> but your thread made me want to do one as soon as possible :D

Sorry, I meant "made me want to do a backup as soon as possible"....



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02 14:18     ` Lorenzo Bandieri
@ 2011-11-02 14:52       ` Pandu Poluan
  0 siblings, 0 replies; 17+ messages in thread
From: Pandu Poluan @ 2011-11-02 14:52 UTC (permalink / raw
  To: gentoo-user

[-- Attachment #1: Type: text/plain, Size: 343 bytes --]

On Nov 2, 2011 9:22 PM, "Lorenzo Bandieri" <lorenzo.bandieri@gmail.com>
wrote:
>
> > Thanks, Dale. Actually I've never experienced an hard drive failure,
> > but your thread made me want to do one as soon as possible :D
>
> Sorry, I meant "made me want to do a backup as soon as possible"....
>

I almost thought you're a masochist :-P

Rgds,

[-- Attachment #2: Type: text/html, Size: 503 bytes --]

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02 14:07       ` Neil Bothwick
@ 2011-11-02 15:29         ` Dale
  0 siblings, 0 replies; 17+ messages in thread
From: Dale @ 2011-11-02 15:29 UTC (permalink / raw
  To: gentoo-user

Neil Bothwick wrote:
> On Wed, 2 Nov 2011 10:02:03 -0400, Michael Mol wrote:
>
>>> I think that means you have a nice new door stop.  My first drive to
>>> ever fail that I recall.  I still say windoze screwed up my drive.
>>>   It lasted about 3 days in the windoze rig.
>> It may also be that the Windows rig's PSU is unhealthy.
> Or even the mains supply to that PSU if it is in a different location. I
> had a lot of hardware failures after moving house, including several
> expensive SCSI drives. They stopped after I bought a UPS.
>
>

Well, his rig does start up strange.  You have to push and HOLD the 
power button to get it to stay on.  If you just push it like normal, it 
goes back off again.  Sort of gets on my nerves but I don't know if it 
is a mobo or power supply issue.  Could even be something else too.

Hmmmmm.

Dale

:-)  :-)



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02 14:15   ` Lorenzo Bandieri
  2011-11-02 14:18     ` Lorenzo Bandieri
@ 2011-11-02 15:33     ` Dale
  1 sibling, 0 replies; 17+ messages in thread
From: Dale @ 2011-11-02 15:33 UTC (permalink / raw
  To: gentoo-user

Lorenzo Bandieri wrote:
>> OK.  For those that have never seen this before, here is what messages looks
>> like when a hard drive is going belly up:
>   (...)
>> So, if you see things like this in your log file, rescue data real soon.
>>   Make a note of this since this could happen on your rig one day.
>>
> Thanks, Dale. Actually I've never experienced an hard drive failure,
> but your thread made me want to do one as soon as possible :D
>
> (Besides that, "is going belly up" made me burst out laughing)
>
> Lorenzo
>
>

Well, I live in the sticks.  I see opossums and especially armadillos 
belly up all the time.  They look like those animal balloons.  lol  
Their little feet stuck up in the air and all.  If you ever run across 
one of those, do NOT poke it with a stick.  It can be bad real fast.

I just wish hard drives were as tough as the armadillos tho.

Dale

:-)  :-)



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02  1:17 ` Dale
  2011-11-02  1:35   ` Dale
  2011-11-02 14:15   ` Lorenzo Bandieri
@ 2011-11-02 16:51   ` James Broadhead
  2011-11-02 17:16     ` Dale
  2 siblings, 1 reply; 17+ messages in thread
From: James Broadhead @ 2011-11-02 16:51 UTC (permalink / raw
  To: gentoo-user

On 2 November 2011 01:17, Dale <rdalek1967@gmail.com> wrote:
> Dale wrote:
>>
>> Hi,
>>
>> For the first time in my life, I think I have a drive failing on me.  Here
>> is the info:
>>
>> <<<< SNIP >>>>
>>
>> What you folks think?  Can I fix it somehow?  I got a good shovel handy
>> just in case.
>>
>> Dale
>>
>> :-)  :-)
>>
> Nov  1 19:08:09 localhost kernel: ata4.01: status: { DRDY }
> Nov  1 19:08:14 localhost kernel: ata4: link is slow to respond, please be
> patient (ready=0)
> Nov  1 19:08:19 localhost kernel: ata4: device not ready (errno=-16),
> forcing hardreset
> Nov  1 19:08:19 localhost kernel: ata4: soft resetting link

I have RMA'd many drives upon seeing errors like this. I eventually
tracked it down to a faulty SATA cable. :-/



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user] Does this drive need a funeral?
  2011-11-02 16:51   ` James Broadhead
@ 2011-11-02 17:16     ` Dale
  0 siblings, 0 replies; 17+ messages in thread
From: Dale @ 2011-11-02 17:16 UTC (permalink / raw
  To: gentoo-user

James Broadhead wrote:
> On 2 November 2011 01:17, Dale<rdalek1967@gmail.com>  wrote:
>> Dale wrote:
>>> Hi,
>>>
>>> For the first time in my life, I think I have a drive failing on me.  Here
>>> is the info:
>>>
>>> <<<<  SNIP>>>>
>>>
>>> What you folks think?  Can I fix it somehow?  I got a good shovel handy
>>> just in case.
>>>
>>> Dale
>>>
>>> :-)  :-)
>>>
>> Nov  1 19:08:09 localhost kernel: ata4.01: status: { DRDY }
>> Nov  1 19:08:14 localhost kernel: ata4: link is slow to respond, please be
>> patient (ready=0)
>> Nov  1 19:08:19 localhost kernel: ata4: device not ready (errno=-16),
>> forcing hardreset
>> Nov  1 19:08:19 localhost kernel: ata4: soft resetting link
> I have RMA'd many drives upon seeing errors like this. I eventually
> tracked it down to a faulty SATA cable. :-/
>
>

Well, this one does it with two different cables and it is IDE.  I have 
to admit tho, I was hoping it would work fine after I brought it down 
here and put it in my rig where it used to be.  I even checked the 
jumper to make sure it was set right.

I love grasping at straws tho.

Dale

:-)  :-)



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: [gentoo-user]  Does this drive need a funeral?
  2011-11-01 18:58 [gentoo-user] Does this drive need a funeral? Dale
  2011-11-01 19:07 ` Mark Knecht
  2011-11-02  1:17 ` Dale
@ 2011-11-02 21:01 ` Mick
  2 siblings, 0 replies; 17+ messages in thread
From: Mick @ 2011-11-02 21:01 UTC (permalink / raw
  To: gentoo-user

[-- Attachment #1: Type: Text/Plain, Size: 2260 bytes --]

On Tuesday 01 Nov 2011 18:58:14 Dale wrote:
> Hi,
> 
> For the first time in my life, I think I have a drive failing on me.
> Here is the info:
[snip ...]

> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: FAILED!
> Drive failure expected in less than 24 hours. SAVE ALL DATA.

That's not encouraging ...  :-/

Then this:

> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE
> UPDATED  WHEN_FAILED RAW_VALUE
>    1 Raw_Read_Error_Rate     0x000b   018   001   051    Pre-fail
> Always   FAILING_NOW 1904

The Value (018) being lower than the Threshold (051) indicates failure is 
imminent.  Of course it also screams at you that it's "FAILING NOW" in case 
you missed it!  :-))


>    3 Spin_Up_Time            0x0007   087   084   021    Pre-fail
> Always       -       2166
>    4 Start_Stop_Count        0x0032   099   099   040    Old_age
> Always       -       1288
>    5 Reallocated_Sector_Ct   0x0033   199   199   140    Pre-fail
> Always       -       1
>    7 Seek_Error_Rate         0x000b   200   200   051    Pre-fail
> Always       -       0
>    9 Power_On_Hours          0x0032   023   023   000    Old_age
> Always       -       56466
>   10 Spin_Retry_Count        0x0013   100   100   051    Pre-fail
> Always       -       0
>   11 Calibration_Retry_Count 0x0013   100   100   051    Pre-fail
> Always       -       0
>   12 Power_Cycle_Count       0x0032   099   099   000    Old_age
> Always       -       1039
> 194 Temperature_Celsius     0x0022   110   253   000    Old_age
> Always       -       33
> 196 Reallocated_Event_Count 0x0032   199   199   000    Old_age
> Always       -       1
> 197 Current_Pending_Sector  0x0012   199   199   000    Old_age
> Always       -       17
> 198 Offline_Uncorrectable   0x0012   200   200   000    Old_age
> Always       -       10
> 199 UDMA_CRC_Error_Count    0x000a   200   253   000    Old_age
> Always       -       1155
> 200 Multi_Zone_Error_Rate   0x0009   195   085   051    Pre-fail
> Offline      -       191

The remaining values look OK.
-- 
Regards,
Mick

[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 17+ messages in thread

end of thread, other threads:[~2011-11-02 21:08 UTC | newest]

Thread overview: 17+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-11-01 18:58 [gentoo-user] Does this drive need a funeral? Dale
2011-11-01 19:07 ` Mark Knecht
2011-11-01 19:47   ` Dale
2011-11-01 19:58     ` Michael Mol
2011-11-01 20:04     ` Joost Roeleveld
2011-11-02  1:17 ` Dale
2011-11-02  1:35   ` Dale
2011-11-02 14:02     ` Michael Mol
2011-11-02 14:07       ` Neil Bothwick
2011-11-02 15:29         ` Dale
2011-11-02 14:15   ` Lorenzo Bandieri
2011-11-02 14:18     ` Lorenzo Bandieri
2011-11-02 14:52       ` Pandu Poluan
2011-11-02 15:33     ` Dale
2011-11-02 16:51   ` James Broadhead
2011-11-02 17:16     ` Dale
2011-11-02 21:01 ` Mick

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox