public inbox for gentoo-user@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-user]  Does this drive need a funeral?
@ 2011-11-01 18:58 Dale
  2011-11-01 19:07 ` Mark Knecht
                   ` (2 more replies)
  0 siblings, 3 replies; 17+ messages in thread
From: Dale @ 2011-11-01 18:58 UTC (permalink / raw
  To: gentoo-user

Hi,

For the first time in my life, I think I have a drive failing on me.  
Here is the info:

root@smoker / # smartctl -a /dev/sdc
smartctl 5.40 2010-10-16 r3189 [i686-pc-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar family
Device Model:     WDC WD800BB-00DKA0
Serial Number:    WD-WCAHL2497094
Firmware Version: 77.07W77
User Capacity:    80,026,361,856 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   6
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Tue Nov  1 13:52:49 2011 CDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.

General SMART Values:
Offline data collection status:  (0x85) Offline data collection activity
                                         was aborted by an interrupting 
command from host.
                                         Auto Offline Data Collection: 
Enabled.
Self-test execution status:      (  73) The previous self-test completed 
having
                                         a test element that failed and 
the test
                                         element that failed is not known.
Total time to complete Offline
data collection:                 (2478) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                         Auto Offline data collection 
on/off support.
                                         Suspend Offline collection upon new
                                         command.
                                         Offline surface scan supported.
                                         Self-test supported.
                                         Conveyance Self-test supported.
                                         Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                         power-saving mode.
                                         Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                         No General Purpose Logging support.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  38) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      
UPDATED  WHEN_FAILED RAW_VALUE
   1 Raw_Read_Error_Rate     0x000b   018   001   051    Pre-fail  
Always   FAILING_NOW 1904
   3 Spin_Up_Time            0x0007   087   084   021    Pre-fail  
Always       -       2166
   4 Start_Stop_Count        0x0032   099   099   040    Old_age   
Always       -       1288
   5 Reallocated_Sector_Ct   0x0033   199   199   140    Pre-fail  
Always       -       1
   7 Seek_Error_Rate         0x000b   200   200   051    Pre-fail  
Always       -       0
   9 Power_On_Hours          0x0032   023   023   000    Old_age   
Always       -       56466
  10 Spin_Retry_Count        0x0013   100   100   051    Pre-fail  
Always       -       0
  11 Calibration_Retry_Count 0x0013   100   100   051    Pre-fail  
Always       -       0
  12 Power_Cycle_Count       0x0032   099   099   000    Old_age   
Always       -       1039
194 Temperature_Celsius     0x0022   110   253   000    Old_age   
Always       -       33
196 Reallocated_Event_Count 0x0032   199   199   000    Old_age   
Always       -       1
197 Current_Pending_Sector  0x0012   199   199   000    Old_age   
Always       -       17
198 Offline_Uncorrectable   0x0012   200   200   000    Old_age   
Always       -       10
199 UDMA_CRC_Error_Count    0x000a   200   253   000    Old_age   
Always       -       1155
200 Multi_Zone_Error_Rate   0x0009   195   085   051    Pre-fail  
Offline      -       191

SMART Error Log Version: 1
ATA Error Count: 4449 (device log contains only the most recent five errors)
         CR = Command Register [HEX]
         FR = Features Register [HEX]
         SC = Sector Count Register [HEX]
         SN = Sector Number Register [HEX]
         CL = Cylinder Low Register [HEX]
         CH = Cylinder High Register [HEX]
         DH = Device/Head Register [HEX]
         DC = Device Command Register [HEX]
         ER = Error register [HEX]
         ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 4449 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:13.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:13.550  NOP [Abort queued commands]
   01 00 ef 00 00 45 00 00      01:47:13.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:13.550  NOP [Abort queued commands]
   01 00 c8 00 00 08 00 00      01:47:13.550  [RESERVED]

Error 4448 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:11.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:11.550  NOP [Abort queued commands]
   01 00 ef 00 00 45 00 00      01:47:11.550  [RESERVED]
   01 00 27 00 00 00 00 00      01:47:11.550  [RESERVED]
   04 00 a8 00 00 a7 73 00      01:47:11.550  [RESERVED]

Error 4447 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:09.500  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:09.500  NOP [Abort queued commands]
   01 00 ef 00 00 45 00 00      01:47:09.500  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:09.500  NOP [Abort queued commands]
   01 00 c8 00 00 08 00 00      01:47:09.500  [RESERVED]

Error 4446 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:07.550  [RESERVED]
   01 00 ec 00 00 00 00 00      01:47:07.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:07.550  NOP [Abort queued commands]
   01 00 ec 00 00 00 00 00      01:47:07.550  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:07.550  NOP [Abort queued commands]

Error 4445 occurred at disk power-on lifetime: 759 hours (31 days + 15 
hours)
   When the command that caused the error occurred, the device was doing 
SMART Offline or Self-test.

   After command completion occurred, registers were:
   ER ST SC SN CL CH DH
   -- -- -- -- -- -- --
   40 51 08 a7 73 a8 f4

   Commands leading to the command that caused the error were:
   CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
   -- -- -- -- -- -- -- --  ----------------  --------------------
   01 00 c8 00 00 08 00 00      01:47:05.600  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:05.600  NOP [Abort queued commands]
   01 00 ef 00 00 45 00 00      01:47:05.600  [RESERVED]
   00 00 00 00 00 00 00 00      01:47:05.600  NOP [Abort queued commands]
   01 00 c8 00 00 08 00 00      01:47:05.600  [RESERVED]

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: unknown failure    90%       
760         4294705476
# 2  Extended offline    Completed: unknown failure    90%       
759         4294705476
# 3  Extended offline    Completed: unknown failure    90%       
759         4294705476
# 4  Extended offline    Completed without error       00%      
1038         -
# 5  Short offline       Completed without error       00%      
1037         -
# 6  Extended offline    Completed without error       00%      
1075         -
# 7  Extended offline    Completed without error       00%       
305         -
# 8  Extended offline    Completed without error       00%       
660         -
# 9  Extended offline    Completed without error       00%       
213         -
#10  Extended offline    Completed without error       00%       
687         -
#11  Extended offline    Completed without error       00%       
686         -
#12  Extended offline    Completed without error       00%       
629         -

SMART Selective self-test log data structure revision number 1
  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
     1        0        0  Not_testing
     2        0        0  Not_testing
     3        0        0  Not_testing
     4        0        0  Not_testing
     5        0        0  Not_testing
Selective self-test flags (0x0):
   After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

root@smoker / #

What you folks think?  Can I fix it somehow?  I got a good shovel handy 
just in case.

Dale

:-)  :-)



^ permalink raw reply	[flat|nested] 17+ messages in thread

end of thread, other threads:[~2011-11-02 21:08 UTC | newest]

Thread overview: 17+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-11-01 18:58 [gentoo-user] Does this drive need a funeral? Dale
2011-11-01 19:07 ` Mark Knecht
2011-11-01 19:47   ` Dale
2011-11-01 19:58     ` Michael Mol
2011-11-01 20:04     ` Joost Roeleveld
2011-11-02  1:17 ` Dale
2011-11-02  1:35   ` Dale
2011-11-02 14:02     ` Michael Mol
2011-11-02 14:07       ` Neil Bothwick
2011-11-02 15:29         ` Dale
2011-11-02 14:15   ` Lorenzo Bandieri
2011-11-02 14:18     ` Lorenzo Bandieri
2011-11-02 14:52       ` Pandu Poluan
2011-11-02 15:33     ` Dale
2011-11-02 16:51   ` James Broadhead
2011-11-02 17:16     ` Dale
2011-11-02 21:01 ` Mick

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox