LINUX.ORG.RU

Пациент «мертв» ?

 ,


0

2

Диск не монтируется, ошибки в dmesg.

Dmesg

 ata1.00: exception Emask 0x0 SAct 0x20000 SErr 0x0 action 0x0
 ata1.00: irq_stat 0x40000008
 ata1.00: failed command: READ FPDMA QUEUED
 ata1.00: cmd 60/08:88:00:10:00/00:00:00:00:00/40 tag 17 ncq dma 4096 in
          res 51/40:08:00:10:00/00:00:00:00:00/40 Emask 0x409 (media error) <F>
 ata1.00: status: { DRDY ERR }
 ata1.00: error: { UNC }
 ata1.00: configured for UDMA/133
 sd 0:0:0:0: [sdb] tag#17 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE cmd_age=4s
 sd 0:0:0:0: [sdb] tag#17 Sense Key : Medium Error [current]
 sd 0:0:0:0: [sdb] tag#17 Add. Sense: Unrecovered read error - auto reallocate failed
 sd 0:0:0:0: [sdb] tag#17 CDB: Read(10) 28 00 00 00 10 00 00 00 08 00
 blk_update_request: I/O error, dev sdb, sector 4096 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
 ata1: EH complete
 ata1.00: exception Emask 0x0 SAct 0x800000 SErr 0x0 action 0x0
 ata1.00: irq_stat 0x40000008
 ata1.00: failed command: READ FPDMA QUEUED
 ata1.00: cmd 60/08:b8:00:10:00/00:00:00:00:00/40 tag 23 ncq dma 4096 in
          res 51/40:08:00:10:00/00:00:00:00:00/40 Emask 0x409 (media error) <F>
 ata1.00: status: { DRDY ERR }
 ata1.00: error: { UNC }
 ata1.00: configured for UDMA/133
 sd 0:0:0:0: [sdb] tag#23 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE cmd_age=4s
 sd 0:0:0:0: [sdb] tag#23 Sense Key : Medium Error [current]
 sd 0:0:0:0: [sdb] tag#23 Add. Sense: Unrecovered read error - auto reallocate failed
 sd 0:0:0:0: [sdb] tag#23 CDB: Read(10) 28 00 00 00 10 00 00 00 08 00
 blk_update_request: I/O error, dev sdb, sector 4096 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
 Buffer I/O error on dev sdb, logical block 512, async page read

Smartctl -a

smartctl 6.6 2017-11-05 r4594 [x86_64-linux-5.8.0-0.bpo.2-amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     HGST Travelstar Z5K1000
Device Model:     HGST HTS541075A7E630
Serial Number:    S0A100SNG0SXZL
LU WWN Device Id: 5 000cca 754c059d9
Firmware Version: SE2OA4A0
User Capacity:    750,156,374,016 bytes [750 GB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Form Factor:      2.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS T13/1699-D revision 6
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sat Dec  5 22:06:21 2020 MSK
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(   45) seconds.
Offline data collection
capabilities: 			 (0x5b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 ( 203) minutes.
SCT capabilities: 	       (0x003d)	SCT Status supported.
					SCT Error Recovery Control supported.
					SCT Feature Control supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   095   095   062    Pre-fail  Always       -       851968
  2 Throughput_Performance  0x0005   100   100   040    Pre-fail  Offline      -       0
  3 Spin_Up_Time            0x0007   143   143   033    Pre-fail  Always       -       1
  4 Start_Stop_Count        0x0012   033   033   000    Old_age   Always       -       106669
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   040    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0012   070   070   000    Old_age   Always       -       13430
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       2762
191 G-Sense_Error_Rate      0x000a   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   099   099   000    Old_age   Always       -       219
193 Load_Cycle_Count        0x0012   061   061   000    Old_age   Always       -       391991
194 Temperature_Celsius     0x0002   250   250   000    Old_age   Always       -       24 (Min/Max 5/46)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       8
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       0
223 Load_Retry_Count        0x000a   100   100   000    Old_age   Always       -       0

SMART Error Log Version: 1
ATA Error Count: 395 (device log contains only the most recent five errors)
	CR = Command Register [HEX]
	FR = Features Register [HEX]
	SC = Sector Count Register [HEX]
	SN = Sector Number Register [HEX]
	CL = Cylinder Low Register [HEX]
	CH = Cylinder High Register [HEX]
	DH = Device/Head Register [HEX]
	DC = Device Command Register [HEX]
	ER = Error register [HEX]
	ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 395 occurred at disk power-on lifetime: 13430 hours (559 days + 14 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 01 00 10 00 00  Error: UNC at LBA = 0x00001000 = 4096

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 01 b0 00 10 00 40 00      00:02:44.665  READ FPDMA QUEUED
  ef 10 02 00 00 00 a0 00      00:02:44.665  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 e0 00      00:02:44.664  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 a0 00      00:02:44.663  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:02:44.663  SET FEATURES [Set transfer mode]

Error 394 occurred at disk power-on lifetime: 13430 hours (559 days + 14 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 02 02 10 00 00  Error: UNC at LBA = 0x00001002 = 4098

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 02 28 02 10 00 40 00      00:02:40.621  READ FPDMA QUEUED
  ef 10 02 00 00 00 a0 00      00:02:40.621  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 e0 00      00:02:40.620  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 a0 00      00:02:40.619  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:02:40.619  SET FEATURES [Set transfer mode]

Error 393 occurred at disk power-on lifetime: 13430 hours (559 days + 14 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 02 02 10 00 00  Error: UNC at LBA = 0x00001002 = 4098

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 02 68 02 10 00 40 00      00:02:36.577  READ FPDMA QUEUED
  ef 10 02 00 00 00 a0 00      00:02:36.577  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 e0 00      00:02:36.577  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 a0 00      00:02:36.576  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:02:36.575  SET FEATURES [Set transfer mode]

Error 392 occurred at disk power-on lifetime: 13430 hours (559 days + 14 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 02 02 10 00 00  Error: UNC at LBA = 0x00001002 = 4098

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 02 20 02 10 00 40 00      00:02:32.522  READ FPDMA QUEUED
  ef 10 02 00 00 00 a0 00      00:02:32.521  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 e0 00      00:02:32.521  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 a0 00      00:02:32.520  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:02:32.520  SET FEATURES [Set transfer mode]

Error 391 occurred at disk power-on lifetime: 13430 hours (559 days + 14 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 00 10 00 00  Error: UNC at LBA = 0x00001000 = 4096

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 08 88 00 10 00 40 00      00:02:28.465  READ FPDMA QUEUED
  ef 10 02 00 00 00 a0 00      00:02:28.465  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 e0 00      00:02:28.465  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 a0 00      00:02:28.464  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      00:02:28.464  SET FEATURES [Set transfer mode]

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Aborted by host               90%      4152         -
# 2  Extended offline    Completed without error       00%      3329         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
★★

197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 8

8 секторов не читаются. Вылечил 3 диска с такими проблемами, до сих пор живы, смарт не ругается. Вспоминать как лечить лень, поэтому кину первую найденную ссылку на русском https://www.alexeykopytko.com/2018/smartctl-dd/

anonymous ()

Судя по 1 на всё в порядке с SATA шнурком/разъёмом (дребезг контактов?). Судя по 4, 9, 12, 193 пере-включение в среднем каждые 4-5 часов, остановка шпинделя каждые 7-8 минут, парковка головки кадые 3-4 минуты - типичный ноут. 197 могут образоваться при внезапных выключениях, их надо найти и перезаписать (так лечил hdd на ноуте).

anonymous ()

Мёртв, выбрасывай, и пакупай новый.
Там блины щас идут с керамики, а не металла.
Есть брак который проявляется через пол года. Потом он переодически подглючивает но кое как работает.
Лечить викториями или fsck.ext4 -c /dev/sda1 бесполезно.
Он серавно будет глючить. Бери WD Black и не парься.

red_rain ()
Ответ на: комментарий от Samamy

Если лечил перезаписью сектора, посмотри еще раз smart 196 Reallocated_Event_Count и 198 Offline_Uncorrectable. Если увеличилось, то было повреждение поверхности платины, что не очень хорошо.

anonymous ()