2010/12/13 fs error

Sure enough, backing up to a remote server with rsync triggered a filesystem error.

    [245688.449699] EXT4-fs error (device sda3): htree_dirblock_to_tree: bad entry in directory #15109317: directory entry across blocks - block=60310786offset=0(0), inode=3633236108, rec_len=180364, name_len=142
    [245688.449721] Aborting journal on device sda3-8.
    [245688.450122] EXT4-fs (sda3): Remounting filesystem read-only

I booted from the Clonezilla 1.2.3-27 Live CD and ran fsck, which repaired the filesystem.

    user@debian:~$ sudo fsck /dev/sda3
    fsck from util-linux-ng 2.16.2
    e2fsck 1.41.9 (22-Aug-2009)
    /dev/sda3: recovering journal
    Clearing orphaned inode 11403383 (uid=1000, gid=1000, mode=0100600, size=18432)
    Clearing orphaned inode 11403382 (uid=1000, gid=1000, mode=0100600, size=122539)
    Clearing orphaned inode 2903394 (uid=0, gid=0, mode=0100644, size=2564)
    Clearing orphaned inode 5117767 (uid=0, gid=0, mode=040755, size=4096)
    Clearing orphaned inode 5117786 (uid=0, gid=0, mode=0100644, size=136)
    Clearing orphaned inode 5117795 (uid=0, gid=0, mode=0100644, size=1384456)
    Clearing orphaned inode 5117796 (uid=0, gid=0, mode=0100644, size=13512)
    Clearing orphaned inode 5117797 (uid=0, gid=0, mode=0100644, size=14602852)
    Clearing orphaned inode 5117890 (uid=0, gid=0, mode=040755, size=4096)
    Clearing orphaned inode 5117891 (uid=0, gid=0, mode=0100644, size=396838)
    Clearing orphaned inode 5117895 (uid=0, gid=0, mode=0100644, size=336148)
    Clearing orphaned inode 5117902 (uid=0, gid=0, mode=0100644, size=2097673)
    Clearing orphaned inode 5117906 (uid=0, gid=0, mode=040755, size=4096)
    Clearing orphaned inode 5117948 (uid=0, gid=0, mode=0100644, size=42292)
    Clearing orphaned inode 5117949 (uid=0, gid=0, mode=0100644, size=30008)
    Clearing orphaned inode 2359628 (uid=0, gid=0, mode=0100644, size=26152)
    Clearing orphaned inode 2359565 (uid=0, gid=0, mode=0100644, size=719540)
    Clearing orphaned inode 2359507 (uid=0, gid=0, mode=0100644, size=191280)
    Clearing orphaned inode 2359309 (uid=0, gid=0, mode=0100644, size=140828)
    Clearing orphaned inode 2636259 (uid=0, gid=0, mode=0100644, size=9731148)
    Clearing orphaned inode 15601116 (uid=0, gid=0, mode=0100644, size=294664)
    Clearing orphaned inode 15601112 (uid=0, gid=0, mode=0100644, size=1364764)
    Clearing orphaned inode 2631097 (uid=0, gid=0, mode=0100644, size=8098204)
    Clearing orphaned inode 11403364 (uid=1000, gid=1000, mode=0100600, size=56712)
    Clearing orphaned inode 11403334 (uid=1000, gid=1000, mode=0100600, size=53464)
    Clearing orphaned inode 11403332 (uid=1000, gid=1000, mode=0100600, size=317978)
    Clearing orphaned inode 2759066 (uid=1000, gid=1000, mode=0100600, size=2048)
    Clearing orphaned inode 2758210 (uid=1000, gid=1000, mode=0100600, size=0)
    Clearing orphaned inode 2753516 (uid=1000, gid=1000, mode=0100600, size=2576)
    Clearing orphaned inode 12978203 (uid=1000, gid=1000, mode=0100600, size=4096)
    Clearing orphaned inode 11403300 (uid=1000, gid=1000, mode=0100600, size=36824)
    Clearing orphaned inode 11403274 (uid=1000, gid=1000, mode=0100600, size=79992)
    Clearing orphaned inode 11403269 (uid=1000, gid=1000, mode=0100600, size=518026)
    Clearing orphaned inode 12978210 (uid=1000, gid=1000, mode=0100644, size=32768)
    Clearing orphaned inode 12978221 (uid=1000, gid=1000, mode=0100600, size=15412)
    /dev/sda3: clean, 1333035/27107328 files, 36744359/108406255 blocks

2010/12/06 HDD failure: bad sectors

When I tried to take an image backup with Clonezilla, Partclone stopped partway through with an error.

    Partclone v0.1.9 (Rev:343M) http://partclone.org
    Starting to clone device (/dev/sda3) to image (-)
    Reading Super Block
    Calculating bitmap...
    File system: EXTFS
    Device size: 263998 MB
    Space in use: 147176 MB
    Block size: 4096 Byte
    Used block count: 35931515
    Total block 64452415
    io_all: errno = Input/output error(5)
    *************************************************************************
    * WARNING: The disk has bad sector. This means physical damage on the   *
    * disk surface caused by deterioration, manufacturing faults or other   *
    * reason. The reliability of the disk may stay stable or degrade fast.  *
    * Use the --rescue option to efficiently save as much data as possible! *
    *************************************************************************

I retried the backup in Expert mode with the rescue option enabled, and this time it appears to have completed.

Next, I test the drive with SMART.

    $ sudo smartctl -t short /dev/sda
    [sudo] password for kenji:
    smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
    Home page is http://smartmontools.sourceforge.net/

    === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
    Sending command: "Execute SMART Short self-test routine immediately in off-line mode".
    Drive command "Execute SMART Short self-test routine immediately in off-line mode" successful.
    Testing has begun.
    Please wait 2 minutes for test to complete.
    Test will complete after Mon Dec 6 08:40:56 2010

    Use smartctl -X to abort test.

Display the test results.

    $ sudo smartctl -l selftest /dev/sda
    smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
    Home page is http://smartmontools.sourceforge.net/

    === START OF READ SMART DATA SECTION ===
    SMART Self-test log structure revision number 1
    Num  Test_Description    Status                     Remaining  LifeTime(hours)  LBA_of_first_error
    # 1  Short offline       Completed: read failure       90%          1208        436750599
    # 2  Extended offline    Completed without error       00%           172        -
    # 3  Short offline       Completed without error       00%           167        -

As expected, the short self-test reports a read failure. Next, I check the Current_Pending_Sector value.

    $ sudo smartctl -A /dev/sda
    smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
    Home page is http://smartmontools.sourceforge.net/

    === START OF READ SMART DATA SECTION ===
    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       2104
      3 Spin_Up_Time            0x0027   147   147   021    Pre-fail  Always       -       1608
      4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       692
      5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
      7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
      9 Power_On_Hours          0x0032   099   099   000    Old_age   Always       -       1208
     10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
     11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       689
    192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       31
    193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       836
    194 Temperature_Celsius     0x0022   121   094   000    Old_age   Always       -       22
    196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
    197 Current_Pending_Sector  0x0032   200   199   000    Old_age   Always       -       49
    198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
    199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
    200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

The raw value of Current_Pending_Sector is 49.
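
Reading the raw value off the attribute table by eye gets tedious if the check is repeated. The following is a minimal sketch, not part of the original procedure, that extracts raw values from `smartctl -A` style text. The function name `parse_smart_attributes` and the `SAMPLE` string are hypothetical and used for illustration only; the sample rows are copied from the output above. It assumes the ten-column table layout printed by this smartctl version, which may differ in other versions.

```python
def parse_smart_attributes(text):
    """Return {attribute_name: raw_value} parsed from `smartctl -A` style text.

    Hypothetical helper: assumes each attribute row has at least ten
    whitespace-separated columns and begins with a numeric ID.
    """
    attrs = {}
    for line in text.splitlines():
        fields = line.split()
        # Skip banners, the header row ("ID# ...") and blank lines.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        try:
            # Column 10 is RAW_VALUE; only its first token is used.
            attrs[fields[1]] = int(fields[9])
        except ValueError:
            # Raw value is not a plain integer; ignore this row.
            continue
    return attrs


# Sample rows copied from the smartctl output shown above.
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0032   200   199   000    Old_age   Always       -       49
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
"""

attrs = parse_smart_attributes(SAMPLE)
print(attrs["Current_Pending_Sector"])  # prints 49
```

In practice the text could come from running `sudo smartctl -A /dev/sda` and capturing its standard output. A non-zero Current_Pending_Sector count alongside a zero Reallocated_Sector_Ct, as here, matches the read failure reported by the short self-test.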