Top

2010/12/13 fs error

やはり、rsync でリモートサーバへバックアップをとったら fs error が出ました。

[245688.449699] EXT4-fs error (device sda3): htree_dirblock_to_tree: bad entry in directory #15109317: directory entry across blocks - block=60310786offset=0(0), inode=3633236108, rec_len=180364, name_len=142
[245688.449721] Aborting journal on device sda3-8.
[245688.450122] EXT4-fs (sda3): Remounting filesystem read-only

Clonezilla 1.2.3-27 Live CD で boot して、fsck をかけ回復しました。

user@debian:~$ sudo fsck /dev/sda3
fsck from util-linux-ng 2.16.2
e2fsck 1.41.9 (22-Aug-2009)
/dev/sda3: recovering journal
Clearing orphaned inode 11403383 (uid=1000, gid=1000, mode=0100600, size=18432)
Clearing orphaned inode 11403382 (uid=1000, gid=1000, mode=0100600, size=122539)
Clearing orphaned inode 2903394 (uid=0, gid=0, mode=0100644, size=2564)
Clearing orphaned inode 5117767 (uid=0, gid=0, mode=040755, size=4096)
Clearing orphaned inode 5117786 (uid=0, gid=0, mode=0100644, size=136)
Clearing orphaned inode 5117795 (uid=0, gid=0, mode=0100644, size=1384456)
Clearing orphaned inode 5117796 (uid=0, gid=0, mode=0100644, size=13512)
Clearing orphaned inode 5117797 (uid=0, gid=0, mode=0100644, size=14602852)
Clearing orphaned inode 5117890 (uid=0, gid=0, mode=040755, size=4096)
Clearing orphaned inode 5117891 (uid=0, gid=0, mode=0100644, size=396838)
Clearing orphaned inode 5117895 (uid=0, gid=0, mode=0100644, size=336148)
Clearing orphaned inode 5117902 (uid=0, gid=0, mode=0100644, size=2097673)
Clearing orphaned inode 5117906 (uid=0, gid=0, mode=040755, size=4096)
Clearing orphaned inode 5117948 (uid=0, gid=0, mode=0100644, size=42292)
Clearing orphaned inode 5117949 (uid=0, gid=0, mode=0100644, size=30008)
Clearing orphaned inode 2359628 (uid=0, gid=0, mode=0100644, size=26152)
Clearing orphaned inode 2359565 (uid=0, gid=0, mode=0100644, size=719540)
Clearing orphaned inode 2359507 (uid=0, gid=0, mode=0100644, size=191280)
Clearing orphaned inode 2359309 (uid=0, gid=0, mode=0100644, size=140828)
Clearing orphaned inode 2636259 (uid=0, gid=0, mode=0100644, size=9731148)
Clearing orphaned inode 15601116 (uid=0, gid=0, mode=0100644, size=294664)
Clearing orphaned inode 15601112 (uid=0, gid=0, mode=0100644, size=1364764)
Clearing orphaned inode 2631097 (uid=0, gid=0, mode=0100644, size=8098204)
Clearing orphaned inode 11403364 (uid=1000, gid=1000, mode=0100600, size=56712)
Clearing orphaned inode 11403334 (uid=1000, gid=1000, mode=0100600, size=53464)
Clearing orphaned inode 11403332 (uid=1000, gid=1000, mode=0100600, size=317978)
Clearing orphaned inode 2759066 (uid=1000, gid=1000, mode=0100600, size=2048)
Clearing orphaned inode 2758210 (uid=1000, gid=1000, mode=0100600, size=0)
Clearing orphaned inode 2753516 (uid=1000, gid=1000, mode=0100600, size=2576)
Clearing orphaned inode 12978203 (uid=1000, gid=1000, mode=0100600, size=4096)
Clearing orphaned inode 11403300 (uid=1000, gid=1000, mode=0100600, size=36824)
Clearing orphaned inode 11403274 (uid=1000, gid=1000, mode=0100600, size=79992)
Clearing orphaned inode 11403269 (uid=1000, gid=1000, mode=0100600, size=518026)
Clearing orphaned inode 12978210 (uid=1000, gid=1000, mode=0100644, size=32768)
Clearing orphaned inode 12978221 (uid=1000, gid=1000, mode=0100600, size=15412)
/dev/sda3: clean, 1333035/27107328 files, 36744359/108406255 blocks

2010/12/06 HDD障害 不良セクター

clonezilla でイメージバックアップを取得しようとしたら、Partclone がエラーで途中で止まっていました。

Partclone v0.1.9 (Rev:343M) http://partclone.org
Starting to clone device (/dev/sda3) to image (-)
Reading Super Block
Calculating bitmap...
File system: EXTFS
Device size: 263998 MB
Space in use: 147176 MB
Block size: 4096 Byte
Used block count: 35931515
Total block 64452415
io_all: errno = Input/output error(5)
*************************************************************************
* WARNING: The disk has bad sector. This means physical damage on the   *
* disk surface caused by deterioration, manufacturing faults or other   *
* reason. The reliability of the disk may stay stable or degrade fast.  *
* Use the --rescue option to efficiently save as much data as possible! *
*************************************************************************

Expert モードで resuce オプションをつけてバックアップを取得しなおすと、なんとか取得できたようです。

SAMRT でテストしてみます。

$ sudo smartctl -t short /dev/sda
[sudo] password for kenji: 
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Short self-test routine immediately in off-line mode".
Drive command "Execute SMART Short self-test routine immediately in off-line mode" successful.
Testing has begun.
Please wait 2 minutes for test to complete.
Test will complete after Mon Dec  6 08:40:56 2010

Use smartctl -X to abort test.

テスト結果を表示します。

$ sudo smartctl -l selftest /dev/sda
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       90%      1208         436750599
# 2  Extended offline    Completed without error       00%       172         -
# 3  Short offline       Completed without error       00%       167         -

やはりエラーが出ています。

Current_Pending_Sector の値を調べます。

$ sudo smartctl -A /dev/sda
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       2104
  3 Spin_Up_Time            0x0027   147   147   021    Pre-fail  Always       -       1608
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       692
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   099   099   000    Old_age   Always       -       1208
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       689
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       31
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       836
194 Temperature_Celsius     0x0022   121   094   000    Old_age   Always       -       22
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   199   000    Old_age   Always       -       49
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

49 になってます。


トップ   編集 凍結 差分 バックアップ 添付 複製 名前変更 リロード   新規 一覧 単語検索 最終更新   ヘルプ   最終更新のRSS
Last-modified: 2010-12-13 (月) 20:12:22 (4904d)