一个关于H70小机硬盘的问题
具体情况是这样的,某天突然听到H70旁有报警声音,查看日志是每1分钟报警一次。这个小机自带有3块硬盘,分别是hdisk0,hdisk1,hdisk14, 其中hdisk0和hdisk14是rootvg,hdisk1是hdisk0的镜像,hdisk14没有镜像,上面有6个PP的数据,都是/usr文件系统,这三块硬盘都是9G的,阵列上有12块。错误日志为:
A668F553 0301085008 P H hdisk14 DISK OPERATION ERROR
A668F553 0301085008 P H hdisk14 DISK OPERATION ERROR
A668F553 0301084908 P H hdisk14 DISK OPERATION ERROR
A668F553 0301084908 P H hdisk14 DISK OPERATION ERROR
A668F553 0301084808 P H hdisk14 DISK OPERATION ERROR
详细信息:
LABEL: DISK_ERR2
IDENTIFIER: A668F553
Date/Time: Sat Mar 1 08:51:14 BEIS
Sequence Number: 425
Machine Id: 000155394C00
Node Id: wzcb02_boot
Class: H
Type: PERM
Resource Name: hdisk14
Resource Class: disk
Resource Type: scsd
Location: 10-60-00-10,0
VPD:
Manufacturer................IBM
Machine Type and Model......DGHS09U
FRU Number..................59H6926
ROS Level and ID............30334530
Serial Number...............6829F188
EC Level....................E31898
Part Number.................59H6816
Device Specific.(Z0)........000003029F00013A
Device Specific.(Z1)........GAGSPR603E
Device Specific.(Z2)........09RI
Device Specific.(Z3)........99250
Device Specific.(Z4)........0001
Device Specific.(Z5)........22
Device Specific.(Z6)........E31777
Description
DISK OPERATION ERROR
Probable Causes
DASD DEVICE
Failure Causes
DISK DRIVE
DISK DRIVE ELECTRONICS
Recommended Actions
PERFORM PROBLEM DETERMINATION PROCEDURES
Detail Data
SENSE DATA
060A 0000 0000 0000 0000 0000 0000 0000 0102 0000 7000 0200 0000 0018 0000 0000
0400 0100 0000 0000 018B 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0009 0002 AFC0
当时用命令检测了下状态,然后咨询了下公司,他建议我重新启动,重新启动后发现,
#lspv
hdisk0 000153297464659f rootvg
hdisk1 00015539d5d26234 rootvg
hdisk15 none None
#lsvg -p rootvg
rootvg:
PV_NAME PV STATE TOTAL PPs FREE PPs FREE DISTRIBUTION
hdisk0 active 542 0 00..00..00..00..00
hdisk1 active 542 58 00..00..00..00..58
hdisk14 missing 542 536 109..108..102..108..109
#lsdev -Cc disk
hdisk0 Available 10-60-00-8,0 16 Bit SCSI Disk Drive
hdisk1 Available 10-60-00-9,0 16 Bit SCSI Disk Drive
hdisk14 Defined 10-60-00-10,0 16 Bit SCSI Disk Drive
hdisk15 Available 10-60-00-10,0 16 Bit SCSI Disk Drive
#lsvg -l rootvg
rootvg:
LV NAME TYPE LPs PPs PVs LV STATE MOUNT POINT
hd5 boot 1 2 2 closed/syncd N/A
hd6 paging 64 128 2 open/syncd N/A
hd8 jfslog 1 2 2 open/syncd N/A
hd4 jfs 7 14 2 open/syncd /
hd2 jfs 70 140 3 open/stale /usr
hd9var jfs 7 14 2 open/syncd /var
hd3 jfs 63 126 2 open/syncd /tmp
hd1 jfs 269 538 2 open/syncd /home
hd10opt jfs 2 4 2 open/syncd /opt
lg_dumplv sysdump 64 64 1 open/syncd N/A
#lspv -p hdisk14
hdisk14:
PP RANGE STATE REGION LV ID TYPE MOUNT POINT
1-109 free outer edge
110-217 free outer middle
218-319 free center
320-323 used center hd2 jfs /usr
324-324 stale center hd2 jfs /usr
325-325 used center hd2 jfs /usr
326-433 free inner middle
434-542 free inner edge
#lspv -p hdisk15
0516-304 : Unable to find device id 00000000000000000000000000000000 in the Device
Configuration Database.
是不是硬盘坏了,我不知道能不能确认。重启后,系统认了块hdisk15,但实际还是hdisk14。
后来又执行了以下命令
#rmdev -Rdl hdisk14
#rmdev -Rdl hdisk15
#cfgmgr -v
系统又重新认为了hdisk14,
#lspv
hdisk0 000153297464659f rootvg
hdisk1 00015539d5d26234 rootvg
hdisk14 none None
#lsvg -p rootvg
rootvg:
PV_NAME PV STATE TOTAL PPs FREE PPs FREE DISTRIBUTION
hdisk0 active 542 0 00..00..00..00..00
hdisk1 active 542 58 00..00..00..00..58
0516-304 lsvg: Unable to find device id 00015329d5c802c7 in the Device
Configuration Database.
00015329d5c802c7 missing 542 536 109..108..102..108..109
#lsdev -Cc disk
hdisk0 Available 10-60-00-8,0 16 Bit SCSI Disk Drive
hdisk1 Available 10-60-00-9,0 16 Bit SCSI Disk Drive
hdisk14 Available 10-60-00-10,0 16 Bit SCSI Disk Drive
lspv -p hdisk14
0516-304 : Unable to find device id 00000000000000000000000000000000 in the Device
Configuration Database.
现在就是hdisk14没有pvid,在ODM中找不到,是不是需要执行#chdev -a hdisk14 -l pv=yes ??
另外,如果重新找到该盘的话,数据是不是仍然可以保留?