Difference between revisions of "RAID"

Revision as of 12:46, 29 October 2011

RAID is the best way to avoid losing everything because of malfunctioning disks.

In my case I have these two lousy disks (very bad Maxtors), so I mirror them in order to:

gain read speed
keep working if one disk dies, since the other will do its job

mdadm

The most common and simplest manager for RAID devices

Quick commands

Create a bitmap

mdadm /dev/md3 -Gb internal

or

mdadm --grow --bitmap=internal /dev/md3
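
To confirm that the bitmap was created, /proc/mdstat shows a "bitmap:" line under the array. The following is a minimal sketch, assuming the usual mdstat layout; has_bitmap is a hypothetical helper name, and the file path is a parameter so it can be tried on a saved copy first:

```shell
# has_bitmap ARRAY [MDSTAT_FILE]
# Hypothetical helper: succeeds if ARRAY (e.g. md3) has a "bitmap:" line
# in the given mdstat file (default: /proc/mdstat).
has_bitmap() {
    awk -v want="$1" '
        /^md[0-9]+ :/ { current = $1 }              # remember which array we are in
        /bitmap:/ && current == want { found = 1 }  # bitmap line belongs to that array
        END { exit found ? 0 : 1 }
    ' "${2:-/proc/mdstat}"
}
```

For example: has_bitmap md3 && echo "md3 has a bitmap"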

mdadm: set /dev/sda1 faulty in /dev/md0

mdadm /dev/md0 -f /dev/sda1

mdadm: hot removed /dev/sda1

mdadm /dev/md0 -r /dev/sda1

mdadm: hot added /dev/sda1

mdadm /dev/md0 -a /dev/sda1

Create a multipath RAID with 4 disks

mdadm -C /dev/md0 --level=multipath --raid-devices=4 /dev/sda1 /dev/sdb1 /dev/sdc1 /dev/sdd1
Continue creating array? yes
mdadm: array /dev/md0 started.

Check consistency:

echo check > /sys/block/mdX/md/sync_action
watch -n .1 cat /proc/mdstat
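
Once the check has finished, sync_action goes back to "idle" and the mismatch_cnt file in the same directory reports how many mismatches were found. A small sketch that reads both; md_check_status is a hypothetical helper name, and the directory is passed in as an argument:

```shell
# md_check_status MD_SYSFS_DIR
# Hypothetical helper: prints "<sync_action> <mismatch_cnt>" for one array.
# MD_SYSFS_DIR is the array's md directory, e.g. /sys/block/md0/md.
md_check_status() {
    dir=$1
    action=$(cat "$dir/sync_action") || return 1
    mismatches=$(cat "$dir/mismatch_cnt") || return 1
    printf '%s %s\n' "$action" "$mismatches"
}
```

For example: md_check_status /sys/block/md0/md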

Check the RAID resync speed limit

cat /proc/sys/dev/raid/speed_limit_max

To cap it at 30 MB/s (the value is in KB/s):

echo "30000" > /proc/sys/dev/raid/speed_limit_max
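
The same change can be wrapped in a small function that rejects non-numeric input before writing. This is only a sketch: set_resync_limit_mb is a hypothetical name, it uses the 30 MB/s -> 30000 convention from above, and the target file is an optional second argument so it can be tried on a scratch file. A matching speed_limit_min file sits in the same directory.

```shell
# set_resync_limit_mb MB [LIMIT_FILE]
# Hypothetical helper: writes MB * 1000 to the limit file
# (default: /proc/sys/dev/raid/speed_limit_max). Needs root for the real file.
set_resync_limit_mb() {
    mb=$1
    file=${2:-/proc/sys/dev/raid/speed_limit_max}
    case $mb in
        ''|*[!0-9]*)
            echo "usage: set_resync_limit_mb MB [FILE]" >&2
            return 1
            ;;
    esac
    echo $((mb * 1000)) > "$file"
}
```

For example: set_resync_limit_mb 30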

Recover a raid1

livecd ~ # mdadm --assemble /dev/md1 /dev/hda5 /dev/hdc5
mdadm: /dev/md1 has been started with 2 drives.
livecd ~ # cat /proc/mdstat
Personalities : [raid1]
md1 : active raid1 hda5[0] hdc5[1]
     19542976 blocks [2/2] [UU]
     bitmap: 0/150 pages [0KB], 64KB chunk
unused devices: <none>

Install the system on software RAID

mknod /dev/md1 b 9 1
mknod /dev/md2 b 9 2
mknod /dev/md3 b 9 3
mdadm --create --verbose /dev/md1 --level=1 --raid-devices=2 /dev/sda1 /dev/sdb1
mdadm --create --verbose /dev/md2 --level=1 --raid-devices=2 /dev/sda2 /dev/sdb2
mdadm --create --verbose /dev/md3 --level=1 --raid-devices=2 /dev/sda3 /dev/sdb3
mdadm --detail --scan > /etc/mdadm.conf
mdadm /dev/md1 -Gb internal
mdadm /dev/md2 -Gb internal
mdadm /dev/md3 -Gb internal
grub
 grub> root (hd0,x)
 grub> setup (hd0)
 grub> quit
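
The mknod and --create lines above follow one pattern per partition number, so they can be generated in a loop. A sketch that only prints the commands for review and runs nothing; print_mirror_setup is a hypothetical helper name:

```shell
# print_mirror_setup DISK_A DISK_B N...
# Hypothetical helper: for each partition number N, prints the mknod and
# mdadm --create lines that mirror DISK_A<N> with DISK_B<N> as /dev/md<N>.
print_mirror_setup() {
    disk_a=$1
    disk_b=$2
    shift 2
    for n in "$@"; do
        printf 'mknod /dev/md%s b 9 %s\n' "$n" "$n"
        printf 'mdadm --create --verbose /dev/md%s --level=1 --raid-devices=2 %s%s %s%s\n' \
            "$n" "$disk_a" "$n" "$disk_b" "$n"
    done
}
```

For example: print_mirror_setup /dev/sda /dev/sdb 1 2 3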

Create a raid0 (Disk+Disk+Disk..)

localhost ~ # mknod /dev/md0 b 9 0
localhost ~ # mdadm --create /dev/md0 --chunk=64 --level=raid0 --raid-devices=2 /dev/hda7 /dev/hdc1
mdadm: /dev/hda7 appears to contain a reiserfs file system
    size = 4891648K
Continue creating array? y
mdadm: array /dev/md0 started.
localhost ~ # cat /proc/mdstat
Personalities : [linear] [raid0] [raid1] [raid10] [raid6] [raid5] [raid4]
md0 : active raid0 hdc1[1] hda7[0]
     303363200 blocks 64k chunks
unused devices: <none>

Create a raid1 (Disk=Disk+Disk=Disk+..)

Let's look at the disk models I have available (knowing they are PS and SM, i.e. primary slave and secondary master):

[root@elwood ~]# cat /proc/ide/ide[01]/hd[bc]/model
	Maxtor 6Y160P0
	Maxtor 6L160P0

The first is a model with a factory defect (6Y***P0). The second is a bit better.

I create one partition per disk (they must be identical) and switch the partition type to Linux raid autodetect (type fd).

# fdisk -l /dev/hd[bc]
  Device Boot      Start         End      Blocks   Id  System
/dev/hdb1               1       19929   160079661   fd  Linux raid autodetect

  Device Boot      Start         End      Blocks   Id  System
/dev/hdc1               1       19929   160079661   fd  Linux raid autodetect

Format the file system (ext3):

[root@elwood ~]# mkfs.ext3 -L raid1 -m0 /dev/hdb1
[root@elwood ~]# mkfs.ext3 -L raid1 -m0 /dev/hdc1

Build and start the RAID:

[root@elwood ~]# mdadm --create /dev/md0 --level=1 --raid-devices=2 /dev/hdb1 /dev/hdc1
mdadm: /dev/hdb1 appears to contain an ext2fs file system
   size=160079660K  mtime=Mon May 29 22:32:40 2006
mdadm: /dev/hdc1 appears to contain an ext2fs file system
   size=160079660K  mtime=Thu Jan  1 01:00:00 1970
Continue creating array? (y/n) y
mdadm: array /dev/md0 started.

Now let's see what happened:

[root@elwood ~]# cat /proc/mdstat
Personalities : [raid1]
md0 : active raid1 hdc1[1] hdb1[0]
     160079552 blocks [2/2] [UU]
     [>....................]  resync =  0.4% (652864/160079552) finish=44.7min speed=59351K/sec
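
The "finish=" estimate in that progress line is roughly the remaining blocks divided by the current speed. A sketch that recomputes it in seconds from mdstat text on standard input, assuming the progress-line layout shown above; resync_eta_seconds is a hypothetical helper name:

```shell
# resync_eta_seconds < MDSTAT_TEXT
# Hypothetical helper: for every "resync = ..." or "recovery = ..." progress
# line, prints (total - done) / speed, truncated to whole seconds.
resync_eta_seconds() {
    awk '
        /(resync|recovery) *= *[0-9]/ {
            speed = 0
            if (match($0, /\([0-9]+\/[0-9]+\)/)) {
                # "(done/total)" without the parentheses
                split(substr($0, RSTART + 1, RLENGTH - 2), n, "/")
            } else {
                next
            }
            if (match($0, /speed=[0-9]+K/)) {
                # digits between "speed=" and "K"
                speed = substr($0, RSTART + 6, RLENGTH - 7) + 0
            }
            if (speed > 0) printf "%d\n", (n[2] - n[1]) / speed
        }
    '
}
```

For example: resync_eta_seconds < /proc/mdstat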

The following command starts RAID monitoring at every boot, so that we get e-mail alerts:

[root@elwood ~]# echo " nohup mdadm --monitor --mail=aledelg@gmail.com --delay=300 /dev/md0 & " >> /etc/rc.local

Write a couple of lines in mdadm.conf so the RAID is reassembled at reboot (not always necessary):

[root@elwood ~]# vi /etc/mdadm.conf
DEVICE /dev/hd[bc]1
ARRAY /dev/md0 devices=/dev/hdb1,/dev/hdc1

Remove a disk

Remove the partition /dev/hdb1 from the RAID /dev/md5:

fricco ~ # mdadm --manage /dev/md5 --fail /dev/hdb1
mdadm: set /dev/hdb1 faulty in /dev/md5
fricco ~ # cat /proc/mdstat
Personalities : [linear] [raid0] [raid1]
md5 : active raid1 hdd1[1] hdb1[2](F)
     244187904 blocks [2/1] [_U]
md0 : active raid0 hdc1[1] hda7[0]
     303363200 blocks 64k chunks
unused devices: <none>
fricco ~ # mdadm --manage /dev/md5 -r /dev/hdb1
mdadm: hot removed /dev/hdb1
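
In the mdstat output, an underscore inside the status brackets (here [_U]) marks a missing member. A sketch that lists the arrays in that state from mdstat text on standard input, assuming the layout shown above; degraded_arrays is a hypothetical helper name:

```shell
# degraded_arrays < MDSTAT_TEXT
# Hypothetical helper: prints the name of every array whose status
# brackets contain an underscore, e.g. [_U] or [U_].
degraded_arrays() {
    awk '
        /^md[0-9]+ :/ { name = $1 }               # header line of an array
        /\[[U_]+\]/ && name != "" {               # its status line
            match($0, /\[[U_]+\]/)
            if (index(substr($0, RSTART, RLENGTH), "_")) print name
            name = ""
        }
    '
}
```

For example: degraded_arrays < /proc/mdstat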

Reset the RAID information of a disk

mdadm --zero-superblock /dev/<disk>

Replace a failed disk in a raid1

The configuration is as follows:

macchina ~ # cat /proc/mdstat |grep ^md
md10 : active raid1 hdd2[1] hda2[0]
md100 : active raid1 hdd3[1] hda3[0]
md160 : active raid1 hdd5[1] hdb1[0]
md0 : active raid1 hdd1[1] hda1[0]

The installed disks are:

macchina ~ # fdisk -l 2> /dev/null|grep 'Disk /dev/hd'
Disk /dev/hda: 80.0 GB, 80000000000 bytes
Disk /dev/hdb: 163.9 GB, 163928604672 bytes
Disk /dev/hdd: 251.0 GB, 251000193024 bytes

The disk /dev/hdd is damaged:

macchina ~ # dmesg |grep hdd|tail
end_request: I/O error, dev hdd, sector 162802377
raid1: hdd5: rescheduling sector 100823408
hdd: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hdd: dma_intr: error=0x40 { UncorrectableError }, LBAsect=162802376, high=9, low=11807432,  sector=162802369
end_request: I/O error, dev hdd, sector 162802369
raid1:md160: read error corrected (8 sectors at 100823536 on hdd5)
hdd: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hdd: dma_intr: error=0x40 { UncorrectableError }, LBAsect=162802377, high=9, low=11807433,  sector=162802377
end_request: I/O error, dev hdd, sector 162802377
raid1:md160: read error corrected (8 sectors at 100823544 on hdd5)

Back up the partition table:

sfdisk -d /dev/hdd > hdd_partition_table

Shut down the machine and install the new disk:

[... reboot ...]

Reload the old disk's partition table onto the new one:

sfdisk /dev/hdd < hdd_partition_table
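
Since writing a partition table is destructive, it can help to sanity-check the dump file first. A sketch under the assumption that the dump contains lines starting with the partition names (e.g. /dev/hdd1), as sfdisk -d output normally does; restore_partition_table is a hypothetical wrapper name:

```shell
# restore_partition_table DISK DUMP_FILE
# Hypothetical wrapper: refuses to call sfdisk unless the dump exists,
# is non-empty, and has at least one line starting with DISK.
restore_partition_table() {
    disk=$1
    dump=$2
    [ -s "$dump" ] || { echo "missing or empty dump: $dump" >&2; return 1; }
    grep -q "^$disk" "$dump" || { echo "$dump does not describe $disk" >&2; return 1; }
    sfdisk "$disk" < "$dump"
}
```

For example: restore_partition_table /dev/hdd hdd_partition_table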

Add the new disk's partitions to the RAIDs, one by one:

aledg ~ # mdadm /dev/md160 -a /dev/hdd5
mdadm: added /dev/hdd5
aledg ~ # mdadm /dev/md0 -a /dev/hdd1
mdadm: added /dev/hdd1
aledg ~ # mdadm /dev/md10 -a /dev/hdd2
mdadm: added /dev/hdd2
aledg ~ # mdadm /dev/md100 -a /dev/hdd3
mdadm: added /dev/hdd3
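
With several arrays on the same disk, the add commands can be generated from a list of array:partition pairs. A sketch that only prints the commands and runs nothing; print_readd_cmds and the pair syntax are hypothetical conventions made up for this example:

```shell
# print_readd_cmds DISK ARRAY:PARTNUM...
# Hypothetical helper: for each pair such as md160:5, prints the mdadm
# command that adds DISK<PARTNUM> to /dev/ARRAY.
print_readd_cmds() {
    disk=$1
    shift
    for pair in "$@"; do
        array=${pair%%:*}   # text before the colon, e.g. md160
        part=${pair#*:}     # text after the colon, e.g. 5
        printf 'mdadm /dev/%s -a %s%s\n' "$array" "$disk" "$part"
    done
}
```

For example: print_readd_cmds /dev/hdd md160:5 md0:1 md10:2 md100:3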

Check whether the RAIDs are rebuilding: the first one I repaired will be rebuilding, while the others must be in the "DELAYED" state, meaning they will complete one at a time:

aledg ~ # cat /proc/mdstat
Personalities : [raid0] [raid1] [multipath] [faulty]
md10 : active raid1 hdd2[2] hda2[0]
     10241344 blocks [2/1] [U_]
       resync=DELAYED
     bitmap: 28/157 pages [112KB], 32KB chunk
md100 : active raid1 hdd3[2] hda3[0]
     20482752 blocks [2/1] [U_]
       resync=DELAYED
     bitmap: 17/157 pages [68KB], 64KB chunk
md160 : active raid1 hdd5[2] hdb1[0]
     160079552 blocks [2/1] [U_]
     [>....................]  recovery =  0.4% (690112/160079552) finish=141.1min speed=18812K/sec
md0 : active raid1 hdd1[2] hda1[0]
     264960 blocks [2/1] [U_]
       resync=DELAYED
     bitmap: 0/33 pages [0KB], 4KB chunk
