Lecture course Speichersysteme (Storage Systems), Summer Semester 2009. Chapter 5: RAID. André Brinkmann
Introduction to Disk Arrays
Contents:
- Why disk arrays?
- MTTF, MTTR, MTTDL
- RAID 0, RAID 1, RAID 5
- Multiple disk failures and RAID 6
Use Arrays of Small Disks
Katz and Patterson asked in 1987: can smaller disks be used to close the gap in performance between disks and CPUs?
- Conventional: 4 disk designs (3.5", 5.25", 10", 14"), from low end to high end
- Disk array: 1 disk design (3.5")
Slides based on a lecture by Prof. D. Patterson (Berkeley)
Replacing a Small Number of Large Disks with a Large Number of Small Disks (1988 example)

                      IBM 3390K    IBM 3.5" 0061    70 small disks    Improvement
  Capacity            20 GB        320 MB           23 GB             1x
  Volume              97 ft^3      0.1 ft^3         11 ft^3           9x
  Power consumption   3 kW         11 W             1 kW              3x
  Bandwidth           15 MB/s      1.5 MB/s         120 MB/s          8x
  IO rate             600 IO/s     55 IO/s          3900 IO/s         6x
  MTTF                250 kHrs     50 kHrs          ?? Hrs
  Cost                $250,000     $2,000           $150,000          1.6x

Disk arrays offer potentially high performance, with many MB per volume and MB per kW. But what about reliability?
MTTF, MTTR, MTTDL
- Mean Time To Failure (MTTF): expected time until a disk fails. MTTF can be defined in terms of the expected value of the failure density function f:
  MTTF = ∫₀^∞ t · f(t) dt
- Mean Time To Repair (MTTR): expected time from the failure of a disk until its recovery is complete (if recovery is possible)
- Mean Time To Data Loss (MTTDL): expected time from starting a storage system until the loss of data
Bathtub Curve
- The MTTF is calculated for the normal operating period of a disk
- The failure rate is nearly constant over a period of 3 to 4 years
- The failure rate before and afterwards is significantly higher
(Figure: bathtub curve with an early period of infant failures, a constant-failure-rate region during normal operation, and a wear-out failure period at the end of life)
RAID
- Parallel disk arrays provide high bandwidth and good properties concerning MB per volume and MB per kW, but what about reliability?
- MTTF of an array of n disks: MTTF_array = MTTF_disk / n
- In the example: MTTF_70 = 50,000 h / 70 ≈ 700 h
- The MTTF of the disk array decreases from roughly six years to about one month
- Arrays (without redundancy) are too unreliable to be useful!
Reliability and Availability
- Reliability: the ability of a system or system component to perform its function under defined conditions for a specified period of time. Measured as MTTF (Mean Time To Failure).
- Availability: the degree to which a system or system component is accessible. Assumption here: the system is not accessible during reconstruction.
Slide based on the IEEE Standard Computer Dictionary: A Compilation of IEEE Standard Computer Glossaries, and on Torell / Avelar: Mean Time Between Failure: Explanations and Standards
Redundant Array of Inexpensive Disks
- Files are striped across multiple disks
- Redundancy yields high data availability. Availability: service is still provided to the user, even if some components have failed
- Disks will still fail, but their contents can be reconstructed from data redundantly stored in the array
- Capacity penalty to store the redundant information
- Bandwidth penalty to update the redundant information
See D. A. Patterson, G. A. Gibson, R. H. Katz: A Case for Redundant Arrays of Inexpensive Disks (RAID)
Berkeley History: RAID I (1989)
- SUN 4/280 workstation with 128 MByte DRAM, four SCSI controllers, 28 5.25" disks, and special disk-striping software
- Today RAID is a $27 billion industry; 80% of all non-PC disks are sold as RAID systems
RAID Levels
RAID = Redundant Array of Independent Disks. Well-known RAID levels:
- 0: no redundancy (JBOD)
- 1: mirroring
- 10: striped mirrors
- 2: Hamming codes / ECC (not used)
- 3: byte-interleaved parity
- 4: block-interleaved parity
- 5: rotated block-interleaved parity
- 6: double parity (rare)
RAID 0
- RAID 0 stripes data over a set of disks; the size of each data block is several KByte
- Increased bandwidth for big accesses, or for many parallel but small accesses
- RAID 0 does not include redundancy information: no protection against single disk failures

Layout (legend: x-y means block x from stripe y):

  Disk:   0    1    2    3    4
         0-0  1-0  2-0  3-0  4-0
         0-1  1-1  2-1  3-1  4-1
         0-2  1-2  2-2  3-2  4-2
         0-3  1-3  2-3  3-3  4-3

- The location can be efficiently calculated for n disks: stripe address y = address / n, disk number x = address % n
- Logical address 12 is mapped to stripe 2 on disk 2
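The address mapping above can be sketched in a few lines of Python (a minimal illustration; the function name is hypothetical):

```python
def raid0_map(address: int, n_disks: int) -> tuple[int, int]:
    """Map a logical block address to (disk number x, stripe address y) in RAID 0."""
    stripe = address // n_disks   # stripe address y = address / n
    disk = address % n_disks      # disk number x = address % n
    return disk, stripe

# Logical address 12 on a 5-disk array maps to disk 2, stripe 2:
print(raid0_map(12, 5))  # → (2, 2)
```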
RAID 1
- Every disk is fully mirrored; very high availability can be achieved
- Bandwidth sacrifice on write: one logical write = two physical writes
- Reads can be optimized
- Most expensive RAID solution: 100% capacity overhead

Layout (legend: x-y means block x on disk y):

  Disk:   0    1
         0-0  0-1
         1-0  1-1
         2-0  2-1
         3-0  3-1
Parity RAID
Properties of the previous RAID levels:
- Mirroring produces high overhead
- Striping does not include failure correction
A scheme is required with low capacity overhead, good failure-protection properties, and low computing costs.
Idea of RAID 3 and RAID 4: use striping plus a parity computation; the parity is computed using XOR.
Example: divide the data block 1101 into 4 sub-blocks plus one parity block (1 XOR 1 XOR 0 XOR 1 = 1):

  Disk:     0    1    2    3    4 (parity)
  Block:   0-0  1-0  2-0  3-0  4-0
  Value:    1    1    0    1    1
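The parity computation and its use for reconstruction can be illustrated with a short Python sketch (the function name is hypothetical):

```python
from functools import reduce

def parity(blocks):
    """XOR parity over the blocks of a stripe."""
    return reduce(lambda a, b: a ^ b, blocks)

data = [1, 1, 0, 1]        # the 4 sub-blocks of the example block 1101
p = parity(data)           # 1 ^ 1 ^ 0 ^ 1 = 1

# If one block is lost, XORing the survivors with the parity recovers it:
lost = 2
survivors = [b for i, b in enumerate(data) if i != lost]
recovered = parity(survivors + [p])
print(p, recovered)  # → 1 0
```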
Terminology
- Striping unit: block used to distribute data
- Stripe: set of striping units that share a parity computation
- Parity block: block that keeps the parity of a stripe; same size as a striping unit
Small Write Problem
The read performance of a parity RAID is nearly as good as that of RAID 0, but each small write to a single block x needs to update the parity block!
Solution 1:
- Read all other data blocks of the stripe except the changed one
- Calculate the new parity
- Write the new data block and the parity block
- Overhead is proportional to the stripe size
Solution 2: use the properties of the XOR function: x XOR x = 0, and XOR is associative and commutative.
- Read only the ONE old data block x_old and the old parity block p_old
- We know the new data block x_new
- Just calculate the new parity block as: p_new = p_old XOR x_old XOR x_new
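Solution 2 can be verified with a small sketch in which blocks are modeled as integers (real implementations XOR whole byte buffers; the function name is hypothetical):

```python
def small_write_update(p_old: int, x_old: int, x_new: int) -> int:
    """Incremental parity update: p_new = p_old XOR x_old XOR x_new."""
    return p_old ^ x_old ^ x_new

data = [0b1010, 0b0110, 0b1100]          # data blocks of one stripe
p_old = data[0] ^ data[1] ^ data[2]      # initial parity

x_new = 0b0001                           # small write to block 1
p_new = small_write_update(p_old, data[1], x_new)
data[1] = x_new

# The incrementally updated parity equals the fully recomputed parity:
assert p_new == data[0] ^ data[1] ^ data[2]
```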
RAID 5
- RAID 4 produces a bottleneck at the parity disk: each write access to an arbitrary block produces one write request at the parity disk
- RAID 4 therefore does not scale with the stripe size
- Idea of RAID 5: distribute the function of the parity disk over all disks

Layout (legend: x-y means block x from stripe y, P-y is the parity block of stripe y; the parity position rotates over the disks):

  Disk:   0    1    2    3    4
         0-0  1-0  2-0  3-0  P-0
         0-1  1-1  2-1  P-1  3-1
         0-2  1-2  P-2  2-2  3-2
         0-3  P-3  1-3  2-3  3-3
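The rotation of the parity block can be sketched as follows. This shows one simple rotation scheme; real controllers use several layout variants (left/right, symmetric/asymmetric), and the helper name is hypothetical:

```python
def raid5_stripe(stripe: int, n_disks: int) -> list[str]:
    """Contents of one stripe row with the parity block shifted by one
    disk position per stripe (a simple rotated layout)."""
    parity_disk = (n_disks - 1 - stripe) % n_disks
    row, block = [], 0
    for disk in range(n_disks):
        if disk == parity_disk:
            row.append(f"P-{stripe}")
        else:
            row.append(f"{block}-{stripe}")
            block += 1
    return row

for s in range(4):
    print(raid5_stripe(s, 5))
# → ['0-0', '1-0', '2-0', '3-0', 'P-0'], ['0-1', '1-1', '2-1', 'P-1', '3-1'], …
```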
Does Redundancy Help?
- For a 1-error-correcting code over n disks, data is lost only if a second disk fails before the first failure has been repaired
- The failure probability during recovery depends on the time to recover the data and on the MTTF of the remaining devices
Assumptions
- Errors do not occur in the wear-out phase; the MTTF (of one disk) is constant
- An exponential failure distribution leads to the failure density function: f(t) = λ · e^(−λt)
- λ is the rate at which an element fails, expressed in failures per unit of time
- It holds that MTTF = ∫₀^∞ t · λ · e^(−λt) dt = 1/λ
Probability of a Second Failure
The probability of a second failure within the repair time is the integral over the density function:
  P(t ≤ MTTR) = ∫₀^MTTR f(t) dt = ∫₀^MTTR λ · e^(−λt) dt = 1 − e^(−λ·MTTR)
Here λ = 1/MTTF, and therefore
  P(t ≤ MTTR) = 1 − e^(−MTTR/MTTF)
Probability of a Second Failure
The exponential function can be expanded as a series:
  e^(−x) = 1 − x + x²/2! − x³/3! + …
With MTTR << MTTF this yields
  P(t ≤ MTTR) = 1 − e^(−MTTR/MTTF) ≈ MTTR/MTTF
and, with n − 1 remaining disks after the first failure, the MTTDL can be calculated as
  MTTDL ≈ (MTTF/n) · (MTTF / ((n−1) · MTTR)) = MTTF² / (n · (n−1) · MTTR)
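The quality of the first-order series approximation for MTTR << MTTF can be checked numerically. The numbers below are illustrative, not from the slides:

```python
import math

def p_second_failure(mttr: float, mttf: float, remaining: int) -> float:
    """Exact probability that one of the remaining disks fails within MTTR,
    assuming independent exponential failure times with rate 1/MTTF each."""
    return 1.0 - math.exp(-remaining * mttr / mttf)

mttf, mttr, n = 50_000.0, 24.0, 10        # hours; illustrative values
exact = p_second_failure(mttr, mttf, n - 1)
approx = (n - 1) * mttr / mttf            # first-order series term
print(f"{exact:.6f} vs {approx:.6f}")     # nearly identical since MTTR << MTTF
```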
Does Redundancy Help?
Standard RAID schemes increase the MTTDL for n disks from MTTF/n without data protection to MTTF² / (n · (n−1) · MTTR).
Drawbacks:
- Writing becomes slower
- The complexity of implementation and administration increases significantly
Is This Safe Enough?
Assumptions for a storage cluster environment:
- 1 PByte of data stored on 2000 computers
- The environment is grouped into 200 RAID 5 systems with 10 disks each
- The MTTF of each computer (including disks) is 1000 days
- The recovery time of a computer is 1 day
MTTDL = (1/200) · (1000 d)² / (10 · 9 · 1 d) ≈ 55 d
Protection against single disk failures is not enough in large-scale environments.
Example taken from the Lustre Manual v1.6, August 2007
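The calculation above can be reproduced directly (a minimal sketch; the function name is hypothetical):

```python
def mttdl_raid5_cluster(mttf_days: float, mttr_days: float,
                        disks_per_group: int, n_groups: int) -> float:
    """MTTDL of n_groups independent RAID 5 groups.

    Per group: MTTDL = MTTF^2 / (n * (n-1) * MTTR). Data is lost as soon as
    any one group loses data, so divide by the number of groups."""
    per_group = mttf_days ** 2 / (disks_per_group * (disks_per_group - 1) * mttr_days)
    return per_group / n_groups

# 200 RAID 5 groups of 10, MTTF = 1000 days, MTTR = 1 day:
print(round(mttdl_raid5_cluster(1000, 1, 10, 200), 1))  # → 55.6 days
```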