Dynamische Erweiterung lokaler Batch Systeme durch Cloud Resourcen

Ähnliche Dokumente
JONATHAN JONA WISLER WHD.global

GridMate The Grid Matlab Extension

Integration of D-Grid Sites in NGI-DE Monitoring

Unternehmen-IT sicher in der Public Cloud

Infrastructure as a Service (IaaS) Solutions for Online Game Service Provision

Virtual PBX and SMS-Server

Cloud Computing in der öffentlichen Verwaltung

Virtualisierungskonzepte im Betrieb von gemeinsam genutzten Rechenclustern

Cloud Computing mit mathematischen Anwendungen

The Cumulus Project: Build a Scientific Cloud for a Data Center

Algorithms for graph visualization

Customer-specific software for autonomous driving and driver assistance (ADAS)

VGM. VGM information. HAMBURG SÜD VGM WEB PORTAL USER GUIDE June 2016

The Cumulus Project: Build a Scientific Cloud for a Data Center

Implementing a Software- Defined DataCenter MOC 20745

Open Source Data Center Virtualisierung mit OpenNebula LinuxTag Berlin. Bernd Erk

Einsatz einer Dokumentenverwaltungslösung zur Optimierung der unternehmensübergreifenden Kommunikation

Introducing PAThWay. Structured and methodical performance engineering. Isaías A. Comprés Ureña Ventsislav Petkov Michael Firbach Michael Gerndt

TSM 5.2 Experiences Lothar Wollschläger Zentralinstitut für Angewandte Mathematik Forschungszentrum Jülich

Microsoft Azure Fundamentals MOC 10979


Oracle Public Cloud benutzen

Oracle Integration Cloud Service

Walter Buchmayr Ges.m.b.H.

Magic Figures. We note that in the example magic square the numbers 1 9 are used. All three rows (columns) have equal sum, called the magic number.

Karlsruhe Institute of Technology Die Kooperation von Forschungszentrum Karlsruhe GmbH und Universität Karlsruhe (TH)

Titelmasterformat durch Klicken bearbeiten

Order Ansicht Inhalt

SARA 1. Project Meeting

Where are we now? The administration building M 3. Voransicht

Platform as a Service (PaaS) & Containerization

sciebo The Academic NRW Cloud Storage Service

IDS Lizenzierung für IDS und HDR. Primärserver IDS Lizenz HDR Lizenz

Mash-Up Personal Learning Environments. Dr. Hendrik Drachsler

Star Trek: die Serien, die Filme, die Darsteller: Interessante Infod, zusammengestellt aus Wikipedia-Seiten (German Edition)

Fachübersetzen - Ein Lehrbuch für Theorie und Praxis

ETHISCHES ARGUMENTIEREN IN DER SCHULE: GESELLSCHAFTLICHE, PSYCHOLOGISCHE UND PHILOSOPHISCHE GRUNDLAGEN UND DIDAKTISCHE ANSTZE (GERMAN

p^db=`oj===pìééçêíáåñçêã~íáçå=

ITK-Trends 2010: Hardware and Software. Engineered to work together. Rolf Kersten EMEA Hardware Product Marketing, Oracle

Exercise (Part II) Anastasia Mochalova, Lehrstuhl für ABWL und Wirtschaftsinformatik, Kath. Universität Eichstätt-Ingolstadt 1

Open Source Data Center Virtualisierung mit OpenNebula CeBIT Bernd Erk

VGM. VGM information. HAMBURG SÜD VGM WEB PORTAL - USER GUIDE June 2016

FlexFrame for Oracle. Torsten Schlautmann OPITZ CONSULTING Gummersbach GmbH

Mit Legacy-Systemen in die Zukunft. adviion. in die Zukunft. Dr. Roland Schätzle

ELBA2 ILIAS TOOLS AS SINGLE APPLICATIONS

Ressourcenmanagement in Netzwerken SS06 Vorl. 12,

Modul 6 Virtuelle Private Netze (VPNs) und Tunneling

NEWSLETTER. FileDirector Version 2.5 Novelties. Filing system designer. Filing system in WinClient

Exercise (Part V) Anastasia Mochalova, Lehrstuhl für ABWL und Wirtschaftsinformatik, Kath. Universität Eichstätt-Ingolstadt 1

Long-term archiving of medical data new certified cloud-based solution offers high security and legally approved data management

Rev. Proc Information

Exercise (Part XI) Anastasia Mochalova, Lehrstuhl für ABWL und Wirtschaftsinformatik, Kath. Universität Eichstätt-Ingolstadt 1

Level 2 German, 2015

Context-adaptation based on Ontologies and Spreading Activation

Praktikum Entwicklung Mediensysteme (für Master)

Martin Luther. Click here if your download doesn"t start automatically

DAS ERSTE MAL UND IMMER WIEDER. ERWEITERTE SONDERAUSGABE BY LISA MOOS

Serverless Architecture with Azure-Functions

Level 1 German, 2014

TMF projects on IT infrastructure for clinical research

Tuning des Weblogic /Oracle Fusion Middleware 11g. Jan-Peter Timmermann Principal Consultant PITSS

Dexatek's Alexa Smart Home Skills Instruction Guide

Cameraserver mini. commissioning. Ihre Vision ist unsere Aufgabe

Business-centric Storage How appliances make complete backup solutions simple to build and to sell

Level 1 German, 2012

Level 2 German, 2013

Using TerraSAR-X data for mapping of damages in forests caused by the pine sawfly (Dprion pini) Dr. Klaus MARTIN

Creating OpenSocial Gadgets. Bastian Hofmann

Upgrading Your Skills to MCSA: Windows Server 2016 MOC 20743

Wie man heute die Liebe fürs Leben findet

Privacy-preserving Ubiquitous Social Mining via Modular and Compositional Virtual Sensors

WP2. Communication and Dissemination. Wirtschafts- und Wissenschaftsförderung im Freistaat Thüringen

Industrie 4.0 Predictive Maintenance. Kay Jeschke SAP Deutschland AG & Co. KG., Februar, 2014

HUMANGENETIK IN DER WELT VON HEUTE: 12 SALZBURGER VORLESUNGEN (GERMAN EDITION) BY FRIEDRICH VOGEL

Erweiterung einer D-Grid Ressource um eine Compute Cloud Schnittstelle

prorm Budget Planning promx GmbH Nordring Nuremberg

Agenda. Ausgangssituation (Beispiel) PaaS oder IaaS? Migrationspfade Deep Dives. IaaS via Azure Site Recovery PaaS via SQL Deployment Wizard

Exercise (Part VIII) Anastasia Mochalova, Lehrstuhl für ABWL und Wirtschaftsinformatik, Kath. Universität Eichstätt-Ingolstadt 1

How to access licensed products from providers who are already operating productively in. General Information Shibboleth login...

IBM Spectrum Protect for Virtual Environments

ONLINE LICENCE GENERATOR

Cloud und Big Data als Sprungbrett in die vernetzte Zukunft am Beispiel Viessmann

Engineering the Factory of the Future Now.Next.Beyond. Heiko Schwindt VP Automation & Electrification Solutions, Bosch Rexroth

Weather forecast in Accra

Die UN-Kinderrechtskonvention. Darstellung der Bedeutung (German Edition)

7/15/2014. Stratus everrun Enterprise. Ist die Audio Qualität OK? Fragen bitte per Chat, diese werden im Anschluss beantwortet.

TalkIT: Internet Communities Tiroler Zukunftsstiftung Donnerstag,

Die "Badstuben" im Fuggerhaus zu Augsburg

Number of Maximal Partial Clones

PONS DIE DREI??? FRAGEZEICHEN, ARCTIC ADVENTURE: ENGLISCH LERNEN MIT JUSTUS, PETER UND BOB

Provisioning SQL Databases MOC 20765

8690-ntlBrandGuideCovers_SinCH 2/2/06 10:46 AM Page 1

Symbio system requirements. Version 5.1

auf differentiellen Leitungen

Online Learning in Management

Mitglied der Leibniz-Gemeinschaft

LiLi. physik multimedial. Links to e-learning content for physics, a database of distributed sources

Nießbrauch- und Wohnrechtsverträge richtig abschließen (German Edition)

CNC ZUR STEUERUNG VON WERKZEUGMASCHINEN (GERMAN EDITION) BY TIM ROHR

Transkript:

Dynamische Erweiterung lokaler Batch Systeme durch Cloud Resourcen, Thomas Hauth, Armin Scheurer, Oliver Oberst, Günter Quast, Marcel Kunze INSTITUT FÜR EXPERIMENTELLE KERNPHYSIK (EKP) FAKULTÄT FÜR PHYSIK KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft www.kit.edu

What is Cloud Computing? Software as a Service Increasing Abstraction (SaaS) 2 the Application Platform as a Service (PaaS) the Runtime Environment Infrastructure as a Service (IaaS) the Virtual Machine

Infrastructure as a Service (IaaS) User supplied Machine Image(s) Cloud Controller boot / shutdown.. SL 5 Win2k3.. Debian 5 Node Controller Virtualized Ubuntu 10.10 Node Controller Virtualized Virtualized Node Controller pause / migrate <any OS>.. <any OS> There are Public Clouds (commercial, e.g. Amazon Web Services) and Private Clouds (non-commercial, e.g. Cloud installations at Steinbuch Centre for Computing (SCC), KIT). Most Public Cloud Provider offer a pay-as-you-go model used. Network bandwidth is still expensive for I/O intense HEP data analyses. Only pay for services actually Public Clouds are currently not an affordable solution A Private Cloud Interface can be set on top of already existing machines. 3

Why do we need to extend to the Cloud? Typical scenario during HEP analyses: During periods of high demand (before conferences, finishing publications, etc.) local clusters are often completely occupied. Buying a new Cluster is expensive and most of the time it will be underutilized. Solution: Cloud burst. Elasticity of Cloud Providers allows to dynamically add / remove Cloud resources and thus to handle peak loads. Constraints in Hard- and Software setup (e.g. Scientific Linux) prevent usage of many shared resources by HEP community. Through virtualization these resources can be made available. Installation and maintenance of HEP and Grid specific tools is a time consuming task Management can be simplified by using prepacked images (e.g. CernVM and CernVMFS) 4

Cloud Resources @ SCC OpenNebula Cloud: 2x Xeon E5520 (Nehalem) 2.27 GHz 60 nodes, each with 8 cores 500 cores 36 GB RAM 2 TB HD Eucalyptus Cloud 2x Xeon 5400 series 2,33 GHz 6 nodes, each with 8 cores 50 cores 16 GB RAM 100 GB HD 5

The ROCED Concept Responsive on-demand Cloud Enabled Deployment (ROCED) Scheduler for integrating cloud resources into local Batch Systems Written in Python 2.6 Features a modular design: Every part is implemented as an Adapter. New Adapters can be added easily. 6

ROCED Workflow (1) Batch Server cloud.q 1) monitor queue Cloud Provider 5) remove & shutdown Requirement networking and virtualization ROCED Integration 4) execute job Site 2) boot VM 3) add node execd Virtual Machine 1) Passive monitoring of one or more batch server queues. 2) Depending on queue size ROCED decides how many machines are needed. 3) The needed virtual machines are started by ROCED. 4) After startup, the new hosts are added, and become available for job processing. 5) After job execution, the VMs are automaticaly removed from the batch system and shut down. 7

ROCED Workflow (2) number of running cloud instances jobs in queue scaled by 0.02 number of registered batch nodes Stress test of ROCED (here with Amazon/Torque) 8

Setup ROCED runs on the same machine as the GridEngine. GridEngine communicates with its nodes and users via TCP. Commands to the OpenNebula host are sent via XMLRPC call. The Communication between the Cloud nodes, ROCED and the OpenNebula Server are done via SSH. No modifications to the firewall (besides VPN tunnel) needed. 9

Outlook ROCED is currently running in parallel to EKP production cluster. Due to a dedicated cloud queue, selected users can submit jobs to the cloud instances. Projects for the near future: Evaluate performance on MC production (CernVM + CernVMFS) on the cloud and possibility to use public clouds for this purpose as well. Evaluate scalability & performance using I/O intense HEP jobs on private clouds with fast network access to read and write larger amounts of data. Include multiple Cloud Sites on the same batch system. Optimize already existing cost algorithms for automatic deployment on cheapest Cloud Provider. Write documentation. ROCED has still prototype status release is under discussion. Thank you for your attention! 10