Beruflich Dokumente
Kultur Dokumente
Outline
Introduction to Grid Computing Some Definitions Grid Architecture The Programming Problem The Globus Toolkit
Introduction, Security, Resource Management, Information Services, Data Management
Enable communities (virtual organizations) to share geographically distributed resources as they pursue common goals -- assuming the absence of
central location, central control, omniscience, existing trust relationships.
Introduction to Grid Computing 3
December 4, 2002
wide-area dissemination
real-time collection
archival storage
tomographic reconstruction DOE X-ray grand challenge: ANL, USC/ISI, NIST, U.Chicago
December 4, 2002 Introduction to Grid Computing 5
~100 MBytes/sec
Tier 0
Italy Regional Centre
Tier 1
Tier 2
~622 Mbits/sec Institute Institute Institute ~0.25TIPS Physics data cache ~1 MBytes/sec
Caltech ~1 TIPS
Tier2 Centre Centre Tier2 Tier2 Centre Tier2 Centre ~1 TIPS ~1 TIPS ~1 TIPS ~1 TIPS
Institute
Physicists work on analysis channels. Each institute will have ~10 physicists working on one or more channels; data for these channels should be cached by the institute server
Tier 4
Physicist workstations
December 4, 2002
December 4, 2002
Broader Context
Grid Computing has much in common with major industrial thrusts
Business-to-business, Peer-to-peer, Application Service Providers, Storage Service Providers, Distributed Computing, Internet Computing
Why Now?
Moores law improvements in computing produce highly functional endsystems The Internet and burgeoning wired and wireless provide universal connectivity Changing modes of working and problem solving emphasize teamwork, computation Network exponentials produce dramatic changes in geometry and geography
December 4, 2002
11
Network Exponentials
Network vs. computer performance
Computer speed doubles every 18 months Network speed doubles every 9 months Difference = order of magnitude per 5 years
1986 to 2000
Computers: x 500 Networks: x 340,000
2001 to 2010
Computers: x 60 Networks: x 4000
Moores Law vs. storage improvements vs. optical improvements. Graph from Scientific American (Jan2001) by Cleo Vilett, source Vined Khoslan, Kleiner, Caufield and Perkins.
December 4, 2002 Introduction to Grid Computing 12
Close collaboration with real Grid projects in science and industry Development and promotion of standard Grid protocols to enable interoperability and shared infrastructure Development and promotion of standard Grid software APIs and SDKs to enable portability and code sharing The Globus Toolkit: Open source, reference software base for building grid infrastructure and applications Global Grid Forum: Development of standard protocols and APIs for Grid computing
December 4, 2002 Introduction to Grid Computing 13
Focus
Create & deploy group collaboration systems using commodity technologies Grid testbed linking IBM laboratories
g IBM g www.cs.sandia.gov/
Create operational Grid providing access to resources at three U.S. DOE weapons discom DOE Defense Programs laboratories Create operational Grid providing access to resources & applications at U.S. DOE science laboratories & partner universities Delivery and analysis of large climate model datasets for the climate research community Create & apply an operational grid for applications in high energy physics, environmental science, bioinformatics
14
g eu-datagrid.org
European Union
December 4, 2002
EuroGrid, Grid g eurogrid.org Interoperability European Union New (GRIP) Fusion Collaboratory
g fusiongrid.org
New DOE Off. Science
Globus Project
g globus.org
DARPA, DOE, NSF, NASA, Msoft
GridLab GridPP
g gridlab.org
New European Union
g gridpp.ac.uk
New U.K. eScience
g grids-center.org Grid Research Integration Dev. & NSF Support Center New
December 4, 2002
URL/Sponsor
g hipersoft.rice.edu/
grads; NSF
Focus
Research into program development technologies for Grid applications Technology R&D for data analysis in physics expts: ATLAS, CMS, LIGO, SDSS Create and apply a production Grid for aerosciences and other NASA missions Create international Data Grid to enable large-scale experimentation on Grid technologies & applications Create and apply a production Grid for earthquake engineering Create and apply production Grids for data analysis in high energy and nuclear physics experiments
16
g griphyn.org
NSF
Information Power ipg.nasa.gov g NASA Grid International g ivdgl.org Virtual Data Grid NSF Laboratory New Network for g neesgrid.org Earthquake Eng. NSF Simulation Grid New Particle Physics Data Grid
g ppdg.net
DOE Science
December 4, 2002
URL/Sponsor
g teragrid.org
New NSF
Focus
U.S. science infrastructure linking four major resource sites at 40 Gb/s Support center for Grid projects within the U.K. Technologies for remote access to supercomputers
Also many technology R&D projects: e.g., Condor, NetSolve, Ninf, NWS See also www.gridforum.org
December 4, 2002
17
Caltech
External Networks Site Resources HPSS
Argonne
External Networks
NCSA/PACI 8 TF 240 TB
www.teragrid.org
18
Tier0/1 facility Tier2 facility Tier3 facility 10 Gbps link 2.5 Gbps link 622 Mbps link Other link
www.ivdgl.org
19
Grid Forum
www.gridforum.org
December 4, 2002
20
Some Definitions
December 4, 2002
22
Resource
An entity that is to be shared
E.g., computers, storage, data, software
December 4, 2002
23
Network Protocol
A formal description of message formats and a set of rules for message exchange
Rules may define sequence of message exchanges Protocol may define state-change in endpoint, e.g., file system state change
Examples of protocols
IP, TCP, TLS (was SSL), HTTP, Kerberos
December 4, 2002 Introduction to Grid Computing 24
Examples
GSS API (security), MPI (message passing)
December 4, 2002 Introduction to Grid Computing 26
December 4, 2002
27
Syntax
Rules for encoding information, e.g.
XML, Condor ClassAds, Globus RSL X.509 certificate format (RFC 2459) Cryptographic Message Syntax (RFC 2630)
December 4, 2002
28
Application MPI API MPICH-P4 SDK Different message formats, exchange sequences, etc. MPICH-P4 protocol TCP/IP
30
Grid Architecture
Guidance
Identify key areas in which services are required
Prescriptive
Define standard Intergrid protocols and APIs to facilitate creation of interoperable Grid systems and portable applications
December 4, 2002 Introduction to Grid Computing 33
Adaptation Intrusion detection Resource management Accounting & payment Fault management System evolution Etc. Etc.
institutions
Requires systems
> protocols, services
December 4, 2002 Introduction to Grid Computing 36
December 4, 2002
37
Application
40
Connectivity APIs
Connectivity Protocols Local Access APIs and Protocols
Fabric Layer
December 4, 2002 Introduction to Grid Computing 41
Important Points
Built on Internet protocols & services
Communication, routing, name resolution, etc.
Layering here is conceptual, does not imply constraints on who can call what
Protocols/services/APIs/SDKs will, ideally, be largely self-contained Some things are fundamental: e.g., communication and security But, advantageous for higher-level functions to use common lower-level functions
December 4, 2002 Introduction to Grid Computing 42
Design principles
Keep participation cost low Enable local control Support for adaptation IP hourglass model
Core services
Local OS
43
December 4, 2002
Few constraints on low-level technology: connectivity and resource level protocols form the neck in the hourglass Defined by interfaces not physical characteristics
December 4, 2002
45
GSI: www.gridforum.org/security/gsi
46
Others emerging: Catalog access, code repository access, accounting, etc. All built on connectivity layer: GSI & IP
December 4, 2002 Introduction to Grid Computing
Replica catalogs Replication services Co-reservation and co-allocation services Workflow management services Etc.
December 4, 2002 Introduction to Grid Computing
Condor: www.cs.wisc.edu/condor
48
Resource Access to data, access to computers, access to network performance data Connect Communication, service discovery (DNS), authentication, authorization, delegation Fabric Storage systems, schedulers
December 4, 2002 Introduction to Grid Computing
49
Collective Coherency control, replica selection, task management, (App) virtual data catalog, virtual data code catalog, Collective Replica catalog, replica management, co-allocation, (Generic) certificate authorities, metadata catalogs, Resource Access to data, access to computers, access to network performance data,
Communication, service discovery (DNS), Connect authentication, authorization, delegation Fabric Storage systems, clusters, networks, network caches,
December 4, 2002 Introduction to Grid Computing 50
Most applications have been written from scratch, with or without Grid services Application-specific libraries have been shown to provide significant benefits No new language, programming model, etc., has yet emerged that transforms things
But certainly still quite possible
December 4, 2002 Introduction to Grid Computing 53
Requires services for authentication, resource allocation, executable staging, output, etc. Programs run in wide area without change
Modulo accommodating heterogeneous communication performance
www.globus.org/mpi
55
Cactus
www.cactuscode.org
56
Condor converts collections of distributively owned workstations and dedicated clusters into a distributed highthroughput computing facility Emphasis on policy management and reliability www.cs.wisc.org/condor
Introduction to Grid Computing 57
December 4, 2002
Object-Based Approaches
Grid-enabled CORBA
NASA Lewis, Rutgers, ANL, others CORBA wrappers for Grid protocols Some initial successes
Legion
U.Virginia Object models for Grid components (e.g., vault=storage, host=computer)
December 4, 2002
58
Portals
N-tier architectures enabling thin clients, with middle tiers using Grid functions
Thin clients = web browsers Middle tier = e.g. Java Server Pages, with Java CoG Kit, GPDK, GridPort utilities Bottom tier = various Grid resources
Globus Toolkit
A software toolkit addressing key technical problems in the development of Grid enabled tools, services, and applications
Offer a modular bag of technologies Enable incremental development of gridenabled tools and applications Implement standard Grid protocols and APIs Make available under liberal open source license
December 4, 2002
62
General Approach
Define Grid protocols & APIs
Protocol-mediated access to remote resources Integrate and extend existing standards On the Grid = speak Intergrid protocols
Key Protocols
The Globus Toolkit centers around four key protocols
Connectivity layer:
> Security: Grid Security Infrastructure (GSI)
Resource layer:
> Resource Management: Grid Resource Allocation Management (GRAM) > Information Services: Grid Resource Information Protocol (GRIP) > Data Transfer: Grid File Transfer Protocol (GridFTP)
December 4, 2002
65
GSI in Action
User
Single sign-on via grid-id & generation of proxy cred. Or: retrieval of proxy cred. from online repository
User Proxy
Proxy credential
Remote process creation requests* GSI-enabled Authorize Site A GRAM server Map to local id (Kerberos) Create process Generate credentials Computer Process
Kerberos ticket Local id Restricted proxy
Ditto
Site B (Unix)
Computer Process
Local id Restricted proxy
December 4, 2002
Resource Management
The Grid Resource Allocation Management (GRAM) protocol and client API allows programs to be started and managed on remote resources, despite local heterogeneity Resource Specification Language (RSL) is used to communicate requirements A layered architecture allows applicationspecific resource brokers and co-allocators to be defined in terms of GRAM services
Integrated with Condor, PBS, MPICH-G2,
December 4, 2002 Introduction to Grid Computing 67
RSL specialization
Queries & Info Information Service
Application
Ground RSL Co-allocator
Simple ground RSL Local resource managers GRAM LSF GRAM Condor
Introduction to Grid Computing
GRAM NQE
68
December 4, 2002
Reference implementations
Existing clients and servers: wuftpd, ncftp Flexible, extensible libraries in Globus Toolkit
December 4, 2002 Introduction to Grid Computing 69
Large numbers of distributed sensors with different properties Need for different views of this information, depending on community membership, security constraints, intended purpose, sensor type
December 4, 2002 Introduction to Grid Computing 70
Summary
The Grid problem: Resource sharing & coordinated problem solving in dynamic, multiinstitutional virtual organizations Grid architecture emphasizes systems problem
Protocols & services, to facilitate interoperability and shared infrastructure services
Globus Toolkit: APIs, SDKs, and tools which implement Grid protocols & services
Provides basic software infrastructure for suite of tools addressing the programming problem
December 4, 2002 Introduction to Grid Computing 72