Sie sind auf Seite 1von 32

Use cases and a practical example

for Grid and Cloud integration


Elisabetta Ronchieri, INFN CNAF
elisabetta.ronchieri@cnaf.infn.it
EPIKH School, Beijing, China
May 18, 2011
Outline
• Clouds and Grids
• An example of Grid and Cloud Integration
• Summary

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


The compulsory slide on
definitions: Grid vs. Cloud
GRID: The essence of the [definition] can be captured in a simple checklist,
according to which a Grid is a system that:
– coordinates resources that are not subject to centralized control...
– ... using standard, open, general-purpose protocols and interfaces...
– ... to deliver nontrivial qualities of service.
(I. Foster, What is the Grid? A three Point Checklist, 2002)

CLOUD: Cloud Computing is a model for enabling ubiquitous, convenient,


on-demand network access to a shared pool of configurable computing
resources (e.g., networks, servers, storage, applications, and services) that
can be rapidly provisioned and released with minimal management effort or
service provider interaction (NIST Working Definition of Cloud Computing).

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


In practice
Distributed computing infrastructures (whatever they
incarnation is) should
• provide solutions for resource discovery,
usage, policing
• honor contracted service level agreements
• ensure proper security enforcement
measures (authentication, authorization) are
taken.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


More pragmatically
Grids are about federating resources.
Clouds are about provisioning resources.

• Grids never found the right business model in the industry, so


they failed to get industry uptake
• What will happen when we want to federate clouds?
– Does anybody want to? How about authentication, authorization,
accounting, and brokering?
– Different domains, like industrial, e-Science, e-Government will have
different requirements and perceived risks for cloud computing.
‘Cloud is a semaless extension of the Grid’, Dan Read, OGF30.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


The Grid, from a User’s Perspective
How use Architecture characteristics
• Be part of a Virtual Organization. If • Emphasis on sharing resources at a
you can’t find one, you must set it up virtual organizational level
• Access the Grid via a User Interface, • Mainly adopted by scientific
authenticating via x.509 digital communities, with limited industry
certificate uptake
• Specify your job requirements via a • Typically batch-focused, with limited
Job Description Language (JDL) provision for interactive, dynamic
• Your job requirements will be usage.
matched against available resources.
If suitable resources are found, your
job will sooner or later run
somewhere.
• You will be able to check job status,
collect output, store, find and
retrieve data.
18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School
The Cloud, from a User’s
Perspective
How use Architecture characteristics
• Identify a Service Provider • Emphasis on ease of access to
• Allocate your seemingly infinite resources for individual users
desired resources, typically through • Initiated within the cmmercial sector,
Web Applications with wide success
• Gain access to your resources (which • Several level of abstraction are
can be services, software possible: infrastructure as a service,
organizations, hardware cores) platform as a service, software as a
through pay-as-you-go models service, and so on.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Which use cases are really there
for the Cloud?
See the Whitepaper by the Cloud
Computing Use Case Discussion
Group
(http://www.opencloudmanifesto.or
g/Cloud_Computing_Use_Cases_Whi
tepaper-2_0.pdf)

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Mantras: Service, Cost Savings, Consolidation,
Infinity, Utility. All On-demands
January 27, 2010- The UK government has unveiled a sweeping strategy to
create its own internal cloud computing system, such as that used by Google,
Microsoft and Amazon, as part of a radical plan that it claims could save up to
£23bn a year from an annual bill of at least £16bn.J
– The key part of the new strategy [...] will be the concentration of governement
computing power into a series of about a dozen highly secure data centres, each costing
up to $250m to build, which will replace more than 500 presently used by central
government, police forces and local authorities.
– By 2015, the strategy suggests, 80% of central governement desktops could be supplied
through a shared utility service.
– The new cloud system will not include the security services such as M15 or M16, which
have their own, separate systems.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Can this be really used in the
scientific world?
It appears that at lest in some cases the answer can be yes. Take
for example the Gaia project* of the European Space Agency,
whose goal is to survey about a billion stars to make an
extremely precise three-dimensional map of our galaxy:
– For the full 1 billion star project numbers (the Gaia Science Operations
Development team) calculated that they will analyze 100 million
primary stars, plus 6 years of data, which will require a total of 16,200
hours of a 20-node EC2 cluster. That’s an estimated total computing
cost of 344,000 EUR. By comparison, an in-house solution would cost
roughly 720,000 EUR (at today’s prices) – which does not include
electricity or storage or sys-admin costs. Storage alone would be an
additional 100,000 EUR.

*(http://www.rssd.esa.int/index.php?project=GAIA&page=index)
18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School
Problems solved with Clouds? Not
really
Based on the successful multi-year experience built on Grids, some heavy
users note that for what regards Clouds:
• Inter-operation of multiple cloud providers is not a reality yet, and vendor lock-in
is a big issue;
• Political, legal, or security-related considerations discourage the idea of
outsourcing control to external entities;
• These concerns are particularly acute in the case of the interconnection of
different components: computing, storage and network resources;
• For example, given the level of optimization that was neded for the interaction
between storage and computing resources in High-Energy-Physics experiments, it
is debatable whether the same performance can be achieved by general purpose
infrastructures, like commercial clouds.
– Customers will pay either in terms of latencies, or in terms of extra costs.
Can we adapt and reuse our existing Grid related know how and
infrastructures?
18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School
New requirements to existing Grid
Infrastructures
While Grid Interfaces are widely used among large scientific communities,
Cloud computing offers significant advantages for many reasons such as pay-
as-you-go models, and simplified access.
• Ideally, though, one would like to adopt Cloud services so that:
– Resources are shared between acess interfaces (Grid, Cloud or else)
– Scalability is ensured
– Existing services and agreements are not required to change substantially
– Resource centers policies are honored and know-how is preserved
– New services can attract both existing and new customers.

These are both key challenges and opportunities for existing Grid
infrastructures.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Examples of services requested
today
Some of the typical new service requests we receive:
• Customer-definable softwre environments. This is a feature that finds several uses
in traditional Grids as well;
• Setting up dynamic pools of virtual servers such as user interfaces, or worker
nodes for parallel interactive analysis. More generally, flexibility allocating
hardware resources through complex advance-reservation requests.
• Instantiating pre-packaged, ready-to-go services.
• Truly distributed, on-demand, Cloud storage
• Not everybody, speaks Grid: providing access to distributed, traditional Grid
infrastructures as if they were not Grids,. This might be offered to non-traditional
users, like public administrations, or to the private sector.

The key problem is one of integration between several access interfaces,


that is resources and multiple access interfaces (Grid, Cloud or other).

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Grids and Clouds: common
grounds
Grids and Clouds (abstracting from the concept of a Grid job, which one
should regard as an implementation detail) basically target the use of
resources.
• The two terms come from differnt grounds, but really they are just
different interfaces to access resources:
– Users may actually benefit joining an existing infrastructure, rather than
building a new one.
– Sharing of data and resources across Grid and Cloud interfaces should be
encouraged.
– Leveraging on multi-year investments and know-how on Grids to
incrementally evolve and build new services is a strategic decision.
– Grids like EGEE/EGI are production infrastructures, serving the needs of many
(big and small) research communities.

The question is then how in practive can you integrate Grids and Cloud.
18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School
Interactions between computing, storage,
network resources: a simple example
A minimalistic workflow for analysis performed in the context of
a physics experiment is such that in general a user will:
• Decide which analysis to perform for his new research
• Develop the code which performs it; this is typically a high-level macro or
a plugin of some experiment-based software framework
• Ask a system about the data requirements
– Which files contain the needed information. This info is often in an
experiment-based metadata repository or file catalogue.
• Ask another system to process his analysis. This could happen via a Grid, a
Cloud, local (virtual) batch farms, or possibly even one’s own computer.
• Collect the results.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


A possible approach to the
workflow
One could:
• Carefully choose where to send a processing job, for example to the place
which best matches the needed data set;
• Use tools to create local replicas of the needed data files. High Energy
Physics data files are typically big and fairly static, so it is better to exploit
locality if possible.
– In the right places
– Eventually use tools also to push new produced data files to the
official repositories. Think here of complementarity between Grid and
Cloud tasks
– If overdone, this can be quite time and resource consuming
• Any variation is possible. For example pre-populate everything before
sending jobs.
– What is best is left to you to decide
18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School
Outline
• Clouds and Grids
• An example of Grid and Cloud Integration
• Summary

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


How to cloudify the [Scientific]
World, then?
• An evolutionary model is needed possibly
offering ways
– to interconnect multiple Distributed Computing
infrastructures
– to continue to support existing scientific computing
patterns
• A mix of public and private clouds is a
concrete possibility

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


A practical Grid and Cloud
integration example: WNoDeS
The Worker Nodes on Demand Service (WNoDeS) is a framework developed
by INFN.
• It is built around a tight integration with a LRMS (a batch system).
• It is running in production at the INFN Tier1 Computing Centre since
November 2009.
– It has been also deployed at INFN Legnaro.
– Tests are ongoing on other INFN sites.
• It is focused on making resource polymorphism easy and on providing
flexibility to both users and resource providers in a production
environment.
• It provides transparent user interfaces for Grids and Clouds.
• It reuses several existing and proven components (like Grid authentication
and authorization), current workflows, data center schedulers.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Key WNoDeS Characteristics
The main architectural characteristics are:
• fully integration with existing computing resource scheduling, policing,
monitoring and accounting workflows
• on-demands virtual resource provisioning
• VLAN support to dynamically isolate Virtual Machines depending on the
service type and customer requests
– by using Linux KVM as VM manager
• Support for users to select and access WNoDeS-based resources through
Grid, Cloud interfaces, or also through direct job submissions
– by using either command line or a Web portal
• Scalability having to handle thousands of VMs
• No concept of ‘Cloud over Grid’ or ‘Grid over Cloud’
– Just use all resources and present them to the user as the user wants to see them

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


WNoDeS: overall architecture
framework
Key points:
1. On-demand virtual service
provisioning;
2. Flexible, integrated
scheduling policies;
3. Multiple access interfaces;
4. Multiple authentication
methods;
5. Integrated access to
existing infrasturctures;
6. Access to external
resources.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


WNoDeS: VM layer
Policing, resource allocation and
scheduling are directly demanded
to an underlying batch system
called LRMS.
• All physical nodes have the following
model:
• An hypervisor able to instantiate Virtual
Machines;
• A special Virtual Machine called bait,
part of LRMS, responsible to publish
available local resources to the LRMS
and to attract jobs on the local physical
system;
• Zero or more running Virtual Machines
to execute jobs, instantiated on-demand
by the local hyerpvisor upon appropriate
requests made by the local bait.
• Virtual Machines begin to exist in
the configuration requested by the
user, only when needed.
• When the Virtual Machine is not
needed any more, it may be
destroyed.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


WNoDeS: Grid and Local Access
WNoDeS-based resources may be
accessed transparently by users
through the local batch system and
the Grid.
By the Grid:
• Jobs of some VOs are automatically
directed on VMs without user
intervention.
• Grid users may explicitly specify the
VM they want to use, adding a CE
requirement statement in the JDL for
their jobs.
• Users must be authorized by resource
providers to locally use the selected
VM images through proper WNoDeS
configuration.
• This works today with the current
CE-CREAM software.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


WNoDeS: Cloud Access 1/2
WNoDeS allows generic
instantiation of compute cores.
• They are allocated to the user
requesting them, and released only
when the user explicitly says so
• User chooses resource size, operating
system, and gets root access to them
• It can be used to develop, instal, test
new software, run services, create
small ad-hoc farms, and so on.
Cloud resources are taken from the
same resource pool used e.g. for
Grid resources to optimize the
usage of resources.
WNoDeS does not need to dedicate
resources to batch, Grid, Cloud, but
one may do so if this is desired.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


WNoDeS: Cloud Access 2/2
WNoDeS supports the Open Cloud
Computing Interface (OCCI)
standard, at the moment rarely used
by users.
Cloud instantiations typically
happen through a Web-based portal
in which VOMS/gLite Argus for
authentication and authorization are
integrated allowing flexible policies
to determine who you are and what
you are allowed to do.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


WNoDeS: Virtual Interactive Pools
Access
WNoDeS integrates the Virtual
Interactive Pools interface.
• It is a way for a local user to
dynamically and automatically
instantiate a compute node for
interactive access.
• User can specify RAM, number of
CPUs, and so on.
• It can be used for software
development, to submit jobs, and so
on.
• It has been tested by the local CMS
group in a Tier-3 environment
hosted at CNAF and intergrated int
he bigger Tier-1

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


WNoDeS: Authentication Gateway
WNoDeS integrates different
mechanisms of user authentication
through the authentication gateway
layer.
• It lets Cloud users accessing Grid
resources allowing Cloud users to
exploit previous investments with
minimal changes to the sites.
• This solution provides the user an
X.509 short lived certificate and
registers the user into a
dedicated VO that needs to be
accepted by the site.
• It lets Grid users accessing Cloud
resources. using their Grid
credentials.
• This solution supports the
integration with VOMS/gLite
Argus to validate user’s
credentials and access policies.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Outline
• Clouds and Grids
• An example of Grid and Cloud Integration
• Summary

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Where are we?
The WNoDeS framework is running at the INFN Tier-1
• It is an example of a framework targeted at integrating Grid and Cloud
resources
– running in production mode at a sizable computing center.
• It encourages integration with regard to data production and
consumption using both Grid and Cloud interfaces.
– Data produced via the Cloud can be put in Grid-accessible storage
repositories, and viceversa.
• It is being developed by following an integrated approach:
– reusing some of the Grid developments of the past decade to achieve
interconnection of multiple Clouds;
– being the interconnection of Grids a reality today.
• WNoDeS 2 is targeted for public release in September 2011
– supporting among other things PBS/Torque and Platform LSF as batch systems.
18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School
What do we need?
We need to ride the Cloud wave in an organized way to keep on offering our
public (users and country) domains valuable distributed computing
services.
• At the policy level
– In relation to e-Infrastructures for e-Science or e-Governement; national and logical
digital agendas; privacy, legal and ethical issues; and sustainability (are there really
infinite resources?).
• At the technical level
– Investigating and investing into new possibilities; evaluating feasibility of current
solutions; working on development and implementation of Cloud frameworks.
• By following an integrated approach
– Reusing our existing know-how and multi-years experience.

‘Cloud is a seamless extension of the Grid’, Dan Reed, OGF 30.

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


That’s it – It’s shoppable although
perhaps not yet shippable
• [Cloud computing is] nothing more than a faddish term for the
established concept of computers linked by network. A cloud is water
vapor (Larry Ellison, co-founder and CEO, Oracle Corporation, September
2009).
• Q: What is Oracle’s Cloud Computing strategy? A: Oracle has two cloud
computing objectives. The first is to ensure that [it] is fully enterprise-
grade to enable enterprise adoption. [...] The second [is] to support both
public and private cloud computing to give customers choice (Oracle
Cloud Computing FAQ, October 2010).
• The truth is rarely pure an never simple (Oscar Wilde, The importance of
Being Earnest, 1895)

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School


Acknowledgements

Davide Salomoni, davide.salomoni@cnaf.infn.it

Further Info:
WNoDeS Web: http://web.infn.it/wnodes

18/05/2011 Elisabetta Ronchieri, EPIKH School, Beijing School

Das könnte Ihnen auch gefallen