Beruflich Dokumente
Kultur Dokumente
Outline
Kenapa perlu komputasi parallel: Masalah membutuhkan sumber daya komputasi tinggi Pengenalan komputasi parallel Cluster Computing Research in Cluster Computing Rocks for Cluster Computing
2
CAD/CAM
Digital Biology
Military Applications
Parallel I/O
Parallel I/O
10
11
12
13
Windows of Opportunities
Parallel Processing
Network RAM
Software RAID
Multipath Communication
Redundant array of inexpensive disks Use the arrays of workstation disks to provide cheap, highly available, & scalable file storage Possible to provide parallel I/O support to applications Use arrays of workstation disks to provide cheap, highly available, and scalable file storage
Use multiple networks for parallel data transfer between nodes
14
Computer Analogy
Using faster hardware Optimized algorithms and techniques used to solve computational tasks Multiple computers to solve a particular 15 task
Era of Computing
Rapid technical advances
the recent advances in VLSI technology software technology
OS, PL, development methodologies, & tools
Parallel computing
one of the best ways to overcome the speed bottleneck of a single processor good price/performance ratio of a small cluster-based parallel computer
16
In Summary
Need more computing power
Improve the operating speed of processors & other components
constrained by the speed of light, thermodynamic laws, & the high financial costs for processor fabrication
Technology Trends...
Performance of PC/Workstations components has almost reached performance of those used in supercomputers
The rate of performance improvements of commodity systems is much rapid compared to specialized systems.
18
Microprocessors (50% to 100% per year) Networks (Gigabit SANs); Operating Systems (Linux,...); Programming environment (MPI,); Applications (.edu, .com, .org, .net, .shop, .bank);
Technology Trends
19
Trend
[Traditional Usage] Workstations with Unix for science & industry vs PCbased machines for administrative work & work processing [Trend] A rapid convergence in processor performance and kernellevel functionality of Unix workstations and PC-based machines
20
Meiko Myrias Thinking Machines Saxpy Scientific Computer Systems (SCS) Soviet Supercomputer s 22 Suprenum
23
Towards Clusters
24
the rapid improvement in the availability of commodity high performance components for workstations and networks
Low-cost commodity supercomputing
from specialized traditional supercomputing platforms to cheaper, general purpose systems consisting of loosely coupled components built up from single or multiprocessor PCs or workstations
25
PDA Clusters
1960
1980s
1990
1995+
2000+ 26
27
What is Cluster ?
A cluster is a type of parallel or distributed processing system, which consists of a collection of interconnected stand-alone computers cooperatively working together as a single, integrated computing resource. A node a single or multiprocessor system with memory, I/O facilities, & OS generally 2 or more computers (nodes) connected together in a single cabinet, or physically separated & connected via a LAN appear as a single system to users and applications provide a cost-effective way to gain features and benefits
28
Cluster Architecture
Sequential Applications Sequential Applications Sequential Applications
Parallel Applications Parallel Applications Parallel Applications Parallel Programming Environment Cluster Middleware (Single System Image and Availability Infrastructure) PC/Workstation
Communications Software Network Interface Hardware
PC/Workstation
Communications Software Network Interface Hardware
PC/Workstation
Communications Software Network Interface Hardware
PC/Workstation
Communications Software Network Interface Hardware
Cluster Components
30
System CPUs
Processors
Intel x86 Processors Pentium Pro and Pentium Xeon AMD x86, Cyrix x86, etc. Digital Alpha Alpha 21364 processor integrates processing, memory controller, network interface into a single chip IBM PowerPC Sun SPARC SGI MIPS HP PA
32
System Disk
Disk and I/O
Overall improvement in disk access time has been less than 10% per year Amdahls law
Speed-up obtained by from faster processors is limited by the slowest system component
Parallel I/O
Carry out I/O operations in parallel, supported by parallel file system based on hardware or software RAID 33
The new concept in OS services support multiple threads of control in a process itself
parallelism within a process
multithreading POSIX thread interface is a standard programming environment
Trend Modularity MS Windows, IBM OS/2 34 Microkernel provide only essential OS services
Support distributed computing & is able to store & retrieve distributed information OpenWindows allows application to be run on remote systems
37
38
Giganet
ServerNet2
SCI ~80 6
~$1.5K
Now
Gigabit Ethernet
$1.5K
Now
Now 1000s
Now 1000s
Now 1000s
Q200 1000s
Now 64K
Now 1000s
Firmware on adapter
Software TCP/IP, VIA
3rd party
Quadrics/ Compaq
3rd Party
Compaq/3rd party
3rd Party
MPICH TCP/IP
41
42
1.28 Gbps full duplex interconnection network Use low latency cut-through routing switches, which is able to offer fault tolerance by automatic mapping of the network configuration Support both Linux & NT Advantages
Very low latency (5s, one-way point-to-point) Very high throughput Programmable on-board processor for greater flexibility Expensive: $1500 per host Complicated scaling: switches with more than 16 ports are unavailable
43
Disadvantages
Network service are designed as hierarchical stack of protocols with relatively low-level communication API, provide means to implement wide range of communication methodologies
RPC DSM Stream-based and message passing interface (e.g., MPI, PVM)
45
MPI (Message Passing Interface) PVM (Parallel Virtual Machine) Parametric Programming Software DSMs (Shmem) Compilers
Linux, NT, on many Supercomputers
RAD (rapid application development tools) Debuggers Performance Analysis Tools Visualization Tools
GUI based tools for PP modeling
46
48
Application Target
High Performance (HP) Clusters
Grand Challenging Applications
Node Ownership
Dedicated Clusters Non-dedicated clusters
Adaptive parallel computing Communal multiprocessing
50
Node Hardware
Clusters of PCs (CoPs)
Piles of PCs (PoPs)
52
Node Configuration
Homogeneous Clusters
All nodes will have similar architectures and run the same OSs
Heterogeneous Clusters
All nodes will have different architectures and run different OSs
53
Departmental Clusters (#nodes: 10s to 100s) Organizational Clusters (#nodes: many 100s) National Metacomputers (WAN/Internet-based) International Metacomputers (Internet-based, #nodes: 1000s to many millions)
Grid Computing Web-based Computing Peer-to-Peer Computing
54
Cluster Programming
55
Levels of Parallelism
PVM/MPI
Task i-l Task i Task i+1
Code-Granularity Code Item Large grain (task level) Program
Medium grain (control level) Function (thread) Fine grain (data level) Loop (Compiler)
Threads
Compilers CPU
a ( 0 ) =.. b ( 0 ) =..
a ( 1 )=.. b ( 1 )=..
a ( 2 )=.. b ( 2 )=..
Load
Parametric Computations
Nimrod-G and Gridbus Data Grid Broker
Automatic Parallelising Compilers Parallel Libraries & Computational Kernels (e.g., NetSolve)
57
Hardware DSM
61
URL
http://science.nas.nasa.gov/Software/AIMS
http://www.mcs.anl.gov/mpi/mpich http://www-pablo.cs.uiuc.edu/Projects/Pablo/ http://www.cs.wisc.edu/paradyn http://www-pablo.cs.uiuc.edu/Projects/Pablo/ http://www.pallas.de/pages/vampir.htm http://www.pallas.com/pages/dimemas.htm
AIMS
MPE Pablo Paradyn
SvPablo
Vampir Dimenma s Paraver
http://www.cepba.upc.es/paraver
64
PARMON
Called K-CAP Consist of compute nodes, a management node, & a client that can control and monitor the cluster K-CAP uses a Java applet to connect to the management node through a predefined URL address in the cluster
A comprehensive environment for monitoring large clusters Use client-server techniques to provide transparent access to all nodes to be monitored parmon-server & parmon-client
65
Cluster Applications
66
Cluster Applications
Numerous Scientific & engineering Apps. Business Applications: E-commerce Applications (Amazon, eBay); Database Applications (Oracle on clusters). Internet Applications: ASPs (Application Service Providers); Computing Portals; E-commerce and E-business. Mission Critical Applications: command control systems, banks, nuclear reactor control, star-wars, and handling life threatening situations.
67
Platform
PCs
Communications
OS
Other
MPI/PVM. Sockets and HPF
Solaris + AM, PVM, MPI, GLUnix + xFS HPF, Split-C NT or Linux connection and global resource manager + LSF Solaris + Globalization layer Java-fronted, FM, Sockets, Global Arrays, SHEMEM and MPI C++ and CORBA
68
Solaris MC
Solaris-supported
In 2003 List of Top 500 Supercomputers, over 50% of them are based on clusters and many of them are deployed in industries.
72
References
High Performance Cluster Computing: Architectures and Systems, Book Editor: Rajkumar Buyya, Slides: Hai Jin and Raj Buyya http://www.buyya.com/cluster Bahan kuliah Topik Dalam Komputasi Paralel, Fasilkom UI
73