Beruflich Dokumente
Kultur Dokumente
TeraGrid Architecture
Charlie Catlett
Argonne National Laboratory
Executive Director, TeraGrid Project Chair, Global Grid Forum May 2002
TeraGrid Objectives
Create unprecedented capability
integrated with extant PACI capabilities supporting a new class of scientific research
TeraGrid Timelines
Proposal Submitted To NSF
Jan 01
McKinley systems
Initial apps On McKinley
TeraGrid Operational
Jan 02 Jan 03
Early McKinleys TeraGrid clusters at TG sites for Testing/benchmarking TeraGrid Lite Systems and Grids testbed Core Grid services deployment Advanced Grid services testing TeraGrid Networking Deployment TeraGrid Operations Center Prototype Day Ops Production
TeraGrid Prototypes
Early access To McKinley At Intel TeraGrid prototype At SC2001, 60 Itanium Nodes, 10Gbs network Basic Grid svcs Linux clusters SDSC SP NCSA O2K
Applications
Charlie Catlett (catlett@mcs.anl.gov)
64 hosts
64 hosts
64 hosts
64 hosts
64 hosts
Addl Clusters, External Networks 64 inter-switch links 64 inter-switch links 64 inter-switch links
= 4 links
64 TB RAID
Local Display
(d) Visualization
(e) Compute
Caltech
ANL
SDSC
NCSA
768 McKinley Processors (3 Teraflops, 192 nodes) 250 TB RAID storage Charlie Catlett (catlett@mcs.anl.gov)
ANL: Visualization
HPSS
UniTree
1176p IBM SP Blue Horizon Myrinet Myrinet Sun E10K Myrinet Myrinet
1500p Origin
SDSC: Data-Intensive
Charlie Catlett (catlett@mcs.anl.gov)
NCSA: Compute-Intensive
VAX Fuzzball
Across the room Across the country
256 s (4 min)
1024MB
4 MB/s
1 MB/s
.007 MB/s
13k s (3.6h)
1 TB
0.5 GB/s
78 MB/s
n x GbE (large n)
10 TB
Charlie Catlett (catlett@mcs.anl.gov)
5 GB/s
10 TB
Physical
# denotes count
Caltech/JPL
Los Angeles
Chicago
Starlight
15mi
115mi
140mi
25mi
I-WIRE
Caltech Cluster
SDSC Cluster
NCSA Cluster
ANL Cluster
I-WIRE Geography
ANL
UChicago
Gleacher Ctr
Starlight / NU-C
UChicago
Main
UIC
UI-Chicago
Status:
Done: ANL, NCSA, Starlight Laterals in process: UC, UIC, IIT
Investigating extensions to Northwestern Evanston, Fermilab, OHare, Northern Illinois Univ, DePaul, etc.
I-294
I-55
U of Chicago
UIUC/NCSA
Charlie Catlett (catlett@mcs.anl.gov) Urbana (approx 140 miles South)
Starlight (NU-Chicago)
10 12 12
4
McLeodUSA
151/155 N. Michigan Doral Plaza
UIC
4
Level(3)
111 N. Canal
UIUC/NCSA
Fiber Providers: Qwest, Level(3), McLeodUSA 10 segments, 190 route miles; 816 fiber miles, longest segment: 140 miles 4 strands minimum to each site IIT ~$4M for fiber- all contracts are 20 year IRU
Charlie Catlett (catlett@mcs.anl.gov)
State/City Complex
2 2
UChicago
I-Wire Transport
TeraGrid Linear 3x OC192 1x OC48 First light: 6/02 Starlight Linear 4x OC192 4x OC48 ( 8x GbE) Operational Metro Ring 1x OC48 per site First light: 8/02 UIC Argonne Starlight (NU-Chicago)
UC Gleacher Ctr
450 N. Cityfront
Qwest
455 N. Cityfront
UIUC/NCSA
State/City Complex
James R. Thompson Ctr City Hall State of IL Bldg
IIT
UChicago
Each of these three ONI DWDM systems have capacity of up to 66 channels, up to 10 Gb/s per channel Protection available in Metro Ring on a per-site basis
Charlie Catlett (catlett@mcs.anl.gov)
Job Control
Mgmt Node
Common Storage
Myrinet Fabric
Charlie Catlett (catlett@mcs.anl.gov)
User
User Proxy
Globus Credential
1176p IBM SP Blue Horizon Myrinet Myrinet Sun E10K Myrinet Myrinet
1500p Origin
NCSA: Compute-Intensive
Charlie Catlett (catlett@mcs.anl.gov)
Extending TeraGrid
Adoption of TeraGrid specifications, protocols, APIs
What protocols does it speak, what data formats are expected, what features can I expect (how does it behave) Service Level Agreements (SLA)
but sites also provide additional services that can be discovered and exploited.
IA-64 Linux TeraGrid Cluster Runtime File-based Data Service IA-64 Linux Cluster Interactive Development
Charlie Catlett (catlett@mcs.anl.gov)
Standards
Certificate Authority Certificate Authority Certificate Authority Certificate Authority TeraGrid Certificate Authority
Cyberinfrastructure
If done openly and well
other IA-64 cluster sites would adopt TeraGrid service specifications, increasing users leverage in writing to the specification others would adopt the framework for developing similar services on different architectures
Visualization Services
Data/Information File-based Data Service Collection-based Data Service Relational dBase Data Service
Status Query
For many services, publish query interface
Scheduler: queue status Compute, Visualization, etc. services: attribute details Network Weather Service Allocations/Accounting Database: for allocation status
Charlie Catlett (catlett@mcs.anl.gov)