Beruflich Dokumente
Kultur Dokumente
Definition of a Cluster
A cluster is a collection of multiple independent systems working together under a management framework for increased service availability.
Parallel
The service group can be online on multiple cluster systems simultaneously. An example is Oracle Real Application Clusters (RAC).
You can specify combinations of these characteristics to determine how dependencies affect service group behavior, as shown in a series of examples in this lesson.
App1
DB1
App2
DB2
DB3
Test
Prod
Definition of a Resource
Resources are VCS objects that correspond to the hardware or software components of an application service.
Each resource must have a unique name throughout the cluster. Choosing names that reflect the service group name makes it easy to identify all the resources in that group, for example, WebIP in the WebSG group. Resources are always contained within service groups. Resource categories include: Persistent
On-off (Mount)
Resource Dependencies
Resources in a service group have a defined dependency relationship, which determines the online and offline order of the resource.
A parent resource depends on a child resource. There is no limit to the number of parent and child resources. Persistent resources, such as NIC, cannot be parent resources. Dependencies cannot be cyclical.
Parent
Parent/child
Child
Resource Attributes
Resource attributes define an individual resource. The attribute values are used by VCS to manage the resource. Resources can have required and optional attributes, as specified by the resource type definition.
WebMount resource
Solaris
Resource Types
Resources are classified by type. The resource type specifies the attributes needed to define a resource of that type. For example, a Mount resource has different properties than an IP resource.
mount [-F FSType] [options] block_device mount_point
Solaris
offline
monitor IP Mount Disk Group Volume clean
NIC
Cluster Communication
A cluster interconnect provides a communication channel between cluster nodes.
The cluster interconnect serves to:
Determine which systems are members of the cluster using a heartbeat mechanism. Maintain a single view of the status of the cluster configuration on all systems in the cluster membership.
LLT
LLT
GAB
GAB LLT
LLT
HAD
hashadow
Runs on each system in the cluster Maintains configuration and state information for all cluster resources Manages all agents
VCS Architecture
Agents monitor resources on each system and provide status to HAD on the local system. HAD on each system sends status information to GAB. GAB broadcasts configuration information to all cluster members. LLT transports all cluster communications to all cluster nodes. HAD on each node takes corrective action, such as failover, when necessary.
HAD
hashadow
User Processes
iPlanet
TCP IP
NIC
main.cf
main.cf
include "types.cf" cluster vcs ( UserNames = { admin = ElmElgLimHmmKumGlj } Administrators = { admin } CounterInterval = 5 A simple text file is used to ) store the cluster configuration system S1 ( on disk. ) The file contents are described system S2 ( in detail later in the course. ) group WebSG ( SystemList = { S1 = 0, S2 = 1 } ) Mount WebMount ( MountPoint = "/web" BlockDevice = "/dev/vx/dsk/WebDG/WebVol" FSType = vxfs FsckOpt = "-y" )
S1
S2
S3
No Membership: S3
S1
S2
S3
Jeopardy Membership: S3
1 S1 S3
S2
1 Jeopardy membership: S3 Mini-cluster with regular membership: S1, S2 Mini-cluster with regular 2 membership: S3 No Jeopardy membership 3 SGs autodisabled
1 2 S1
S2
S3 3
S2
S3
1 2
S1 and S2 think S3 is faulted. No jeopardy occurs, so no SGs are autodisabled. If all systems are in all SGs SystemList, VCS tries to bring them online on a failover target.
S1
S2
S3
1
2
S1
S2
S3
A faults on S3. C
S1
1
S2 S3
1 2
Network partition Regular membership: S1, S2 SGHB resource faults during online. Disk Regular membership: S3
2 1 S3 3 S2 1 2 3 S3 faults; C started on S1 or S2 Regular Membership: S1, S2 LLT links to S3 disconnected S3 reboots; S3 cannot start HAD because GAB on S3 can only see one member No membership: S3
S1
Active/Passive
Before Failover
After Failover
Active/Passive N-to-1
Before Failover
After Failover
Active/Passive N + 1
After Failover
Before Failover
After Repair
Active/Active
Before Failover
After Failover
N-to-N
Before Failover
After Failover
Example configuration:
hagrp modify groupname AutoStartPolicy Load
AutoStartPolicy=Order
The first available system in AutoStartList is selected.
Animation
AutoStartPolicy=Priority
The lowest numbered system in SystemList is selected.
Animation
AutoStartPolicy=Load
The system with the greatest AvailableCapacity is selected.
Animation
Failover Policies
The FailOverPolicy attribute specifies how a target system is selected: Priority: The system with the lowest priority number in the list is selected (default). RoundRobin: The system with the least number of active service groups is selected. Load: The system with greatest available capacity is selected. Example configuration:
hagrp modify groupname FailOverPolicy Load
FailOverPolicy=Priority
The lowest numbered system in SystemList is selected.
Animation
FailOverPolicy=RoundRobin
The system with the fewest running service groups is selected.
Animation
FailOverPolicy=Load
Animation
Create, model, Graphical user Graphical and test interface interface configurations Runs on UNIX Runs on Cannot be systems with and Windows used to supported systems Web browsers manage a running cluster configuration
Only authorized VCS user accounts have access to VCS administrative interfaces.
Cluster Operator
All cluster, service group, and resource-level operations
Cluster Guest
Read-only access; new users created as Cluster Guest accounts by default
Group Administrator
All service group operations for a specified service group, except deleting service groups
Group Operator
Bring service groups and resources online and take them offline; temporarily freeze or unfreeze service groups
Common Operations
Common service group operations :
Displaying status Bringing service groups online Taking service groups offline Switching service groups Freezing service groups
Freeze
Web Server