Beruflich Dokumente
Kultur Dokumente
Tivoli Storage Manager for Virtual Environments V6.3 Step by Step Guide To vStorage Backup Server (Proxy) Sizing 22 August 2012 1.0
Page 1 of 18
Revision History
Revision Number 1.0 Revision Date 08/22/12 Summary of Changes Final version Changes marked
Disclaimer
The information contained in this document is distributed on an "as is" basis without any warranty either expressed or implied. This document has been made available as part of IBM developerWorks WIKI, and is hereby governed by the terms of use of the WIKI as defined at the following location: http://www.ibm.com/developerworks/tivoli/community/disclaimer.html Throughput numbers contained in this document are intended to be used for estimation of proxy host sizing. Actual results are environment and configuration dependent and may vary significantly. Users of this document should verify the applicable data for their specific environment.
Page 2 of 18
Contents
Contents .......................................................................................................................... 3 1.
1.1
Introduction ............................................................................................................. 4
Overview ................................................................................................................................. 5 Performance ............................................................................................................. 5 Periodic Full Backup ................................................................................................. 6 1.1.1 1.1.2
1.2 1.3
A Few Definitions .................................................................................................................... 7 Scope of this document .......................................................................................................... 7 External Dependencies and Assumptions ................................................................ 7 Performance optimization and bottleneck analysis .................................................. 7 Proxy Hardware Configuration .................................................................................. 7
2.
2.1 2.2 2.3
Architectural Considerations ................................................................................................. 12 Additional capacity requirements ............................................................................ 12 Physical or virtual proxy? ........................................................................................ 12 Rotating Fulls backups by ESX Host ................................................................... 14 Alternate Scheduling Methods ................................................................................ 14
2.5.1 2.5.2
3. 4.
4.1
Page 3 of 18
1. Introduction
Tivoli Storage Manager for Virtual Environments (TSM-VE) is a feature of the Tivoli Storage Manager product family for backing up virtual machines in a VSphere (VMWare) environment. Tivoli Storage Manager for Virtual Environments uses the latest backup technology provided by VMWare, called VStorage API (also known as VADP or VStorage API for Data Protection). An essential component of Tivoli Storage Manager for Virtual Environments is the VStorage Backup Server which performs the data transfer from the data stores that contain the virtual machine data to the Tivoli Storage Manager server. The VStorage Backup Server offloads the backup workload from the ESX server and acts as a proxy for a backup. Throughout this document, the VStorage Backup Server will be referred to as the "proxy ". A proxy that is configured on a virtual machine is referred to as a virtual proxy, and if configured on a physical machine is referred to as a physical proxy. When you consider a backup solution using Tivoli Storage Manager for Virtual Environments, one of the frequently asked questions is how to estimate the number of proxies required for a specific environment. This paper guides through the estimation process. The following diagram provides a simplified, high level overview of the components involved with TSM-VE image backup and restore:
Page 4 of 18
1.1
Overview
The proxy estimation is intended to help you plan a deployment of Tivoli Storage Manager for Virtual Environments. A recommended approach is described. However, there are many variations depending upon customer preferences, infrastructure capabilities, and other factors. Different vendors use various approaches to determine the number of proxies required, and may be constrained by product design. Tivoli Storage Manager provides flexibility for deploying the proxies and selecting virtual, physical, or a combination of both proxies. The intent is to provide a starting point for initial estimation and solution architecture. The proxy estimation process comprises the following steps: Define how the backups are scheduled. Estimate the number of proxies required. Decide whether to use virtual machine proxies, physical proxies, or a combination of both.
1.1.1 Performance
Estimating the number of proxies requires some assumptions about the performance characteristics of individual backup processes. Tivoli Storage Manager for Virtual Environments uses efficient disk block-level I/O for the backup process, and the backup process itself adds minimal overhead. Backup performance is determined the following system characteristics:
I/O capabilities of the data store storage arrays Back-end storage device used by the Tivoli Storage Manager server, for example, Virtual Tape Library (VTL) or disk Infrastructure connectivity, for example, Storage Area Network (SAN) or Local Area Network (LAN) bandwidth
It is recommended that you use benchmarking to refine the estimate of backup throughput specific to your environment. The throughput capabilities can range significantly depending upon the environment. Observed throughputs have ranged from 40GB/Hour to well over 200GB/hour for a single, individual backup processes.
1.1.1.1 Deduplication
Tivoli Storage Manager client side (inline) deduplication is highly effective with Tivoli Storage Manager for Virtual Environments and can substantially reduce back-end storage requirements as well as the proxy to Tivoli Storage Manager server bandwidth requirements. Client side deduplication requires additional processing (by the proxy host) that will slow the backup throughput. For a specific amount of data to backup, you may require more proxies to meet a given backup window when using deduplication as compared with not using deduplication. Generally the benefits of storage and bandwidth reduction will outweigh the cost of additional instances of proxies. For estimation purposes, you can assume that backup throughput when you use client deduplication is approximately 50% of the throughput without deduplication. As an alternative to using client side (inline) deduplication, TSM server-side (post-process) deduplication may be used if backup throughput requirements are the highest priority, and proxy to TSM server bandwidth is not constrained.
Page 5 of 18
HOTADD
Yes
No
Data I/O from Proxy to Tivoli Storage Manager server Communication Method LAN LAN-free Available to Virtual Proxy? Yes No Available to Physical Proxy? Yes Yes Comments Data transfers over LAN to Tivoli Storage Manager server Data transfers over SAN to Tivoli Storage Manager server storage pool devices (Tape or Virtual Tape) Note: LAN-free with disk is possible using SANergy or GFPFS.
Page 6 of 18
1.2
Term Proxy
A Few Definitions
Definition The host that performs the offloaded backup. This host can be a virtual or physical machine. Also called VStor Backup Server (VBS), Backup Server (BUS). The Tivoli Storage Manager Backup/Archive Client is installed on this host and provides the VMWare backup function. An individual backup process that performs the VMWare guest backups. Each datamover is associated with one or more Tivoli Storage Manager backup schedules. Typically there will be multiple datamovers per proxy to fully utilize the proxy host resources. Also called backup process.
Datamover
1.3
This document is intended to provide a first order estimation of proxy hosts required for a specific Tivoli Storage Manager for Virtual Environments backup environment. Using these guidelines can help to provide a successful deployment by establishing a quantitative basis for determining the quantity, placement, and sizing of the proxy hosts. There are many assumptions made within this document and actual results can vary significantly depending upon the environment and infrastructure characteristics. Careful evaluation of the environment is necessary and benchmarking during the planning phase is strongly encouraged to characterize the capabilities of the environment.
Page 7 of 18
Page 8 of 18
2.
This section provides the steps for sizing a proxy for a specific deployment scenario across an entire data center. This is a generalized approach, and the same method may be applied to individual environments that differ significantly from one another.
2.1
Assumptions
Reasonably equal distribution (within 20%) of utilized virtual machine storage capacity (data stores) across all ESX hosts. Backups are scheduled on a per ESX host basis. See scheduling section for more information. Schedule a full backup weekly and an incremental backup 6 days a week. This means that on any th th day, 1/7 of the ESX hosts will have a full backup, and 6/7 of the ESX hosts will have an incremental backup. Since we assume an even distribution of storage and VMs across all ESX th hosts, this means that 1/7 of the total amount of data is backed up daily (via the full backups) and the remainder of the data is backed up incrementally. Example environment: o o o Total number of virtual machines: 5000 Average used storage per virtual machine: 50GB Total
2.2
Example environment
Environment Description
Total Number of virtual machines Average Utilized Storage per VM Total Utilized Storage Number of ESX Hosts Number of DRS Clusters Backup Window Assumed daily change rate 5000 * 50GB
2.3
Page 9 of 18
35,700GB
4,300GB
40,100GB
Page 10 of 18
2.3.5 Summary
We have finished crunching through the numbers to estimate the number of proxies required for a TSM-VE deployment. This gives us a good starting point, but now we need to think more about the architecture of the overall solution to determine if any adjustments are necessary. We will cover this in the next section. Here is a table that summarizes all of the steps up to this point:
Environment Description
Total Number of virtual machines Average Utilized Storage per VM Total Utilized Storage Number of ESX Hosts Number of DRS Clusters Backup Window Assumed daily change rate 5000 * 50GB 5000 50GB 250,000 GB 250 50 10 Hours 2%
35,700GB
4,300GB
Page 11 of 18
2.4
Architectural Considerations
Now that we have an estimate of the number of proxies required to achieve the daily backup workload, we must now consider whether this makes sense in practical terms. TSM-VE provides a great deal of flexibility in deployment options, so we need to determine which options makes the most sense. We will consider other factors and determine if adjustments are required.
Page 12 of 18
2.4.2.1 Questions to ask when you decide between a physical and a virtual proxy
Following is a list of questions you should consider when deciding between physical and virtual machines showing which type of proxy would be preferred in each case depending upon a yes or no answer:
Question Do you require backup traffic to flow over the SAN as much as possible? *Note: Virtual machine proxies can take advantage of Hotadd data transfers from a SAN data store to the proxy which primarily uses SAN I/O via the ESX host HBA. However, a virtual machine proxy cannot take advantage of LAN-free data transfers from the proxy to the Tivoli Storage Manager server. Does your LAN (IP Network) have sufficient bandwidth to accommodate the backup traffic. Do you want to use LAN-free data transfers from the proxy to the Tivoli Storage Manager server? Note: LAN-free is usually only used with Tape or Virtual Tape backup storage devices. Do you prefer or require that all new hosts are virtual and not physical machines? Do you want to minimize the number of proxy hosts? Note: The preference is based on the assumption that you will dedicate more resources to a physical proxy than a virtual proxy. Do you use NFS attached data stores? Is 10GB Ethernet connectivity available to the Tivoli Storage Manager server?
Yes Physical*
No Virtual
Virtual Physical
Physical Virtual
Virtual Physical
Either Virtual
Virtual Virtual
Either Either
Page 13 of 18
2.5
Scheduling of Backups
The estimation technique described in this document is based on distributing the periodic full backup throughout the backup cycle. For example, with a seven day backup schedule, on any day in the week 1/7th of the VMs will have a full backup scheduled and the remaining 6/7th will have an incremental backup. This distribution can be accomplished through various methods, two of which are described below (Rotating Fulls by ESX Host and Rotating Fulls by VM). Regardless of scheduling method, the estimation technique described in this document can still be applied, by adjusting the daily backup workload appropriately. As with any backup technique, backup results should be monitored regularly to ensure that all VMs are backed up according to business requirements.
Page 14 of 18
Tivoli Storage Manager for Virtual Environments Guide to Proxy Sizing 2.5.2.2 Rotating Fulls by VM
This method will require backup scheduling by individual VM and can include concurrent backup of multiple VMs on the same ESX host. Although it is possible to do this with the TSM scheduler, it is usually not practical for large environments. Custom scripting methods can be used, via VMWares PowerCLI to obtain VM lists and drive TSM backup commands (dsmc backup vm) via a command line. IBM Tivoli Lab Services provides an offering for custom backup scheduling based on specific customer criteria (such as balancing load within clusters, between data stores, etc.). The same proxy sizing method described in this document can be used for the Rotating Fulls by VM.
Page 15 of 18
3.
Your Estimate
Environment Description
You can use this table to provide your proxy sizing estimate, using the example as a guideline:
Total Number of virtual machines Average Utilized Storage per VM Total Utilized Storage Number of ESX Hosts Number of DRS Clusters Backup Window Assumed daily change rate _____ * ___ GB # of VMs * Avg. Storage Per VM
_______GB
_________GB
______GB
Page 16 of 18
4.
Resource requirements for a proxy are driven by the following key factors:
Of the two factors, I/O capacity is the most important factor because the proxys main role is to move data. When you use client deduplication, the CPU resources may become the constraint for throughput.
4.1
Page 17 of 18
END OF DOCUMENT
Page 18 of 18