12 September 2012
1.1
Tivoli Storage Manager for Virtual Environments Guide to Proxy Sizing
Revision History
Revision Number   Revision Date   Summary of Changes                                                  Changes marked
1.0               08/22/12        Final version
1.1               09/12/12        Formatting changes and typos; added “constraint checking” section
Disclaimer
The information contained in this document is distributed on an "as is" basis without any warranty either
expressed or implied.
This document has been made available as part of IBM developerWorks WIKI, and is hereby governed by the
terms of use of the WIKI as defined at the following location:
http://www.ibm.com/developerworks/tivoli/community/disclaimer.html
Throughput numbers contained in this document are intended to be used for estimation of proxy host sizing.
Actual results are environment and configuration dependent and may vary significantly. Users of this
document should verify the applicable data for their specific environment.
Contents
1. Introduction
    1.1 Overview
        1.1.1 Performance
        1.1.2 Periodic Full Backup
    1.2 A Few Definitions
    1.3 Scope of this document
        1.3.1 External Dependencies and Assumptions
        1.3.2 Performance optimization and bottleneck analysis
        1.3.3 Proxy Hardware Configuration
    1.4 Scheduling of Backups
        1.4.1 “Rotating Fulls” backups by ESX Host
        1.4.2 Alternate Scheduling Methods
2. Step by Step Proxy Sizing
    2.1 Assumptions
    2.2 Example environment
    2.3 Perform the Estimate
        2.3.1 Determine daily backup workload
        2.3.2 Calculate Aggregate Throughput Requirement
        2.3.3 Calculate the number of concurrent datamovers (backup processes)
        2.3.4 Determine the number of proxy hosts required
        2.3.5 Summary
    2.4 Constraint Checking and Architectural Considerations
        2.4.1 Check for Constraints
        2.4.2 Additional capacity requirements
        2.4.3 Physical or virtual proxy?
3. Your Estimate
4. Proxy Host Resource Requirements
    4.1 Determining proxy resource requirements
        4.1.1 Determining I/O resource requirements
        4.1.2 Determining CPU requirements
        4.1.3 Memory estimation
1. Introduction
Tivoli Storage Manager for Virtual Environments (TSM-VE) is a feature of the Tivoli Storage Manager product
family for backing up virtual machines in a vSphere (VMware) environment. Tivoli Storage Manager for
Virtual Environments uses the latest backup technology provided by VMware, called VStorage API (also
known as “VADP” or “VStorage APIs for Data Protection”).
An essential component of Tivoli Storage Manager for Virtual Environments is the VStorage Backup Server
which performs the data transfer from the ESX datastores that contain the virtual machine data to the Tivoli
Storage Manager server. The VStorage Backup Server offloads the backup workload from the ESX server
and acts as a proxy for a backup. Throughout this document, the VStorage Backup Server will be referred to
as the “proxy”. A proxy that is configured on a virtual machine is referred to as a “virtual proxy”, and a
proxy that is configured on a physical machine is referred to as a “physical proxy”.
When you consider a backup solution using Tivoli Storage Manager for Virtual Environments, one of the
frequently asked questions is how to estimate the number of proxies required for a specific environment. This
paper guides you through the estimation process.
The following diagram provides a simplified, high level overview of the components involved with TSM-VE
image backup and restore:
1.1 Overview
The proxy estimation method described in this document is intended to help you plan a deployment of Tivoli
Storage Manager for Virtual Environments. A recommended approach is described. However, there are
many variations depending upon customer preferences, infrastructure capabilities, and other factors.
Different vendors use various approaches to determine the number of proxies required, and may be
constrained by product design. Tivoli Storage Manager provides flexibility in deploying proxies: you can
use virtual proxies, physical proxies, or a combination of both. The intent of this document is to provide a
starting point for initial estimation and solution architecture.
The proxy estimation process comprises the following steps:
• Define how the backups are scheduled.
• Decide whether to use virtual machine proxies, physical proxies, or a combination of both.
• Estimate the number of proxies required.
• Check for any constraints in the environment based on the assumptions used in the estimate.
1.1.1 Performance
Estimating the number of proxies requires some assumptions about the performance characteristics of
individual backup processes. Tivoli Storage Manager for Virtual Environments uses efficient disk block-level
I/O for the backup process, and the backup process itself consumes minimal CPU and memory resources.
Backup performance is determined primarily by the following system characteristics:
• I/O capabilities of the datastore storage arrays
• Back-end storage device used by the Tivoli Storage Manager server, for example, Virtual Tape Library
(VTL) or disk
• Infrastructure connectivity, for example, Storage Area Network (SAN) or Local Area Network (LAN)
bandwidth
It is recommended that you use benchmarking to refine the estimate of backup throughput specific to your
environment.
Throughput can vary significantly depending on the environment. Observed throughput for a single backup
process has ranged from 40 GB/hour to well over 200 GB/hour.
1.1.1.1 Deduplication
Tivoli Storage Manager client side (inline) deduplication is highly effective with Tivoli Storage Manager for
Virtual Environments and can substantially reduce back-end storage requirements as well as the proxy to
Tivoli Storage Manager server bandwidth requirements. Client side deduplication requires additional
processing (by the proxy host) that reduces backup throughput. For a given amount of data to back up,
you may require more proxies to meet a given backup window when using deduplication than when
not using it. Generally the benefits of storage and bandwidth reduction will outweigh the cost of
additional instances of proxies. For estimation purposes, you can assume that backup throughput when you
use client deduplication is approximately 50% of the throughput without deduplication.
As an alternative to using client side (inline) deduplication, TSM server-side (post-process) deduplication may
be used if backup throughput requirements are the highest priority, and proxy to TSM server bandwidth is not
constrained.
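The effect of the 50% assumption on sizing can be sketched in Python. This is a minimal illustration, not part of TSM: the function names are hypothetical, the 100 GB/hour baseline is the per-process estimate used in the worksheet later in this document, the 4,000 GB/hour requirement is an arbitrary round figure, and rounding up to a whole datamover is an assumption.

```python
import math

# Assumption from the text: client-side deduplication roughly halves
# per-process backup throughput. Refine this after benchmarking.
DEDUP_THROUGHPUT_FACTOR = 0.5


def per_process_throughput(base_gb_per_hour: float, client_dedup: bool) -> float:
    """Estimated throughput of one backup process, in GB/hour."""
    if client_dedup:
        return base_gb_per_hour * DEDUP_THROUGHPUT_FACTOR
    return base_gb_per_hour


def datamovers_needed(aggregate_gb_per_hour: float, base_gb_per_hour: float,
                      client_dedup: bool) -> int:
    """Concurrent backup processes needed to sustain the aggregate throughput."""
    rate = per_process_throughput(base_gb_per_hour, client_dedup)
    # Round up: a fractional backup process cannot be scheduled.
    return math.ceil(aggregate_gb_per_hour / rate)


if __name__ == "__main__":
    # Hypothetical requirement of 4,000 GB/hour with a 100 GB/hour baseline.
    for dedup in (False, True):
        print(f"client dedup={dedup}: {datamovers_needed(4000, 100, dedup)} datamovers")
```

Under these assumptions, enabling client-side deduplication doubles the number of concurrent backup processes needed for the same backup window, which is the trade-off against reduced storage and bandwidth described above.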
The available transport methods are listed here for reference. Refer to the TSM user documentation for
details on each method and how to configure it.
Data I/O from ESX datastore to Proxy

Transport Method   Available to Virtual Proxy?   Available to Physical Proxy?   Comments
NBD                Yes                           Yes                            Uses LAN connection
NBDSSL             Yes                           Yes                            Uses LAN connection
SAN                No                            Yes                            Uses direct SAN connection to datastore (for
                                                                                SAN-attached datastores only).
HOTADD             Yes                           No                             Uses SAN connection (via ESX host) for
                                                                                SAN-attached volumes, which is nearly as
                                                                                efficient as the “SAN” transport. For NFS
                                                                                datastores, provides more efficient transport
                                                                                than NBD.
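For planning scripts, the availability columns of the table above can be captured as a small lookup. This is only a sketch: the dictionary and function names are hypothetical, and the data is transcribed from the table rather than queried from TSM or vSphere.

```python
# Transport availability transcribed from the table above.
# Keys are transport names; values are the proxy types that can use them.
TRANSPORT_AVAILABILITY = {
    "NBD": {"virtual", "physical"},
    "NBDSSL": {"virtual", "physical"},
    "SAN": {"physical"},
    "HOTADD": {"virtual"},
}


def available_transports(proxy_type: str) -> list[str]:
    """Return the transports usable by a 'virtual' or 'physical' proxy."""
    if proxy_type not in ("virtual", "physical"):
        raise ValueError(f"unknown proxy type: {proxy_type!r}")
    return [name for name, types in TRANSPORT_AVAILABILITY.items()
            if proxy_type in types]


if __name__ == "__main__":
    for kind in ("virtual", "physical"):
        print(kind, available_transports(kind))
```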
Term        Definition
Proxy       The host that performs the offloaded backup. This host can be a virtual or physical
            machine. Also called “VStor Backup Server” (VBS) or “Backup Server” (BUS). The Tivoli
            Storage Manager Backup/Archive Client is installed on this host and provides the
            VMware backup function.
Datamover   An individual backup process that performs the VMware guest backups. Each
            datamover is associated with one or more Tivoli Storage Manager backup schedules.
            Typically there will be multiple datamovers per proxy to fully utilize the proxy host
            resources. Also called “backup process”.
help to provide a successful deployment by establishing a quantitative basis for determining the quantity,
placement, and sizing of the proxy hosts. There are many assumptions made within this document and actual
results can vary significantly depending upon the environment and infrastructure characteristics. Careful
evaluation of the environment is necessary and benchmarking during the planning phase is strongly
encouraged to characterize the capabilities of the environment.
As with any backup technique, backup results should be monitored regularly to ensure that all VMs are
backed up according to business requirements.
2.1 Assumptions
• Reasonably equal distribution (within 20%) of utilized virtual machine storage capacity (datastores)
  across all ESX hosts.
• Backups are scheduled on a per ESX host basis. See scheduling section for more information.
• Full backup is scheduled weekly and an incremental backup is scheduled 6 days a week. This means
  that on any day, 1/7th of the ESX hosts will have a full backup, and 6/7th of the ESX hosts will have an
  incremental backup. Since we assume an even distribution of storage and VMs across all ESX
  hosts, this means that 1/7th of the total amount of data is backed up daily (via the full backups) and
  the remainder of the data is backed up incrementally.
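The scheduling assumption translates directly into a daily workload calculation. The following Python sketch is illustrative only: the function name is hypothetical, and the `full_interval_days` parameter generalizes the weekly full backup assumed here to an arbitrary interval. The sample inputs (250,000 GB of utilized storage and a 2% daily change rate) are taken from the example environment summarized in section 2.3.5.

```python
def daily_backup_workload(total_utilized_gb: float, daily_change_rate: float,
                          full_interval_days: int = 7) -> tuple[float, float]:
    """Return (full_gb_per_day, incremental_gb_per_day).

    Each day 1/N of the data gets a full backup, and the remaining (N-1)/N
    gets an incremental backup of only the changed fraction.
    """
    if full_interval_days < 1:
        raise ValueError("full_interval_days must be at least 1")
    full_fraction = 1 / full_interval_days
    full_gb = total_utilized_gb * full_fraction
    incremental_gb = total_utilized_gb * (1 - full_fraction) * daily_change_rate
    return full_gb, incremental_gb


if __name__ == "__main__":
    # Example environment: 250,000 GB utilized storage, 2% daily change rate.
    full_gb, incr_gb = daily_backup_workload(250_000, 0.02)
    print(f"full: {full_gb:,.0f} GB/day, incremental: {incr_gb:,.0f} GB/day")
```

With the example inputs, the full-backup share works out to roughly 35,700 GB per day and the incremental share to roughly 4,300 GB per day.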
2.3.5 Summary
We have finished crunching through the numbers to estimate the number of proxies required for a TSM-VE
deployment. This gives us a good starting point, but now we need to think more about the architecture of the
overall solution to determine if any adjustments are necessary. We will cover this in the next section. Here is
a table that summarizes all of the steps up to this point:
Environment Description
Total Number of virtual machines                                               5000
Average Utilized Storage per VM                                                50GB
Total Utilized Storage                    5000 * 50GB                          250,000 GB
Number of ESX Hosts                                                            250
Number of DRS Clusters                                                         50
Backup Window                                                                  10 Hours
Assumed daily change rate                                                      2%

Determine Daily Backup Workload
Daily Backup Workload from Full           250,000GB ÷ 7                        35,700GB/Day
Backups                                   NOTE: periodic full every 7 days
Daily Backup Workload from                (6 ÷ 7) * 250,000 * 0.02             4,300GB/Day
Incremental Backups                       NOTE: 6/7th of total data
                                          multiplied by incremental change
                                          rate of 2%
Total Daily Backup Workload               35,700 + 4,300 GB                    40,000GB/Day

Calculate Aggregate Throughput Requirement
Constraint: Proxy I/O throughput. Can each individual proxy sustain the required I/O throughput?
Validation: Ensure that the proxy can be configured with sufficient adapter cards (NICs and HBAs) to
support the required throughput.

Constraint: TSM server throughput. Can the TSM server support the required aggregate I/O throughput for
all of the proxies?
Validation: Ensure that the TSM server is configured to support the required aggregate throughput.
Multiple TSM servers may be required in some cases.

Constraint: TSM server sessions. Can the TSM server support the required number of concurrent backup
sessions from all of the datamover processes?
Validation: Ensure that the TSM server is configured to support the required number of concurrent backup
sessions.

Constraint: Infrastructure bandwidth. Can the LAN or SAN accommodate the aggregate workload required
for all of the backup processes?
Validation: Ensure that the LAN and SAN networks have sufficient bandwidth to accommodate the backup
(and restore) bandwidth requirements.

Constraint: Datastore I/O capacity. Can the ESX datastore accommodate the I/O required to support the
required backup throughput?
Validation: Ensure that the datastore I/O devices are capable of supporting the I/O data transfer rates
required for the backup processes. The assumption for the per-process backup throughput may need to be
adjusted.

Constraint: Per-ESX host backup window. Will an ESX host have an excessive number of VMs or storage
capacity that cannot be backed up within the required window?
Validation: Determine if it is possible for any one ESX host to have an excessive number of VMs and
storage that will result in not achieving the backup window. This constraint applies when schedules are
created based on ESX hosts. Since VMs are backed up serially for each ESX host, it is possible to
exceed the backup window if the total storage capacity for an ESX host significantly exceeds the
average.
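The per-ESX host backup window constraint lends itself to a quick check. The Python sketch below is a rough illustration under stated assumptions: the function and host names are hypothetical, it assumes one datamover per ESX host backing up that host's VMs serially at a fixed per-process throughput, and it models the worst case of a full backup of the host's utilized storage.

```python
def hosts_exceeding_window(host_storage_gb: dict[str, float],
                           per_process_gb_per_hour: float,
                           backup_window_hours: float) -> dict[str, float]:
    """Return {host: estimated_hours} for hosts whose full backup overruns.

    Assumes one datamover per ESX host backing up its VMs serially, so the
    estimated duration is utilized storage / per-process throughput.
    """
    overruns = {}
    for host, storage_gb in host_storage_gb.items():
        hours = storage_gb / per_process_gb_per_hour
        if hours > backup_window_hours:
            overruns[host] = hours
    return overruns


if __name__ == "__main__":
    # Hypothetical hosts; 1,000 GB matches the example average
    # (250,000 GB across 250 ESX hosts).
    hosts = {"esx01": 1000, "esx02": 800, "esx03": 1600}
    print(hosts_exceeding_window(hosts, per_process_gb_per_hour=100,
                                 backup_window_hours=10))
```

In this sketch a host at the example average just fits a 10-hour window at 100 GB/hour, while a host holding 60% more storage than average does not, which is the situation the constraint is meant to flag.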
2.4.3.1 Questions to ask when you decide between a physical and a virtual proxy
The following questions can help you decide between physical and virtual proxies. For each question, if
your answer is “Yes”, give preference to the type of proxy in the “Yes” column. If your answer is “No”, give
preference to the type of proxy in the “No” column.
Question                                                                                           Yes         No
Do you require backup traffic to flow over the SAN as much as possible?                            Physical*   Virtual
    *Note: Virtual machine proxies can take advantage of HOTADD data transfers from
    a SAN datastore to the proxy, which primarily use SAN I/O via the ESX host
    HBA. However, a virtual machine proxy cannot take advantage of LAN-free data
    transfers from the proxy to the Tivoli Storage Manager server.
Does your LAN (IP network) have sufficient bandwidth to accommodate the backup traffic?            Virtual     Physical
Do you want to use LAN-free data transfers from the proxy to the Tivoli Storage Manager server?    Physical    Virtual
    Note: LAN-free is usually only used with Tape or Virtual Tape backup storage
    devices.
Do you prefer or require that all new hosts are virtual and not physical machines?                 Virtual     Either
Do you want to minimize the number of proxy hosts?                                                 Physical    Virtual
    Note: The preference is based on the assumption that you will dedicate more
    resources to a physical proxy than a virtual proxy.
Do you use NFS attached datastores?                                                                Virtual     Either
Is 10Gbit Ethernet connectivity available to the Tivoli Storage Manager server?                    Virtual     Either
3. Your Estimate
You can use this table to record your proxy sizing estimate, using the example as a guideline:
Environment Description
Total Number of virtual machines                                               ___________
Average Utilized Storage per VM                                                ________GB
Total Utilized Storage                    _____ * ___ GB                       _______ GB
                                          # of VMs * Avg. Storage Per VM
Number of ESX Hosts                                                            _____
Number of DRS Clusters                                                         ___
Backup Window                                                                  ___ Hours
Assumed daily change rate                                                      __%

Determine Daily Backup Workload
Daily Backup Workload from Full           ______GB ÷ 7                         _______GB
Backups                                   NOTE: periodic full every 7 days
Daily Backup Workload from                (6 ÷ 7) * ______ * 0.0_              _________GB
Incremental Backups                       NOTE: 6/7th of total data
                                          multiplied by incremental change
                                          rate of __%
Total Daily Backup Workload               _______ + ______ GB                  ______GB

Calculate Aggregate Throughput Requirement
Total Daily Backup Workload               (from previous calculation)          ______ GB
Backup Window                                                                  ______ hours
Aggregate throughput Required             ______ GB ÷ ___ hours                ______ GB/Hour

Calculate Number of Concurrent Datamovers
Aggregate throughput Required             ______ GB ÷ ___ hours                ______ GB/Hour
Per Process Throughput Estimate                                                100GB/Hour
Number of datamovers (backup              ______ GB/Hour ÷ 100GB/Hour          ______
processes)

Determine Number of Proxy Hosts Required
Number of datamovers (backup              ______ GB/Hour ÷ 100GB/Hour          ______
processes)
Number of concurrent datamovers                                                10
per proxy
Number of proxy hosts required            ______ ÷ 10                          ______
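The last three steps of the worksheet above can also be worked through in a few lines of Python. This is a sketch rather than an official sizing tool: the class and function names are hypothetical, the defaults of 100 GB/hour per process and 10 datamovers per proxy are the worksheet's estimates, and rounding up to whole datamovers and whole proxies is an assumption. The sample workload of 40,000 GB per day is a round figure chosen for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class ProxyEstimate:
    aggregate_gb_per_hour: float
    datamovers: int
    proxy_hosts: int


def estimate_proxies(daily_workload_gb: float, backup_window_hours: float,
                     per_process_gb_per_hour: float = 100,
                     datamovers_per_proxy: int = 10) -> ProxyEstimate:
    """Follow the worksheet: workload -> throughput -> datamovers -> proxies."""
    aggregate = daily_workload_gb / backup_window_hours
    # Rounding up is an assumption: a fractional datamover or proxy
    # cannot be deployed.
    datamovers = math.ceil(aggregate / per_process_gb_per_hour)
    proxies = math.ceil(datamovers / datamovers_per_proxy)
    return ProxyEstimate(aggregate, datamovers, proxies)


if __name__ == "__main__":
    # Hypothetical round-number workload of 40,000 GB/day in a 10-hour window.
    print(estimate_proxies(daily_workload_gb=40_000, backup_window_hours=10))
```

Keeping the per-process throughput and the datamovers-per-proxy count as parameters makes it easy to rerun the estimate once benchmarking has replaced the default assumptions with measured values.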
(for example, Windows 2008R2). A minimum of 4GB of RAM should be considered when running four
concurrent backup processes, with an additional 1 GB for each additional backup process.
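The memory guideline can be expressed as a one-line rule. In this Python sketch the function name is hypothetical, and treating 4 GB as the floor for proxies that run fewer than four concurrent backup processes is an assumption, since the text only states the minimum for four.

```python
def proxy_memory_gb(concurrent_backup_processes: int) -> int:
    """Estimate proxy RAM in GB: 4 GB covers up to four backup processes,
    plus 1 GB for each process beyond four."""
    if concurrent_backup_processes < 0:
        raise ValueError("process count cannot be negative")
    return 4 + max(0, concurrent_backup_processes - 4)


if __name__ == "__main__":
    for n in (4, 10):
        print(f"{n} backup processes: {proxy_memory_gb(n)} GB RAM")
```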
END OF DOCUMENT