Professional Documents
Culture Documents
Enabling software
deduplication
across the
enterprise with
HPData Protector
Table of contents
Incompatible solutions
78%
HP StoreOnce video
Deduplication explained
Data deduplication compares chunks of information to
detect duplicates and stores each unique data segment
only once. For that to happen, a deduplication engine
assigns a unique identifier to each chunk of data using
mathematical hash functions. Once it has identified two
chunks of data as identical, the system will replace the
duplicate with a link to the original chunk.
There are two architectural approaches to chunking. A
fixed deduplication algorithm breaks data into blocks of a
fixed size. Variable chunking groups the data into blocks
based on patterns in the data itself. The advantage of
variable chunking is that it can recognize duplicates when
small changes have occurred and merely shifted the
data from one backup to the next. Variable chunking, the
technique most commonly used today, leads on average
to 20:1 deduplication ratios, or higher.
Deduplication involves a combination of three elements:
the deduplication engine, the deduplication store, and
backup agents.
The deduplication engine is where the majority of
processing takes place. It manages the logic and
processing of the backup stream by calculating segments
and hash values, identifying unique and repeated
segments, and maintaining the hash lookup table.
The deduplication store is the disk storage location
managed by the deduplication engine. It stores the unique
(deduplicated) segments, and is often physically coupled
with the deduplication engine.
Deduplication-enabled backup agents (for example, media
agents, disk agents, and application agents) manage some
of the deduplication processes. Agents can be deployed
separately from the deduplication engine to offload some
of the performance impact. Agents can perform tasks
such as segmenting the data, calculating the hash value
of segments, and sending new data to the engine and the
store. The deduplication agent talks to the deduplication
engine to calculate which segments are unique.
Figure 4. HP StoreOnce deduplication engine can be deployed at an application source, a backup server, or a target appliance such as B6200 appliance or
HPData Protector software store
Figure 7. Backup server deduplication minimizes impact on application performance and maximizes performance of the target device
Figure 8. Deduplication at the target device is easy to deploy and provides maximum storage optimization
Figure 9. Deduplication at the target device is easy to deploy and provides maximum storage optimization
Conclusion
HP Data Protector is the first and only solution in the market that
supports deduplication at any locationapplication source, backup
server, and target deviceacross an enterprise. Powered by the
industrys most advanced deduplication engine HP StoreOnce,
HP Data Protectors federated deduplication capability provides
the unique ability to deduplicate data only once across multiple
locations. HP StoreOnce technology provides a common architecture
across software and hardware, at remote sites and in the data center
enabling deduplicated data movement from edge to core without
having to rehydrate at multiple deployments. HP Data Protector
software central management combined with StoreOnce allows
organizations to maximize their critical storage resources through
the most efficient deduplication available, while meeting stringent
business SLAs and minimizing backup infrastructure related costs.
Solve the challenges of traditional deduplication with HP Data
Protector software, visit hp.com/go/dataprotector.
Get connected
hp.com/go/getconnected
Get the insider view on tech trends,
support alerts, and HP solutions
Copyright 20112012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and
services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors
or omissions contained herein.
Microsoft and Windows are U.S. registered trademarks of Microsoft Corporation. Oracle is a registered trademark of Oracle and/or its affiliates.
UNIX is a registered trademark of The Open Group.
4AA3-8728ENW, Created December 2011; Updated June 2012, Rev. 3