
Module 1: Deduplication

Introduces you to deduplication (outside Data Domain)

Reviews product concepts covered in the Technology and Systems Introduction course

EMC Education Services

Objectives

Describe deduplication

Identify deduplication types

Inline

Post process

File based

Fixed segment

Variable segment

Identify advantages of deduplication types


Definition

Eliminates redundant data

Stores only one instance of data

For example (conceptual, not precise)

Original message: Mary had a little lamb

What gets stored: Mary hd lite mb

Inline vs. Post-Process Deduplication

Inline

Data flow: real-time deduplication -> deduplication disk subsystem

Less hardware required

Less time to disaster recovery

Post-process

Data flow: data cached in temp disk area -> post-process deduplication -> deduplication disk subsystem

More hardware required

More time to disaster recovery
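
The hardware difference between the two approaches can be made concrete with a small sketch. This is an illustration only, not how any particular product works: the function names, the in-memory "disk", and the way peak usage is counted (segments on disk at the worst moment, with the staging copy kept until deduplication finishes) are all assumptions made for the example.

```python
import hashlib

def fingerprint(segment):
    """Identify a segment by the SHA-256 digest of its content."""
    return hashlib.sha256(segment).hexdigest()

def inline_backup(stream):
    """Inline: deduplicate each segment in real time, before it reaches disk."""
    store = {}
    for segment in stream:
        store.setdefault(fingerprint(segment), segment)  # write only if unseen
    peak_segments_on_disk = len(store)
    return store, peak_segments_on_disk

def post_process_backup(stream):
    """Post-process: cache the whole stream in a temp disk area, then deduplicate."""
    staging = list(stream)  # the full, undeduplicated copy lands on disk first
    store = {}
    for segment in staging:
        store.setdefault(fingerprint(segment), segment)
    # In this sketch the staging area is released only after the pass completes,
    # so staging and the deduplicated store coexist at the peak.
    peak_segments_on_disk = len(staging) + len(store)
    return store, peak_segments_on_disk

stream = [b"A", b"B", b"A", b"C", b"A", b"B"]  # hypothetical backup stream
inline_store, inline_peak = inline_backup(stream)
post_store, post_peak = post_process_backup(stream)
print("inline peak:", inline_peak, "| post-process peak:", post_peak)
# prints: inline peak: 3 | post-process peak: 9
```

Both approaches end with the same three unique segments stored. The difference is the temporary disk area that post-processing needs while the deduplication pass is still pending.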

File-Based Deduplication

If two files are exactly alike, one copy is stored and later copies of the file point to the original

Deduplication results are often not as good as with other deduplication methods

Sometimes called single-instance store
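
A minimal sketch of the file-based idea, assuming whole files are compared by a SHA-256 digest of their content. The class name `SingleInstanceStore` and the file names are hypothetical, chosen only for this illustration.

```python
import hashlib

class SingleInstanceStore:
    """Hypothetical file-level deduplicating store (illustration only)."""

    def __init__(self):
        self.blobs = {}     # content digest -> file bytes, stored once
        self.pointers = {}  # file name -> content digest

    def put(self, name, data):
        """Store a file; return True only if its content was not stored before."""
        digest = hashlib.sha256(data).hexdigest()
        is_new = digest not in self.blobs
        if is_new:
            self.blobs[digest] = data
        self.pointers[name] = digest  # duplicates become pointers to the original
        return is_new

    def get(self, name):
        return self.blobs[self.pointers[name]]

store = SingleInstanceStore()
store.put("report_v1.doc", b"Mary had a little lamb")
store.put("report_copy.doc", b"Mary had a little lamb")  # exact duplicate
store.put("report_v2.doc", b"Mary had a little lamb.")   # one byte added
print(len(store.pointers), "files,", len(store.blobs), "stored copies")
# prints: 3 files, 2 stored copies
```

The third file differs from the first by a single byte, yet it is stored in full. This is one reason file-based results tend to trail segment-based methods.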


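Fixed-Segment Deduplication

Data segments before the change: A (~8k), C (~8k), D (~8k), E (~8k), F (~8k)

Segment B added: the data stream moves to make room, so the data is now in a different location

Data segments after the change: A (~8k), B (~8k), C (~8k), D (~8k), E (~8k), F (~8k)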
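
The effect of the shift can be sketched in a few lines. This example makes two assumptions that are not on the slide: segments are 8 bytes instead of ~8k so the output stays readable, and the inserted data ("BBB") is not an exact multiple of the segment size, so every boundary after the insertion point lands on different bytes.

```python
import hashlib

SEGMENT_SIZE = 8  # bytes here; stands in for the ~8k segments on the slide

def fixed_segments(data, size=SEGMENT_SIZE):
    """Cut data into consecutive segments of `size` bytes (the last may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]

def fingerprint(segment):
    return hashlib.sha256(segment).hexdigest()

original = b"AAAAAAAA" b"CCCCCCCC" b"DDDDDDDD" b"EEEEEEEE" b"FFFFFFFF"
# Insert new data after segment A; everything behind it shifts by 3 bytes.
modified = original[:8] + b"BBB" + original[8:]

already_stored = {fingerprint(s) for s in fixed_segments(original)}
new_segments = [s for s in fixed_segments(modified)
                if fingerprint(s) not in already_stored]

print(fixed_segments(modified))
print(len(new_segments), "of", len(fixed_segments(modified)),
      "segments must be stored again")
# last line printed: 5 of 6 segments must be stored again
```

Only segment A still matches. The data in C through F is unchanged, but because it now sits in a different location, the fixed boundaries cut it differently and none of the old fingerprints match.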


Variable Segment Size Deduplication


Data segments: A (~4k), B (~4k), C (~12k), D (~7k), E (~8k), F (~8k)

Segmentation is based on bytes, not files

The data stream is sliced into segments (4-12 KB) with an 8 KB average size
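
One common way to get variable-size segments is content-defined chunking: a boundary is placed wherever a hash of the last few bytes hits a chosen value, so boundaries follow the content instead of fixed offsets. The sketch below is a generic illustration of that idea, not the algorithm of any specific product. The window size, the divisor, the use of CRC-32 as the window hash, and the random test data are all assumptions; the 4 KB minimum and 12 KB maximum echo the range on the slide.

```python
import hashlib
import random
import zlib

def variable_segments(data, min_size=4096, max_size=12288,
                      window=48, divisor=4096):
    """Split data at content-defined boundaries, within min/max size limits."""
    segments = []
    start = 0
    for i in range(len(data)):
        length = i - start + 1
        if length < min_size:
            continue  # never cut a segment shorter than the minimum
        # Hash only the trailing window, so the decision depends on local content.
        window_start = max(0, i - window + 1)
        at_boundary = zlib.crc32(data[window_start:i + 1]) % divisor == 0
        if at_boundary or length >= max_size:
            segments.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        segments.append(data[start:])  # trailing remainder, may be short
    return segments

rng = random.Random(0)
original = bytes(rng.getrandbits(8) for _ in range(128 * 1024))
modified = original[:100] + b"B" + original[100:]  # one byte inserted early

stored = {hashlib.sha256(s).digest() for s in variable_segments(original)}
modified_segments = variable_segments(modified)
new_segments = [s for s in modified_segments
                if hashlib.sha256(s).digest() not in stored]
print(len(new_segments), "of", len(modified_segments), "segments are new")
```

Because boundaries depend on nearby bytes, the segmentation usually falls back into step shortly after an insertion, and later segments match what is already stored. The exact count depends on the data and the parameters, so no expected output is given here.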

Module Review
Deduplication is a technology that improves data storage by eliminating redundant data

File-based, fixed segment, and post processing are among the deduplication types

