Deduplication is a technique used to identify and segregate files that are exact duplicates. Why waste your time, or your team’s, reviewing the same document ten or more times? This becomes even more important when there are multiple copies of the same privileged document.
Each electronic file has a fingerprint, also known as a hash value, that is unique for practical purposes. This hash is generated by applying one of two mathematical algorithms, which produce either a 128-bit or a 160-bit identifier key. These keys are used to determine exact duplicates: two files that produce the same key are treated as identical.
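As a rough illustration of how such a fingerprint can be computed, here is a minimal Python sketch. It assumes the two algorithms are MD5 (128-bit) and SHA-1 (160-bit), which are common choices with those key lengths; the text above does not name them. The function name `file_fingerprint` is hypothetical and used only for this example.

```python
import hashlib


def file_fingerprint(data: bytes, algorithm: str = "md5") -> str:
    """Return the hash value of a file's contents as a hex string.

    Assumption: "md5" (128-bit) and "sha1" (160-bit) stand in for the
    two algorithms; the actual algorithms are not named in the text.
    """
    return hashlib.new(algorithm, data).hexdigest()


# Two files with identical contents produce the same key,
# regardless of file name or location.
original = b"Quarterly report, final version."
copy = b"Quarterly report, final version."
edited = b"Quarterly report, final version!"

print(file_fingerprint(original) == file_fingerprint(copy))    # True
print(file_fingerprint(original) == file_fingerprint(edited))  # False
```

Each hex character encodes 4 bits, so the MD5 key is 32 characters long and the SHA-1 key is 40. Even a one-character change in the file yields a different key, which is why hash matching finds exact duplicates only.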
File duplication can occur within a single custodian’s data (for example, e-mail) or across entire infrastructures of file servers, e-mail servers and other computers. To take these two scenarios into account, we offer two types of deduplication:
• Vertical deduplication locates duplicates within the records and data of a single custodian, and
• Horizontal deduplication applies globally across all custodians.
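The difference between the two approaches can be sketched in a few lines of Python. This is only an illustration under stated assumptions: each record is a hypothetical `(custodian, file name, hash value)` tuple, the hash values are placeholders, and the function name `deduplicate` is invented for this example. It is not a description of any particular processing tool.

```python
def deduplicate(records, scope="horizontal"):
    """Split records into (unique, duplicates), keeping the first copy seen.

    records: iterable of (custodian, file_name, hash_value) tuples
             (a hypothetical layout chosen for this sketch).
    scope:   "vertical"   compares hashes within each custodian only;
             "horizontal" compares hashes across all custodians.
    """
    seen = set()
    unique, duplicates = [], []
    for custodian, file_name, hash_value in records:
        # Vertical: the same hash under a different custodian is a new key.
        key = (custodian, hash_value) if scope == "vertical" else hash_value
        if key in seen:
            duplicates.append((custodian, file_name, hash_value))
        else:
            seen.add(key)
            unique.append((custodian, file_name, hash_value))
    return unique, duplicates


records = [
    ("alice", "memo.doc", "h1"),
    ("alice", "memo_copy.doc", "h1"),
    ("bob", "memo.doc", "h1"),
    ("bob", "notes.txt", "h2"),
]

v_unique, v_dupes = deduplicate(records, scope="vertical")
h_unique, h_dupes = deduplicate(records, scope="horizontal")
print(len(v_unique), len(v_dupes))  # 3 1
print(len(h_unique), len(h_dupes))  # 2 2
```

In this sample, vertical deduplication keeps one copy of the memo for each custodian, so it remains clear who held the document. Horizontal deduplication keeps a single copy overall, which leaves fewer documents to review.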
There are advantages and drawbacks to both vertical and horizontal deduplication. An Electronic Legal project manager will help you determine which is best for your project.