BLOG on Litigation Support and eDiscovery Industry

Near-duplicate identification is one of the more common textual analytics tools used in eDiscovery. Not to be confused with document deduplication, which relies on hash values, near-duplicate identification calculates document similarity based off textual content. For example, if you had … Continue reading

Part One of this blog post introduced this basic model workflow and discussed how to implement it in a Relativity review environment. Now, a little bit about the fields, why they’re important, and how we can use them throughout the review … Continue reading