This whitepaper, written by Daegis’ Director of Technology, Doug Stewart, was presented to the Fourth DESI Workshop on Setting Standards for Electronically Stored Information in Discovery Proceedings on April 20, 2011.
eDiscovery thought leadership organizations advocate the use of sampling throughout much of the eDiscovery process. Judging from the numerous references to “sampling” in eDiscovery literature, these techniques also appear to be widely accepted as a way to validate eDiscovery efforts. At the same time, questions and concerns linger about whether random sampling techniques are appropriate for eDiscovery data sets. This paper offers evidence that random sampling of eDiscovery data sets yields results consistent with well-established statistical principles. It shows that Simple Random Sampling (SRS) can be used to make accurate predictions about the composition of eDiscovery data sets and thus to validate eDiscovery processes.
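As a rough illustration of the idea, and not the paper's own method or data, the following Python sketch draws a simple random sample from a hypothetical document collection and estimates the proportion of responsive documents with a normal-approximation margin of error. The collection size, the 12% responsive rate, the sample size, and the function name `estimate_proportion` are all assumptions made for the example.

```python
import math
import random


def estimate_proportion(population, sample_size, seed=0, z=1.96):
    """Estimate the share of True items in `population` from a simple random sample."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    # Simple random sampling: every subset of `sample_size` items is equally likely.
    sample = rng.sample(population, sample_size)
    p_hat = sum(sample) / sample_size
    # Normal-approximation margin of error at roughly 95% confidence (z = 1.96).
    # The finite population correction is ignored for simplicity.
    margin = z * math.sqrt(p_hat * (1 - p_hat) / sample_size)
    return p_hat, margin


# Hypothetical collection: 100,000 documents, of which 12% are responsive.
population = [True] * 12_000 + [False] * 88_000
p_hat, margin = estimate_proportion(population, sample_size=1_500)
print(f"estimated responsive rate: {p_hat:.3f} +/- {margin:.3f}")
```

Under these assumptions, the estimate from 1,500 sampled documents should fall close to the true 12% rate, which is the kind of agreement with statistical theory that the paper sets out to demonstrate on real eDiscovery data sets.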