AI Content Chat (Beta) logo

First Step: Test Data Ground truth data is essentially sample data that has the answer key. For instance, if you plan to test and compare the ability to process invoice data, then the ground truth data will consist of samples of invoices along with the actual value of each field you wish to extract for each sample invoice. Eachsamplewouldlooksomethinglikethis: File Name:Invoice1234.PDF Invoice Date: 6/13/2020 Invoice Number: 1234 Invoice Amount: 2112.00 Second Step: Gather TestData Your test data should be taken from real production examples. While artificial test data could potentially substitute for real data, it is typically insufficient to adequately represent the true nature of your documents. The amount of test data you need to reliably measure any system realistically depends on the amount of variance or differences observed in eachdocumenttype.Themoretestdatayouhave,themoreaccurateyour measurements will be. That is, 500 samples should be a bare minimum in order to reliably understand if a given system actually performs in productionthewayitperformsin testing. 36 36 Straight Through Processing - Document Automation

Straight Through Processing for Document Automation - Page 36 Straight Through Processing for Document Automation Page 35 Page 37