The Difference Between a Data PDF and a Scanned PDF ExplainedThe Difference Between a Data PDF and a Scanned PDF Explained https://echovera.ca/wp-content/themes/corpus/images/empty/thumbnail.jpg 150 150 Tim Robertson Tim Robertson https://secure.gravatar.com/avatar/6847ba38495b3996e0bdf3a15c015e0c?s=96&d=mm&r=g
There are 2 types of PDF file formats used for supplier invoices, and they are a data PDF and a scanned PDF.
A data PDF is an invoice that is generated by an ERP or accounting system, or with a utility that creates PDF files for documents. If you can highlight the text in a PDF document, and then successfully paste the text into a spreadsheet or Word document, then the PDF is a data PDF.
A scanned PDF is an invoice that has been printed and scanned. It is not possible to copy the text.
Because this invoice has been printed and scanned it loses the data layer in the scanning process (as the scanning is only taking a picture of the text information on the page) and you can no longer highlight the text. Instead of highlighting the data, you can highlight a box, as in this example.
For digital capture, you need a system that will capture both types of documents. Intelligent OCR captures data from both paper and emailed invoices, regardless if the PDF invoice is a data PDF or a scanned PDF. Find our more about Intelligent OCR.
- Posted In:
- AP Automation