invoice2data 0.2.40

Tested on Python 2.7, 3.4 and 3.5

- extracts text from PDF files
- searches for regex in the result
- saves results as CSV
- optionally renames PDF files to match the content

With the flexible template system you can:

- precisely match PDF files
- define static fields that are the same for every invoice
- have multiple regex per field
- define currency

Go from PDF files to this:

```
{'date': , 'invoice_number': '30064443', 'amount': 34.73, 'desc': 'Invoice 30064443 from QualityHosting'}
{'date': , 'invoice_number': 'EUVINS1-OF5-DE-120725895', 'amount': 35.24, 'desc': 'Invoice EUVINS1-OF5-DE-120725895 from Amazon EU'}
{'date': , 'invoice_number': '42183017', 'amount': 4.11, 'desc': 'Invoice 42183017 from Amazon Web Services'}
{'date': , 'invoice_number': '12429647', 'amount': 101.0, 'desc': 'Invoice 12429647 from Envato'}
```

## Installation

1. Install pdftotext

   If possible get the latest …
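As a rough illustration of the workflow in the feature overview (search already-extracted text with several regexes per field, merge in static fields, write the result as CSV), here is a minimal sketch using only the Python 3 standard library. It is not invoice2data's own code or template format: the `TEMPLATE` layout, the helper names `extract_fields` and `to_csv`, and the sample invoice text are all hypothetical and chosen for illustration only.

```python
import csv
import io
import re

# Hypothetical template layout (an assumption, not invoice2data's format).
TEMPLATE = {
    # Static fields are copied into every result unchanged.
    "static": {"issuer": "Example Hosting", "currency": "EUR"},
    # Each field may list several regexes; the first one that matches wins.
    "fields": {
        "invoice_number": [
            r"Invoice\s+No\.?\s*(\d+)",
            r"Rechnung\s+Nr\.?\s*(\d+)",
        ],
        "amount": [r"Total\s+EUR\s+(\d+\.\d{2})"],
    },
}


def extract_fields(text, template):
    """Return the static fields plus the first regex match for each field."""
    result = dict(template["static"])
    for name, patterns in template["fields"].items():
        result[name] = None  # stays None if no pattern matches
        for pattern in patterns:
            match = re.search(pattern, text)
            if match:
                result[name] = match.group(1)
                break
    return result


def to_csv(rows, fieldnames):
    """Serialize a list of dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up text standing in for the output of a PDF-to-text step.
sample_text = "Example Hosting\nRechnung Nr. 12345\nTotal EUR 34.73\n"

row = extract_fields(sample_text, TEMPLATE)
row["amount"] = float(row["amount"])
csv_text = to_csv([row], ["issuer", "currency", "invoice_number", "amount"])
print(csv_text)
```

Trying the patterns in order and stopping at the first match is one simple way to support invoices whose layout or language varies; the library itself may resolve multiple regexes differently.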