data:image/s3,"s3://crabby-images/e71aa/e71aa86071860bd7f9ae8b8946380dd408be850e" alt="Python pdf2csv"
data:image/s3,"s3://crabby-images/7198f/7198fe5203cb13777303b4f0433be779ebd23923" alt="python pdf2csv python pdf2csv"
If you're using vscode or some other compatible text editor, you should get autocompletion hints as long as you include that $schema tag at the top of your json file.Ī key part in setting up a new configuration is getting the page coordinates for the area and columns. This format is also captured in pdf_statement_rader/config/psr_ as a json-schema. It is envisaged that as more formats are added, the list of options will grow. These were the configuration options that were required for the default format.
data:image/s3,"s3://crabby-images/ff0d5/ff0d5096851bbaa8fbd17f41a57a649b8265eb88" alt="python pdf2csv python pdf2csv"
These config files are stored in a folder structure as follows: To set up a different statement, you can simply add a new config file and then tell the psr tool to use it. For now the only config supported is for Cheque account statements from Absa bank in South Africa.
Python pdf2csv how to#
A config file holds the instructions for how to process the raw pdf. Since bank statements are generally of the same (if inconvenient) format, we can set up a configuration to tell the tool how to grab the data.įor each type of bank statement, the exact format will be different. In fact this package uses tabula's pdf parsing library under the hood. For a really good semi-manual GUI solution, check out tabula. PDF files are notoriously difficult to extract data from. Validate Validates the csv statement rolling balance Pdf2csv Converts a pdf statement to a csv file using a given format Utility for reading bank and other statements in pdf formĭecrypt Decrypts a pdf file Uses pikepdf to open an encrypted pdf file.
data:image/s3,"s3://crabby-images/e71aa/e71aa86071860bd7f9ae8b8946380dd408be850e" alt="Python pdf2csv"