All programs related to a source or a target PDF file are within the pdf-related-prog folder.
The program pdf-utils/pdf_to_clean_txt.py extracts all the text from a scanned and OCR-processed PDF, such as a table of contents or an index, and cleans it.
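
The cleaning rules applied by pdf_to_clean_txt.py are not described here, so the following is only a minimal sketch of what such a cleanup step might look like for OCR text from a table of contents or an index. The function name clean_ocr_text and every rule in it (ligature normalization, re-joining hyphenated words, removing dot leaders, collapsing whitespace) are assumptions for illustration, not the script's actual behavior. Extracting the text from the PDF itself is not shown.

```python
import re
import unicodedata


def clean_ocr_text(raw: str) -> str:
    """Hypothetical cleanup of OCR text (not the actual script's logic)."""
    # Normalize compatibility characters, e.g. the "fi" ligature becomes "fi".
    text = unicodedata.normalize("NFKC", raw)
    # Re-join words that were hyphenated across a line break.
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)

    cleaned_lines = []
    for line in text.splitlines():
        # Replace dot leaders ("....." or ". . . .") with a single space.
        line = re.sub(r"(\s*\.){3,}\s*", " ", line)
        # Collapse runs of whitespace and trim the line.
        line = re.sub(r"\s+", " ", line).strip()
        # Drop lines that are empty after cleaning.
        if line:
            cleaned_lines.append(line)
    return "\n".join(cleaned_lines)
```

Under these assumptions, a line such as "Intro-" followed by "duction . . . . . 3" would come out as "Introduction 3".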