Sunday 3 September 2017

Validate/verify PDF files integrity


Is there any tool which I can run on the PDFs archives (all directories) and finally it lists/identifies the corrupt/invalid PDFs?


I have hundreds of PDF files (related to documentation etc.) on my computer (Windows machine), and very often I receive/have to send dozens of PDFs via email. It is now a normal routine that the PDF I receive or send is corrupted. This creates serious trouble sometimes, when the source file (e.g. Word file or the Tex file is missing/not available instantly).


Checking these thousands of PDFs is not possible in finite time, so I searched for a tool which I can run once and it scans all PDFs (in directories and sub-directories), and finally I get a list of those files which I should re-create. Upto now, it seems there is no such tool.



Answer



It is quite easy to check if a PDF file is valid, by using PDFtk. A free GUI for PDFtk is available from PDF Labs. When you run this tool, you can load as many PDFs as you wish, from multiple directories (by using Add files button), and then it will start accessing the pages in these PDF files, very quickly.


If any file among the selected PDFs is not valid PDF, this utility will show a message about the error, and will remove it automatically from the selection window.


Hence you can save many hours using this procedure with PDFtk. Additionally, if you have multicore CPU, you can run multiple instances of this utility and throw in hundreds of PDFs in each instance.


I am using this software since last 1 year, and it is the most handy PDF tool I have ever used.


No comments:

Post a Comment

Where does Skype save my contact's avatars in Linux?

I'm using Skype on Linux. Where can I find images cached by skype of my contact's avatars? Answer I wanted to get those Skype avat...