Open PDF Analysis Framework
PDF files rely on a complex file structure constructed from a set tokens and grammar rules. Also each token can be compressed, encrypted or even obfuscated.
Open PDF Analysis Framework (OPAF) will understand, decompress, de-obfuscate these basic PDF elements and present the resulting soup as a clean XML tree.
From there a set of configurable rules can be used to decide what to keep, what to cut out and ultimately if it is safe to open the resulting PDF projection.