The general .pdf file format is a combination of structured text , binary fields and compressed data that makes general purpose deconstruction quite tricky. If you are going after specific information in a single file then the usual unix set of hand tools may suffice. For set of documents Perl or similar scripting tool may get extract what you desire.
Also .pdf is one of those evolving standards with many versions so your milage may vary.