KCWebMonkey wrote:
Are you wanting to do this locally or online? I found a utility that runs locally and converts PDF to HTML or XML: http://pdftohtml.sourceforge.net/
Thanks, KCWebMonkey. Either locally or online will do. The problem is that it has become difficult to extract fonts from PDF files since 2000, though people do discuss workarounds in online forums. This question was put to me by somebody who needs loads of non-English text converted into XML files and then analyzed for specific words of that language. Both of us being rather uneducated in such tricky computer matters, I decided to post the question here for opinions.
The link you have forwarded is good enough, but I'll first have to learn how to install it properly, since my knowledge stops at the usual exe and zip files that software comes in.

Edited by proaudience - 19 June 2007 at 3:08pm