
Convert PDF to HTML


What is the best way to convert PDF documents so they can be viewed in the browser as HTML? The site hosts several PDF documents, and a visitor should be able to click "View as HTML" and see the document rendered on screen as an HTML page.

The site is a standard setup: PHP on Linux with Apache.


Solution

  • pdftohtml works fine: it is fast and stable, but the HTML it produces is ugly at best. I used it for quite some time on a website that hosts many job resumes.

    It is a good solution for extracting textual content, however.

    I would give the Scribd API a try,

    or the Google Apps document API. Google does a great job of displaying and converting PDF files.
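For the pdftohtml route, a minimal PHP sketch of how the "view as HTML" link could work is shown here. It assumes the Poppler build of pdftohtml is installed and on the PATH, that `-q` (quiet) and `-noframes` (single HTML file instead of a frameset) behave as in that build, and that the output lands at the given base name plus `.html`. The function names and the cache directory are hypothetical, chosen only for illustration.

```php
<?php
// Hypothetical helper: builds the shell command for converting one PDF.
// Both paths are escaped so file names with spaces or quotes are safe.
function buildPdfToHtmlCommand(string $pdfPath, string $outputBase): string
{
    return sprintf(
        'pdftohtml -q -noframes %s %s',
        escapeshellarg($pdfPath),
        escapeshellarg($outputBase)
    );
}

// Hypothetical helper: converts a PDF once and reuses the cached HTML after that.
// Returns the path of the generated HTML file.
function convertPdfToHtml(string $pdfPath, string $cacheDir): string
{
    if (!is_readable($pdfPath)) {
        throw new InvalidArgumentException("Cannot read PDF: $pdfPath");
    }

    // Key the cache on the file contents so a replaced PDF is converted again.
    $outputBase = rtrim($cacheDir, '/') . '/' . md5_file($pdfPath);
    $htmlPath = $outputBase . '.html'; // assumed output name, see note above

    if (!is_file($htmlPath)) {
        exec(buildPdfToHtmlCommand($pdfPath, $outputBase) . ' 2>&1', $output, $exitCode);
        if ($exitCode !== 0 || !is_file($htmlPath)) {
            throw new RuntimeException('pdftohtml failed: ' . implode("\n", $output));
        }
    }

    return $htmlPath;
}
```

A page handling the link could then call `readfile(convertPdfToHtml($pdf, '/var/cache/pdfhtml'))`, where the cache path is again only an example and must be writable by the web server. Caching matters here because the conversion runs an external process on every miss.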