pythoninstallationpdftotext

Unable to install pdftotext on Python 3.6, missing poppler


How can I install pdftotext properly?

I'm getting the error message below when installing pdftotext in Python 3.6. I also tried to install the package manually by downloading the zip file but still got the same error.

  pdftotext/pdftotext.cpp(4): fatal error C1083: Cannot open include file: 'poppler/cpp/poppler-document.h': No such file or directory
  error: command 'C:\\Program Files (x86)\\Microsoft Visual Studio 14.0\\VC\\BIN\\x86_amd64\\cl.exe' failed with exit status 2     

Solution

  • I found some help in the Readme.md file in the pdftotext package :

    1) Install OS Dependencies :

    on Debian, Ubuntu, and friends:

    sudo apt-get update
    sudo apt-get install build-essential libpoppler-cpp-dev pkg-config python-dev
    

    on Fedora, Red Hat, and friends:

    sudo yum install gcc-c++ pkgconfig poppler-cpp-devel python-devel redhat-rpm-config
    

    2) Do the normal install :

    pip install pdftotext
    

    and it worked for me.