.net-corepdfboxapache-tikapdf-parsingtika-server

Apache Tika Server Password protected pdf file parsing


I am using Tika server 2.5, when trying to parse pdf document which is password protected getting exeption of EncryptedDocumentException, so is their any way to parse this document or send the password to tika server for parsing?


Solution

  • it worked for me by using the header Password:

    Curl -T "Simple-40-bit-RC4-encrypted.pdf" http://localhost:9998/rmeta/form --header "Password:1234567890abcdefghijk1234567890abcdefghijk"