azureazure-document-intelligence

When I am trying to analyze telugu pdf in Azure document Intelligence, I am getting telugut. Is there a way to fix it?


When I am trying to analyze Telugu pdf in Azure document Intelligence, I am getting telugut. Is there a way to fix it?

!enter image description here](https://i.sstatic.net/fXsLrL6t.png)

This the output.

I expect the telugu text extracted from the document. I tried searching API doc, but there is no way set the language.

Any help is appreciated


Solution

  • As of the latest updates, Azure's OCR capabilities support a wide range of languages for printed text extraction. However, Telugu is not listed among the languages supported for printed text OCR. This means that while the service can process documents in many languages, it may not accurately extract text from documents written in Telugu.​