unicodeencodingcharacter-encodingcharactercyrillic

Converting "болно" to Cyrillic


I have problem in my database where some of the Cyrillic text is seen like this "болно Ð±Ð°Ñ Ð°Ð¼ÑŒÐ´Ñ€ÑƒÑƒÐ»Ð¶ ч Ð". Is there a way to convert this to back to human readable format.

I need to read actual context of this.


Solution

  • Best I could do from your data...it looks Cyrillic but Google Translate didn't make anything of it. It seems it was decoded under the default US Windows codec but was really UTF-8, but the data is not quite right. I'm using Python to attempt to fix it:

    >>> s.encode('cp1252').decode('utf8',errors='replace')
    'болно ба� амьдруулж ч �'