So what I am trying to do is convert a header request's content-type
into a file extension. The typical content-type
is like this for HTML pages text/html; charset=utf-8
that is the given response from Python. I have looked into using the mimetype module with no success as it doesn't look like it accommodates what I am looking for.
Rundown:
I want to convert text/html; charset=utf-8
into string .html
.
The typical image content-type is image/jpeg
depending on the image type, but I am not too worried about images, given that most URLs specify the image in the path. This is more for websites that don't end in "blahahah.html"
I do not want to use any libraries that are not in the base Python library.
You could split and strip:
r = requests.get("http://stackoverflow.com/questions/29674905/convert-content-type-header-into-file-extension")
from mimetypes import guess_extension
print(guess_extension(r.headers['content-type'].partition(';')[0].strip()))
.htm