pythonmediawikiwikipediapywikibot

Content of infobox of Wikipedia


I need to get the content of an infobox of any movie. I know the name of the movie. One way is to get the complete content of a Wikipedia page and then parse it until I find {{Infobox and then get the content of the infobox.

Is there any other way for the same using some API or parser?

I am using Python and the pywikipediabot API.

I am also familiar with the wikitools API. So instead of pywikipedia if someone has solution related to the wikitools API, please mention that as well.


Solution

  • Instead of reinventing the wheel, check out DBPedia, which has already extracted all Wikipedia infoboxes into an easily parsable database format.