mysqlimdbimdbpy

Description of the SQL movie databse made by IMDBpy


Recently, I am using IMDBpy API to scrape the IMDB dataset. In this API, there is a imdbpy2sql.py which could convert IMDB movie dataset to a SQL database. But I can not find any description of this dataset. So I can not understand the schema of this SQL database. There are too many tables in this database. Is there any way to know that?

I strictly follow this website to build my database http://blog.secaserver.com/2013/08/importing-imdb-sample-data-set-mysql/.

Thanks so much!!


Solution

  • I doubt that there are too many tables. There are a lot properties/relationships available.

    I generated this image once while creating pyIRDG. You can have a look at that code too for documentation on the available data. Here is the output of the comments: http://pastebin.com/zGnZ02w4

    I've also used MySQL Workbench to generate a schema from the db.

    There is also this German blog article with an ERM image.

    not that I'm aware of, and for sure our db is not in any NF. :-) Anyway, you can easily look at the scheme in the imdb/parser/sql/dbschema.py module or using some tool directly on the database. Source.