IMDb
The IMDb database: moderately large, real database of movies.
Original source: kt.ijs.si
Versions
Imdb_ijs (by Janez Kranjc)
- Added foreign key constrains by removal of violating samples. Note that accented chars are missing in "movies"."name" column. Unfortunatelly, accented chars are already missing in the original sql dump.
IMDB_1R (by Oliver Schulte)
Imdb_small (by Jan Motl)
Dataset details
- Associated task:
- Classification
- Domain:
- Entertainment
- Data types:
- Size:
- 477.1 MB
- Count of tables:
- 7
- Count of rows:
- 5,694,919
- Count of columns:
- 21
- Missing values:
- Yes
- Compound keys:
- No
- Loops:
- No
- Type:
- Real
- Instance count:
- 738,576
- Target table:
- actors
- Target column:
- gender
- Target ID:
- ?
- Target timestamp:
- ?
Algorithms
How to download the dataset
The datasets are publicly available directly from MariaDB database.
- Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
- Use following credentials:
- hostname: relational.fel.cvut.cz
- port: 3306
- username: guest
- password: ctu-relational
- Export "imdb_ijs" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).