Last Updated on April 30, 2023 by mishou
I. Where are the IMDb Datasets?
The IMDb Datasets are available here:
https://www.imdb.com/interfaces/
https://datasets.imdbws.com/
II. Download a …tsv file and extract it
Run these commands on Google Colaboratory to download the name.basics.tsv.gz and title.principals.tsv.gz files:
!wget https://datasets.imdbws.com/name.basics.tsv.gz
!wget https://datasets.imdbws.com/title.principals.tsv.gz
Run these commands to extract the GZ files:
!gunzip name.basics.tsv.gz
!gunzip title.principals.tsv.gz
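As an alternative to extracting the files first, pandas can usually read a gzip-compressed file directly, because read_csv infers the compression from the ".gz" suffix by default. The sketch below shows this on a tiny made-up file written to a temporary directory. The rows, the "nm_test_..." IDs, and the file name sample.tsv.gz are placeholders, not real IMDb data. Note that gunzip deletes the .gz file after extraction, so this approach only applies if you skip that step.

```python
import gzip
import os
import tempfile

import pandas as pd

# Hypothetical sample in the same tab-separated layout (not real IMDb data).
sample = (
    "nconst\tprimaryName\tbirthYear\n"
    "nm_test_1\tExample Person\t1950\n"
    "nm_test_2\tAnother Person\t1975\n"
)

with tempfile.TemporaryDirectory() as tmp_dir:
    gz_path = os.path.join(tmp_dir, "sample.tsv.gz")

    # Write a small gzip-compressed TSV file to read back.
    with gzip.open(gz_path, "wt", encoding="utf-8") as f:
        f.write(sample)

    # Compression is inferred from the ".gz" suffix, so no gunzip step is needed.
    df_sample = pd.read_csv(gz_path, sep="\t")

print(df_sample)
```

With the real downloads, the same call would take a path such as '/content/name.basics.tsv.gz', assuming the file is still compressed at that location.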
III. Load the data with Pandas
import pandas as pd
df_name = pd.read_csv('/content/name.basics.tsv', sep='\t')
df_principals = pd.read_csv('/content/title.principals.tsv', sep='\t')
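The IMDb dataset documentation says that missing values are written as \N. Loaded as above, a column such as birthYear therefore stays a text column wherever \N appears. The following is a minimal sketch of one way to handle this, passing na_values to read_csv. It uses a small in-memory sample with placeholder rows instead of the real file, and the names df_raw and df_clean are only for illustration.

```python
import io

import pandas as pd

# Hypothetical rows using the \N missing-value marker (not real IMDb data).
sample_tsv = (
    "nconst\tprimaryName\tbirthYear\tdeathYear\n"
    "nm_test_1\tExample Person\t1950\t2010\n"
    "nm_test_2\tAnother Person\t1975\t\\N\n"
    "nm_test_3\tThird Person\t\\N\t\\N\n"
)

# Default loading: "\N" is kept as a literal string, so birthYear is text.
df_raw = pd.read_csv(io.StringIO(sample_tsv), sep="\t")

# With na_values, "\N" becomes NaN and the year columns are parsed as numbers.
df_clean = pd.read_csv(io.StringIO(sample_tsv), sep="\t", na_values="\\N")

print(df_raw["birthYear"].tolist())
print(df_clean[["birthYear", "deathYear"]])
```

The same na_values argument can be added to the read_csv calls for name.basics.tsv and title.principals.tsv.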
IV. Calculate ages
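One possible way to calculate ages from the birthYear and deathYear columns of name.basics is sketched below. This is an assumption about what "age" means here, and not necessarily what the linked notebook does: it gives age at death for people with a deathYear, and age in a fixed reference year otherwise. The sample rows and the names df_people, REFERENCE_YEAR, and end_year are hypothetical. Since the data only holds years, the result can be off by up to one year.

```python
import pandas as pd

# Hypothetical sample with the birthYear/deathYear columns of name.basics,
# with missing years already converted to NaN (not real IMDb data).
df_people = pd.DataFrame(
    {
        "primaryName": ["Example Person", "Another Person", "Third Person"],
        "birthYear": [1950, 1975, None],
        "deathYear": [2010, None, None],
    }
)

# Assumption: a fixed reference year for people without a deathYear.
REFERENCE_YEAR = 2023

# Use deathYear where it is known, otherwise the reference year.
end_year = df_people["deathYear"].fillna(REFERENCE_YEAR)

# Age stays NaN when birthYear is unknown.
df_people["age"] = end_year - df_people["birthYear"]

print(df_people)
```

A fixed reference year keeps the result reproducible. Replacing it with the current year would give present-day ages for living people.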
You can see all the scripts here on Google Colaboratory:
https://colab.research.google.com/drive/1MMWbMtZFHJmeN93t14UZiEYWrDctmIeO?usp=sharing
To be continued.