Last Updated on April 30, 2023 by mishou

I. Where are the IMDb Datasets?

IMDb Datasets are here:

II. Download a …tsv file and extract it

To download the name.basics.tsv.gz file, run the following command on Google Colaboratory:

!wget
!wget

Run these commands to extract the .gz files:

!gunzip name.basics.tsv.gz
!gunzip title.principals.tsv.gz
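If you prefer to stay in Python instead of calling the shell, the same extraction can be sketched with the standard library. This is only an illustration: the helper name gunzip_file is hypothetical and not part of the original notebook.

```python
import gzip
import shutil


def gunzip_file(gz_path, out_path):
    """Decompress gz_path into out_path.

    The data is streamed in chunks, so a large file does not have to
    fit in memory all at once.
    """
    with gzip.open(gz_path, 'rb') as src, open(out_path, 'wb') as dst:
        shutil.copyfileobj(src, dst)
```

For example, gunzip_file('name.basics.tsv.gz', 'name.basics.tsv') would write the extracted file next to the archive. Unlike !gunzip, this sketch keeps the original .gz file.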

III. Load the data with Pandas

import pandas as pd

df_name = pd.read_csv('/content/name.basics.tsv', sep='\t')
df_principals = pd.read_csv('/content/title.principals.tsv', sep='\t')
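One thing to watch for when loading: the IMDb files mark missing values with \N, which pandas does not treat as missing by default. The sketch below shows how the na_values parameter of read_csv handles that. It uses a few made-up rows in the name.basics.tsv layout (not real IMDb records) so that it runs without the download.

```python
import io

import pandas as pd

# Made-up rows in the name.basics.tsv layout; these are not real IMDb records.
sample_tsv = (
    "nconst\tprimaryName\tbirthYear\tdeathYear\n"
    "nm9999991\tExample Person A\t1950\t\\N\n"
    "nm9999992\tExample Person B\t\\N\t\\N\n"
    "nm9999993\tExample Person C\t1920\t1990\n"
)

# na_values='\\N' tells pandas to read the literal text \N as a missing value,
# so the year columns are parsed as numbers instead of strings.
df_sample = pd.read_csv(io.StringIO(sample_tsv), sep='\t', na_values='\\N')

print(df_sample.dtypes)
print(df_sample.isna().sum())
```

The same na_values='\\N' argument can be passed when reading the real files, which should make the year columns numeric and ready for arithmetic.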

IV. Calculate ages

You can see all the scripts here on Google Colaboratory:

To be continued.
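As a rough sketch of what an age calculation could look like, assuming the birthYear and deathYear columns of name.basics.tsv have been loaded as numbers: the function name add_age and the reference year are assumptions made for this illustration, not the scripts from the notebook. The rows are made up.

```python
import pandas as pd

# Made-up rows with the birthYear and deathYear columns of name.basics.tsv.
df_people = pd.DataFrame({
    'nconst': ['nm9999991', 'nm9999992', 'nm9999993'],
    'birthYear': [1950, None, 1920],
    'deathYear': [None, None, 1990],
})

REFERENCE_YEAR = 2023  # assumption: the year used for people without a deathYear


def add_age(df, reference_year):
    """Return a copy of df with an 'age' column.

    The age is deathYear - birthYear when a death year is known,
    otherwise reference_year - birthYear. Rows without a birthYear get NaN.
    """
    out = df.copy()
    end_year = out['deathYear'].fillna(reference_year)
    out['age'] = end_year - out['birthYear']
    return out


df_age = add_age(df_people, REFERENCE_YEAR)
print(df_age)
```

Because the dataset only has years, not full birth dates, ages computed this way can be off by one year.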

By mishou
