description
Pandas gives you a nice way to view, filter and convert UDT datasets.

Import Datasets into Pandas

Exporting UDT Dataset as CSV

You can export any UDT dataset into a CSV file using the download button at the top of the page.

Download CSV from the Universal Data Tool

Import CSV Into Pandas DataFrame

We can begin by importing pandas and reading our udt.csv file.

import pandas as pd

url_or_filepath_to_csv = "https://raw.githubusercontent.com/UniversalDataTool/udt-dataset-cats-and-dogs/master/coco_dogs_and_cats.udt.csv"
udt_csv = pd.read_csv(url_or_filepath_to_csv)

{% hint style="info" %} You can use the udt.json format too. Tables are just a nice way to visualize the data! {% endhint %}

If you view the udt_csv object, you should now see the contents of your CSV laid out as a table, ready to be filtered or converted.

coco_dogs_and_cats.udt.csv
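
As a rough illustration of viewing, filtering and converting the loaded table, here is a minimal sketch. It builds a tiny stand-in DataFrame so that it runs without a network connection. The column names `imageUrl` and `label` are assumptions made for this example, not the actual UDT CSV layout, so check `udt_csv.columns` on your own export and substitute the real names.

```python
import io

import pandas as pd

# Hypothetical stand-in for a UDT CSV export. The real column names
# depend on your dataset; inspect udt_csv.columns to find them.
csv_text = """imageUrl,label
https://example.com/a.jpg,cat
https://example.com/b.jpg,dog
https://example.com/c.jpg,cat
"""
udt_csv = pd.read_csv(io.StringIO(csv_text))

# View: the first rows and the available columns.
print(udt_csv.head())
print(udt_csv.columns.tolist())

# Filter: keep only the rows whose (assumed) label column is "cat".
cats = udt_csv[udt_csv["label"] == "cat"]

# Convert: turn the filtered rows into a list of plain dictionaries.
records = cats.to_dict(orient="records")
print(records)
```

With a real export, replace the `io.StringIO(csv_text)` argument with the URL or file path used earlier.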

Downloading Images

UDT datasets only contain links to images, so we'll need to download the actual images. Check out the fast.ai image classification tutorial, where we show how to easily download images using the fast.ai download_images function.
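
If you would rather not depend on fast.ai, the download step can also be sketched with the Python standard library alone. This is an assumption-laden alternative, not the method from the tutorial: `download_images` below is a hypothetical helper written for this page (it is not the fast.ai function of the same name), and the `imageUrl` column in the usage comment is a placeholder for whichever column holds the image links in your export.

```python
import os
from urllib.parse import urlparse
from urllib.request import urlretrieve


def download_images(urls, dest_dir, fetch=urlretrieve):
    """Download each URL into dest_dir and return the saved local paths.

    Hypothetical helper for illustration. `fetch` is any callable taking
    (url, local_path); it defaults to urllib's urlretrieve.
    """
    os.makedirs(dest_dir, exist_ok=True)
    saved = []
    for index, url in enumerate(urls):
        # Use the last path segment as the file name, prefixed with the
        # position so two URLs with the same base name do not collide.
        name = os.path.basename(urlparse(url).path) or "image"
        local_path = os.path.join(dest_dir, f"{index:05d}_{name}")
        try:
            fetch(url, local_path)
        except OSError as err:  # URLError and HTTPError subclass OSError
            print(f"skipping {url}: {err}")
            continue
        saved.append(local_path)
    return saved


# Usage, assuming the links live in a column named "imageUrl":
# paths = download_images(udt_csv["imageUrl"].dropna(), "images")
```

Failed downloads are skipped rather than raised, so one dead link does not stop the whole run. The returned list tells you which files actually arrived.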