This repository contains open source data sets. They are intended to be used as part of an introduction to the Deephaven Community Core Engine. For more information, check out Deephaven Community Core.
The following folders can be found in this repository:
- CreditCardFraud - Anonymized credit card purchase data, including fraudulent purchases.
- CryptoCurrencyHistory - Data from CoinGecko to highlight use of CSV and Parquet data formats.
- DeNiro - Data on Robert De Niro's movies up to 2016.
- GSOD - Global Surface Summary of the Day (GSOD) weather data.
- Insurance - A dataset of insurance charges given some indicators of patient health.
- Iris - The iris flower data set from Ronald Fisher's 1936 paper.
- MetricCentury - Data recorded from a 100 km bike ride.
- NCAA - Twitter data and sentiment for the Elite 8 men's and women's NCAA teams, covering the 7 days before March 30, 20212.
- NoodlePug - Bones and no-bones data for Noodle the Pug through Oct 27, 2021.
- Pems - Traffic flow data collected near Davis, CA.
- Taxi - Yellow Taxi trip records.
Each folder in this repository has the following structure within:
- README - An explanation of everything about the data.
- csv - A folder with all relevant data in either CSV or TSV format (if available).
- parquet - A folder with all relevant data in Parquet format (if available).
- Files to run a Python and/or Groovy script will be included (if available).
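Because the csv and parquet folders are each optional, it can help to check which formats a given dataset actually ships before pointing a script at it. The following is a minimal sketch, not part of the repository: the function name `dataset_formats` is hypothetical, and it assumes only the folder layout described above.

```shell
# Hypothetical helper: report which data formats a dataset folder provides.
# The csv/ and parquet/ subfolder names come from the layout described above;
# the function name is an assumption made for illustration.
dataset_formats() {
    local dir="$1"
    local formats=""

    # Each format lives in its own subfolder, and either one may be absent.
    if [ -d "$dir/csv" ]; then
        formats="${formats}csv "
    fi
    if [ -d "$dir/parquet" ]; then
        formats="${formats}parquet "
    fi

    # Trim the trailing space; print "none" when neither folder exists.
    formats="${formats% }"
    echo "${formats:-none}"
}
```

Calling `dataset_formats path/to/Iris` prints the formats found in that folder, separated by spaces, or `none` if neither subfolder is present.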
The examples script image can be pulled via:
docker pull ghcr.io/deephaven/examples
To download the examples, run the following from the root of your deephaven-core clone:
docker run --rm -v "$(pwd)/data:/data" ghcr.io/deephaven/examples download
- The docker run command downloads the example data by running the examples management container.
- The -v "$(pwd)/data:/data" argument mounts your local $(pwd)/data path as /data in the container.
- The example data is stored to /data/examples inside the container, which is $(pwd)/data/examples on the local filesystem.
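The mapping between container paths and host paths can be sketched as a small shell function. This is an illustration only: the name `container_to_host` is hypothetical, and it assumes the exact `-v "$(pwd)/data:/data"` mount shown above.

```shell
# Hypothetical helper: translate a path inside the container to the
# matching path on the host, given the -v "$(pwd)/data:/data" mount.
container_to_host() {
    local container_path="$1"
    # Directory that docker run was started from; defaults to the current one.
    local host_root="${2:-$(pwd)}"

    case "$container_path" in
        /data|/data/*)
            # Replace the leading /data with <host_root>/data.
            echo "$host_root/data${container_path#/data}"
            ;;
        *)
            # Paths outside the mount exist only inside the container.
            echo "not mounted: $container_path" >&2
            return 1
            ;;
    esac
}
```

For example, `container_to_host /data/examples` run from the root of your deephaven-core clone prints the `data/examples` path under that clone.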
You can run docker run again to manage the example data, for example to download a new version. To see what options are available, run:
docker run --rm -v "$(pwd)/data:/data" ghcr.io/deephaven/examples
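If you script your setup, you may want to skip the download when the data is already on disk. The sketch below wraps the download command given earlier; the function name `ensure_examples` and the skip-if-present policy are assumptions for illustration, not behavior of the examples image itself.

```shell
# Hypothetical wrapper: download the examples only when they are missing.
# The docker command is the one given earlier; the function name and the
# "skip if present" policy are assumptions made for illustration.
ensure_examples() {
    if [ -d "$(pwd)/data/examples" ]; then
        # The data directory already exists, so leave it untouched.
        echo "examples already present in $(pwd)/data/examples"
    else
        docker run --rm -v "$(pwd)/data:/data" ghcr.io/deephaven/examples download
    fi
}
```

Run `ensure_examples` from the root of your deephaven-core clone. To fetch a newer version of data that is already present, run the download command directly.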
To build the image yourself, run the following from the root of your examples clone:
docker build -t ghcr.io/deephaven/examples docker
The docker build command builds a Docker image containing a script that helps manage the examples.
That script automates the management of the example files: it lets you download and update them without working directly with the git project hosting the examples. The docker build command is only needed once; after it runs, the image is available to use on your host.