Data USA ACS Bamboo ETL

This repository contains a series of Bamboo pipelines that process and ingest the American Community Survey (ACS) data used in Data USA.

Note: the API does not work correctly for year 2014, estimate 1, at the nation geographic level, so the repository includes a folder with those files downloaded manually.
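As a rough illustration of the workaround in the note above, the sketch below shows one way a pipeline step could decide between the API and the manually downloaded files. The folder name `manual_downloads`, the file naming scheme, and the function `resolve_source` are all hypothetical; the repository's actual handling may differ.

```python
from pathlib import Path

# Hypothetical folder name: the README does not say where the manual files live.
MANUAL_DIR = Path("manual_downloads")

# The one (year, estimate, geography) combination the note reports as broken.
BROKEN_API_COMBOS = {(2014, 1, "nation")}


def resolve_source(year, estimate, geo, manual_dir=MANUAL_DIR):
    """Return ("file", path) for combinations read from disk, else ("api", None)."""
    if (year, estimate, geo) in BROKEN_API_COMBOS:
        # Hypothetical file naming scheme, used here only for illustration.
        return "file", manual_dir / f"{year}_{estimate}_{geo}.json"
    return "api", None
```

Keeping the broken combinations in a single set means a later API fix only requires removing one entry.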

Local Setup

Create a new virtual environment and activate it:

python -m venv venv
source venv/bin/activate

Install the requirements:

pip install -r requirements.txt

Create an environment variables file following this structure:

export API_KEY='<Your API KEY goes here>';
export DATAUSA_DB_PW='monetdb'; # The default password for MonetDB is monetdb.
export DATAUSA_DB_HOST='localhost'; # Assuming you're ingesting inside a local MonetDB container.
export PYTHONPATH=$PYTHONPATH:<path to repository>/datausa-acs-bamboo-etl;
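Before running a pipeline, it can help to confirm that the variables from the file above are actually present in the current shell. The following is a minimal sketch, not part of the repository; the helper name `missing_env_vars` is an assumption, while the variable names come from the structure shown above.

```python
import os

# Variable names taken from the environment file structure above.
REQUIRED_VARS = ("API_KEY", "DATAUSA_DB_PW", "DATAUSA_DB_HOST")


def missing_env_vars(environ=None):
    """Return the required variable names that are unset or empty."""
    env = os.environ if environ is None else environ
    return [name for name in REQUIRED_VARS if not env.get(name)]
```

Calling `missing_env_vars()` after `source .env` should return an empty list; any names it returns still need to be exported.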

Run a pipeline, for example, the Gini pipeline:

source .env
cd acs/acs_yg_gini
python acs_yg_gini_pipeline.py

Dockerized Setup

Create a Python 3.7.9 container:

docker run -it -v <path to repository>/datausa-acs-bamboo-etl:/datausa-acs-bamboo-etl --name=python3-local python:3.7.9 bash

Use the previously created .env file and run a pipeline, for example, the Gini pipeline:

cd datausa-acs-bamboo-etl
pip install -r requirements.txt
source .env
cd acs/acs_yg_gini
python acs_yg_gini_pipeline.py
