Data Pipeline for Data Analysis, built separately from the CUBEMS environment
- Go into the `dataflow` folder
- Set up a virtual environment

  ```
  python3 -m venv venv
  ```
- Activate the virtual environment
  - Windows:

    ```
    venv\Scripts\activate.bat
    ```

  - Linux/Mac:

    ```
    source venv/bin/activate
    ```

- Install the required packages

  ```
  pip3 install -r requirements.txt
  ```
- Set the `GOOGLE_APPLICATION_CREDENTIALS` environment variable

  ```
  export GOOGLE_APPLICATION_CREDENTIALS=/path/to/key.json
  ```

  OR create a `.env` file with the following content (see the sketch below for how the pipeline can pick it up)

  ```
  GOOGLE_APPLICATION_CREDENTIALS=/path/to/key.json
  ```
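  A minimal sketch of loading that variable from `.env`, assuming the pipeline uses the `python-dotenv` package (an assumption; it would then need to be listed in `requirements.txt`):

  ```python
  # Sketch: load GOOGLE_APPLICATION_CREDENTIALS from .env via python-dotenv.
  # Assumption: python-dotenv is installed (i.e. listed in requirements.txt).
  import os

  from dotenv import load_dotenv

  load_dotenv()  # copies entries from .env into os.environ
  print(os.environ["GOOGLE_APPLICATION_CREDENTIALS"])
  ```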
- Run the pipeline. Check `direct-runner.py` for a local pipeline template and `dataflow-runner.py` for a Dataflow pipeline template (a sketch of a pipeline file follows below)

  ```
  python3 pipeline-file.py
  ```
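  As a minimal sketch of what such a pipeline file looks like (the transforms are illustrative, not this repo's logic; `direct-runner.py` and `dataflow-runner.py` are the actual templates):

  ```python
  # Minimal Apache Beam pipeline, runnable locally with the DirectRunner.
  import apache_beam as beam
  from apache_beam.options.pipeline_options import PipelineOptions


  def run():
      # Swapping in runner="DataflowRunner" (plus GCP project/staging
      # options) is what distinguishes the Dataflow variant from this one.
      options = PipelineOptions(runner="DirectRunner")
      with beam.Pipeline(options=options) as p:
          (
              p
              | "Create" >> beam.Create(["sensor-a", "sensor-b"])
              | "Upper" >> beam.Map(str.upper)
              | "Print" >> beam.Map(print)
          )


  if __name__ == "__main__":
      run()
  ```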
- Deploy a Dataflow template. The filename must match the template name, and the file must follow the guideline at https://cloud.google.com/dataflow/docs/guides/templates/creating-templates (a sketch of the staging step follows below)

  ```
  python3 deploy.py --template [template name]
  ```
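  A hypothetical sketch of what `deploy.py` might do: per the linked guide, a classic template is staged (not executed) when the pipeline runs with `--template_location`. The project, region, and bucket values below are placeholders, not values from this repo:

  ```python
  # Hypothetical sketch of deploy.py: stage a classic Dataflow template by
  # re-running the pipeline file with --template_location.
  import subprocess
  import sys

  template = sys.argv[sys.argv.index("--template") + 1]
  subprocess.run(
      [
          "python3",
          f"{template}.py",  # filename must match the template name
          "--runner", "DataflowRunner",
          "--project", "your-gcp-project",                   # placeholder
          "--region", "us-central1",                         # placeholder
          "--staging_location", "gs://your-bucket/staging",  # placeholder
          "--temp_location", "gs://your-bucket/temp",        # placeholder
          "--template_location", f"gs://your-bucket/templates/{template}",
      ],
      check=True,
  )
  ```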
- Go into the `functions` folder
- Install dependencies

  ```
  yarn
  ```

- Deploy to Firebase

  ```
  yarn deploy
  ```