Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore: trigger release process #858

Open
wants to merge 156 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
156 commits
Select commit Hold shift + click to select a range
ea80fbf
feat: expose summary statistics qc and locus breaker steps to hydra c…
project-defiant Jul 25, 2024
a5c96b9
fix: change config params to match new name (#721)
project-defiant Jul 29, 2024
897ae12
chore: pre-commit autoupdate (#715)
pre-commit-ci[bot] Jul 30, 2024
397f1e9
build(deps-dev): bump pymdown-extensions from 10.8.1 to 10.9 (#720)
dependabot[bot] Jul 31, 2024
0b9af70
build(deps-dev): bump deptry from 0.17.0 to 0.18.0 (#723)
dependabot[bot] Aug 8, 2024
ae734a8
feat: notebook for locus breaker and susie finemapping benchmark (#717)
Daniel-Considine Aug 8, 2024
5b5b408
fix: updating config paths and fine-mapping methods (#725)
Daniel-Considine Aug 8, 2024
e45f295
docs: macos fix for some functions (#729)
Daniel-Considine Aug 15, 2024
f49a5c5
build(deps-dev): bump ruff from 0.5.1 to 0.6.1 (#732)
dependabot[bot] Aug 22, 2024
4169b81
chore: pre-commit autoupdate (#724)
pre-commit-ci[bot] Aug 22, 2024
02b006d
build(deps-dev): bump deptry from 0.18.0 to 0.19.1 (#728)
dependabot[bot] Aug 22, 2024
5984de9
build(deps-dev): bump lxml from 5.2.2 to 5.3.0 (#727)
dependabot[bot] Aug 23, 2024
4db64a3
build(deps-dev): bump pre-commit from 3.7.1 to 3.8.0 (#719)
dependabot[bot] Aug 23, 2024
dc8c9dc
feat(variant_index): changes for a successful run (#735)
ireneisdoomed Aug 27, 2024
348b6f0
fix: update cluster creation command (#739)
project-defiant Aug 28, 2024
b6746fb
fix: revert recursiveFileLookup to False (#738)
project-defiant Aug 28, 2024
cf1ec24
feat: Finngen r11 ingestion (#733)
project-defiant Aug 29, 2024
2612aa7
fix: using h4 instead of log2(h4/h3) (#740)
Daniel-Considine Aug 30, 2024
93a6e60
fix: adding carma_tau parameter to susie_finemapper (#743)
addramir Sep 3, 2024
bb8558c
feat: logic and airflow pipeline for validation (#730)
DSuveges Sep 3, 2024
1a7b0d7
feat(variant index): improved data structure (#710)
DSuveges Sep 3, 2024
3ea47a9
fix: remove finngen prefix from credible set (#746)
project-defiant Sep 3, 2024
12ff35b
fix(finngen_r11): preserve all studyIds (#747)
project-defiant Sep 4, 2024
15a058e
feat: adding finemapping method to studylocusid hash (#744)
Daniel-Considine Sep 9, 2024
3c1e81a
build(deps-dev): bump ipython from 8.26.0 to 8.27.0 (#741)
dependabot[bot] Sep 9, 2024
151b4ec
build(deps-dev): bump deptry from 0.19.1 to 0.20.0 (#742)
dependabot[bot] Sep 10, 2024
0b216f6
fix: validation name mapping (#753)
project-defiant Sep 10, 2024
010c881
fix: removing old functions (#752)
addramir Sep 11, 2024
feb9cad
refactor: generalise per-chromosome processing (#754)
tskir Sep 11, 2024
d10cc20
refactor: generalise the harmonisation pipeline (#755)
tskir Sep 11, 2024
a49ae9a
feat: ingest FinnGen UKB meta-analysis data (#756)
tskir Sep 11, 2024
6469bf5
feat(validation): adding credible set variant validation (#757)
DSuveges Sep 12, 2024
d3435bb
feat: drop airflow orchestration layer from gentropy (#758)
project-defiant Sep 12, 2024
d6364f8
fix: multiple fixes after debugging and test runs (#760)
tskir Sep 13, 2024
34aaaf5
build(deps-dev): bump mkdocstrings-python from 1.10.5 to 1.11.1 (#749)
dependabot[bot] Sep 13, 2024
2e0e326
build(deps): bump wandb from 0.17.2 to 0.18.0 (#763)
dependabot[bot] Sep 17, 2024
6ede736
fix: prevent multiple credible filters to override spark plan (#766)
d0choa Sep 17, 2024
8c4421a
feat: flag MHC credible sets based on lead (#767)
d0choa Sep 18, 2024
d84d43b
fix: add condition to eQTL study index and schema (#770)
vivienho Sep 18, 2024
c3651b7
fix(effect harmonisation): addressing beta harmonisation bug (#762)
DSuveges Sep 18, 2024
c292e84
perf: quickly build a Docker image for every branch (#773)
tskir Sep 19, 2024
785dad0
feat: flag all top-hits from GWAS catalog curation (#775)
d0choa Sep 20, 2024
018defa
docs: fix broken refs (#768)
d0choa Sep 20, 2024
ad3f503
feat: flag PICS top hits in studies with credset sumstats (#777)
d0choa Sep 20, 2024
58fb726
fix(ld clumping): a revised logic allows a more accurate clumping (#772)
DSuveges Sep 23, 2024
b93842a
refactor(L2GFeatureMatrix)!: streamline feature matrix management (#745)
ireneisdoomed Sep 23, 2024
a29222e
feat(dataproc): ability to version gentropy for dataproc cluster (#774)
project-defiant Sep 23, 2024
dcacaf7
feat: add `studyType` to `StudyLocus` and `Colocalisation` (and `Stud…
vivienho Sep 24, 2024
df45a6c
feat: adding window based clumping to StudyLocus (#779)
DSuveges Sep 24, 2024
148e26e
fix: small qc flag fixes (#784)
addramir Sep 24, 2024
ccdb1f2
feat: add biosample index (#769)
Tobi1kenobi Sep 24, 2024
84d6638
feat: 99% credible set validation during `study_locus_validation` (#765)
d0choa Sep 24, 2024
2199ece
feat: flag credible sets explained by SuSiE regions (#780)
d0choa Sep 24, 2024
2010fb6
fix: remove n_eff check from qc_step (#785)
addramir Sep 24, 2024
d2a68d9
build(deps-dev): bump pymdown-extensions from 10.9 to 10.10.1 (#781)
dependabot[bot] Sep 24, 2024
95be9f6
build: updated precommits including adjustments to docstrings (#787)
d0choa Sep 24, 2024
6c4bdf5
fix(finngen_study_index): improved tests for finngen study index (#776)
project-defiant Sep 24, 2024
b525117
fix: clean unused study_locus step parameter (#786)
d0choa Sep 25, 2024
51125c7
fix(vep_parser): use nested schema for insilico predictors (#789)
project-defiant Sep 26, 2024
9f83329
fix: remove study_index_path from coloc step (#791)
project-defiant Sep 26, 2024
a135d26
fix(safe_array_union): allow for sorting nested structs (#793)
project-defiant Sep 27, 2024
26483c9
fix: fix bag in neglog_pvalue_to_mantissa_and_exponent (#795)
addramir Sep 27, 2024
88f62d4
fix(schema): recursive validation of arbitrarily deep nested structur…
DSuveges Sep 30, 2024
8b253a5
fix: adding data specific p-value filters (#788)
addramir Sep 30, 2024
5c58e58
feat: change `StudyLocusId` hashing method to md5 (and change `StudyL…
vivienho Sep 30, 2024
b7ccfae
feat: drop `v2g` and reimplement distance features (#771)
ireneisdoomed Oct 1, 2024
c3b8c2c
feat: out sample LD qc reason (#798)
d0choa Oct 1, 2024
a5588ae
chore: drop redundant parameter (#802)
project-defiant Oct 1, 2024
d4b5070
fix: align the schema of study_index for ukb ppp eur (#803)
project-defiant Oct 1, 2024
ccb484e
feat: force reinstallation of the gentropy on the cluster (#804)
project-defiant Oct 1, 2024
1c396d2
feat(validation): adding credible set confidence annotation at valida…
DSuveges Oct 1, 2024
9177dd4
chore: make the lb clumping ingest the partitionned data (#806)
project-defiant Oct 2, 2024
174f8f9
feat: adding filtering to susie finemapper (#796)
addramir Oct 2, 2024
25a4820
feat: add sumstat QC fields to schema (#809)
addramir Oct 2, 2024
70fd593
feat: fix biosample study validation (#810)
Tobi1kenobi Oct 3, 2024
c286c3b
refactor(vep_parser): store consequence to impact score as a project …
ireneisdoomed Oct 3, 2024
8876fc1
feat(l2g): implement variant consequence features from VEP (#805)
ireneisdoomed Oct 3, 2024
fca55be
feat: optimisation of qc step (#813)
addramir Oct 3, 2024
68c0168
fix(validation): add `qualityControls` column if missing in StudyLocu…
project-defiant Oct 4, 2024
b325eaa
fix: fix of type error in schema checking (#817)
addramir Oct 4, 2024
99b1d3c
fix: adding studId to FM log (#816)
addramir Oct 4, 2024
8b29168
fix: biosample id duplication (#822)
Tobi1kenobi Oct 7, 2024
27d8206
feat: change LD annotation for PICS fine-mapping to use major ancestr…
vivienho Oct 8, 2024
f3490d3
build(deps-dev): bump ipython from 8.27.0 to 8.28.0 (#819)
dependabot[bot] Oct 8, 2024
60f6bfa
build(deps-dev): bump pre-commit from 3.8.0 to 4.0.0 (#820)
dependabot[bot] Oct 9, 2024
b7dce8f
feat: decouple feature generation from L2G training step (#823)
ireneisdoomed Oct 9, 2024
9f0111c
fix: mhc flag incorrect (#825)
d0choa Oct 9, 2024
31e217b
revert: no longer rescuing variants not in LD matrix when overlapping…
d0choa Oct 10, 2024
58333c0
fix: empty inSilicoPredictors object in GnomAD variant index (#807)
DSuveges Oct 10, 2024
c7c602a
fix: write mode added to validation steps (#826)
d0choa Oct 11, 2024
e3d32ba
feat(l2g): merge sQTL and tuQTL colocalisation features (#824)
ireneisdoomed Oct 11, 2024
fb6111d
fix: adding single point statistics to pics loci (#832)
DSuveges Oct 11, 2024
9f446e8
fix(find_overlap): missing right study type in output (#828)
DSuveges Oct 11, 2024
c4520aa
feat: enhance variant index partitioning (#834)
d0choa Oct 12, 2024
e77abf4
feat: adding new LD interface (#759)
addramir Oct 14, 2024
38d4cb5
ci: configure java v8 (#840)
ireneisdoomed Oct 14, 2024
d461e38
fix(trainer): drop `studyLocusId` from training sets (#837)
ireneisdoomed Oct 14, 2024
6817aad
fix(l2g): remove custom session params + other fixes (#841)
ireneisdoomed Oct 14, 2024
44f05c4
build(deps-dev): bump pymdown-extensions from 10.10.1 to 10.11.2 (#815)
dependabot[bot] Oct 14, 2024
7fa85b2
build(deps-dev): bump mkdocs-git-committers-plugin-2 from 2.3.0 to 2.…
dependabot[bot] Oct 14, 2024
c50f660
build(deps-dev): bump pyparsing from 3.1.2 to 3.2.0 (#836)
dependabot[bot] Oct 14, 2024
6da92ce
build(deps-dev): bump mkdocstrings-python from 1.11.1 to 1.12.1 (#842)
dependabot[bot] Oct 14, 2024
e34e0c9
chore: adding priors to coloc step (#830)
addramir Oct 15, 2024
0a6d57b
fix: fix ukbppp studindex (#839)
addramir Oct 15, 2024
84a7a0d
fix: l2g fixes (#844)
d0choa Oct 15, 2024
172cedf
chore: remove h4/h3 ratio (#829)
addramir Oct 15, 2024
97a8873
fix: updating the susie_finemapper init (#846)
Daniel-Considine Oct 16, 2024
6a059d0
feat(coloc): step refactoring (#845)
project-defiant Oct 17, 2024
9bbbc02
test: skip `fetch_coordinates_from_rsids` (#850)
ireneisdoomed Oct 17, 2024
5cbf5ed
fix(eqtl): deduplicating credible set loci (#849)
DSuveges Oct 17, 2024
3ef43a9
chore: adding logging even when no CS in locus (#848)
addramir Oct 17, 2024
d33f66a
feat(l2g): limit colocalisation neighbourhood to protein coding genes…
ireneisdoomed Oct 17, 2024
9c52397
fix: filter nan in CSs (#855)
addramir Oct 17, 2024
c68a144
revert(finngen): restore the studyId prefix in finngen cs and si (#856)
project-defiant Oct 17, 2024
d650a29
feat(susie_finemapper): allow for extraction of the log file from man…
project-defiant Oct 18, 2024
333facb
chore(coloc): changing the content of `numberColocalisingVariants` fi…
DSuveges Oct 18, 2024
f93a9d3
fix: susie credible sets with unknown confidence (#862)
d0choa Oct 18, 2024
40a582c
fix: adding beta for lead variant (#863)
addramir Oct 21, 2024
13c2040
fix: biosample index add efo cell types (#853)
Tobi1kenobi Oct 21, 2024
7b20d55
feat(l2g): extend colocalisation neighbourhood metrics to missing gen…
ireneisdoomed Oct 21, 2024
de4627f
fix: add scQTLs into coloc features (#833)
addramir Oct 22, 2024
782a458
chore: pre-commit autoupdate (#866)
pre-commit-ci[bot] Oct 22, 2024
df220e9
feat: gwas catalog top-hit + study step (#808)
d0choa Oct 22, 2024
a78bae3
build(deps-dev): bump mypy from 1.11.0 to 1.12.1 (#865)
dependabot[bot] Oct 22, 2024
34a4e61
feat: adding desision tree to fine-mapper (#860)
addramir Oct 22, 2024
c5adb75
build(deps-dev): bump ruff from 0.6.1 to 0.7.0 (#864)
dependabot[bot] Oct 22, 2024
6e6687b
fix: logging of finemamper (#870)
addramir Oct 23, 2024
52da70f
chore: add chromosome validation (#869)
addramir Oct 23, 2024
9fe77ca
feat: add gene count features to l2g (#852)
xyg123 Oct 24, 2024
e233dac
feat: change betas to posterior mean from susie for Finngen credible …
Daniel-Considine Oct 24, 2024
d4b91d6
feat: step to export disease/target evidence (#867)
DSuveges Oct 24, 2024
c252dcb
feat(variant_index): hash variants at the time of instance creation (…
ireneisdoomed Oct 24, 2024
b694d80
fix: fix in calculate_credible_set_log10bf (#868)
addramir Oct 24, 2024
4c1013e
feat(l2g_feature_matrix): add `credibleSetConfidence` to L2G (#875)
ireneisdoomed Oct 25, 2024
3e61996
feat(l2g): normalise distance features (#878)
ireneisdoomed Oct 25, 2024
ee96c11
feat: l2g feature to indicate if gene is protein-coding or not (#873)
xyg123 Oct 25, 2024
85be796
feat: making credset qc have an option to coalsce and deduplicate cre…
Daniel-Considine Oct 25, 2024
5d9d7cb
test(study_locus) credible_set_qc step integration test (#880)
project-defiant Oct 25, 2024
cbbf3c5
feat: flagging duplicated entries while keeping one of the duplicates…
DSuveges Oct 26, 2024
d12d65d
feat: flag and filter credible sets (#879)
Tobi1kenobi Oct 28, 2024
1596c4c
build(deps-dev): bump ipython from 8.28.0 to 8.29.0 (#883)
dependabot[bot] Oct 28, 2024
31d8716
build(deps-dev): bump mypy from 1.12.1 to 1.13.0 (#884)
dependabot[bot] Oct 28, 2024
759857e
feat(trainer): log model explanation with shap (#886)
ireneisdoomed Oct 31, 2024
7bb74a5
chore: pre-commit autoupdate (#885)
pre-commit-ci[bot] Oct 31, 2024
fa38ca6
fix(distance_features): correct mean distance equation and correct ro…
ireneisdoomed Oct 31, 2024
b812f67
feat: add step to generate association data (#888)
vivienho Nov 1, 2024
c599856
fix: revert distinct for associations input file (#871)
vivienho Nov 1, 2024
9295d58
feat: add effect size direction to coloc output (#854)
Tobi1kenobi Nov 1, 2024
9237d73
refactor(convert to vcf): allow multiple input sources (#891)
project-defiant Nov 4, 2024
e0304fc
feat: deconvolute studies upon ingestion of GWAS Catalog datasets (#…
DSuveges Nov 4, 2024
3639b23
fix(`credibleSetConfidence`): inner join between study locus and vari…
ireneisdoomed Nov 4, 2024
04b1e22
feat(feature_matrix): impute values for gene attribute cols (#895)
ireneisdoomed Nov 5, 2024
94abc79
feat: adding l2g features to prediction table (#899)
DSuveges Nov 5, 2024
4d8e7c4
fix: ensure the #CHROM is not quoted (#896)
project-defiant Nov 5, 2024
2af1074
feat(feature_matrix): extract features for gwas associations only (#901)
ireneisdoomed Nov 5, 2024
6ec0d45
fix: do not impute `isProteinCoding` (#902)
addramir Nov 5, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
32 changes: 30 additions & 2 deletions .github/workflows/artifact.yml
Original file line number Diff line number Diff line change
Expand Up @@ -2,14 +2,15 @@ name: Build and Push to Artifact Registry

"on":
push:
branches: ["dev"]
tags: ["v*"]
branches: ["*"]
tags: ["*"]

env:
PROJECT_ID: open-targets-genetics-dev
REGION: europe-west1
GAR_LOCATION: europe-west1-docker.pkg.dev/open-targets-genetics-dev
REPOSITORY: gentropy-app
PYTHON_VERSION_DEFAULT: "3.10.8"

jobs:
build-push-artifact:
Expand Down Expand Up @@ -39,7 +40,18 @@ jobs:
run: |-
gcloud auth configure-docker ${{ env.REGION }}-docker.pkg.dev --quiet

- name: Quick Docker build (gentropy only, AMD64 only, with layer cache)
uses: docker/build-push-action@v6
with:
platforms: linux/amd64
push: true
tags: "${{ env.GAR_LOCATION }}/${{ env.REPOSITORY }}/gentropy:${{ github.ref_name }}"
context: .
cache-from: type=gha
cache-to: type=gha,mode=max

- name: Build and push gentropy image
if: github.ref == 'refs/heads/dev' || startsWith(github.ref, 'refs/tags/v')
uses: docker/build-push-action@v6
with:
platforms: linux/amd64,linux/arm64
Expand All @@ -48,10 +60,26 @@ jobs:
context: .

- name: Build and push VEP image
if: github.ref == 'refs/heads/dev' || startsWith(github.ref, 'refs/tags/v')
uses: docker/build-push-action@v6
with:
platforms: linux/amd64
push: true
tags: "${{ env.GAR_LOCATION }}/${{ env.REPOSITORY }}/custom_ensembl_vep:${{ github.ref_name }}"
context: .
file: "src/vep/Dockerfile"

- name: Set up Python
uses: actions/setup-python@v4
with:
python-version: ${{ env.PYTHON_VERSION_DEFAULT }}
- name: Install and configure Poetry
uses: snok/install-poetry@v1
with:
virtualenvs-create: true
virtualenvs-in-project: true
installer-parallel: true

- name: Build and push spark cluster dependencies
run: |
make build
5 changes: 5 additions & 0 deletions .github/workflows/pr.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,11 @@ jobs:
uses: actions/setup-python@v5
with:
python-version: 3.10.8
- name: Set up Java
uses: actions/setup-java@v4
with:
java-version: "8"
distribution: "temurin"
- name: Install and configure Poetry
uses: snok/install-poetry@v1
with:
Expand Down
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -13,3 +13,4 @@ site/
.env
.coverage*
wandb/
hail*.log
10 changes: 5 additions & 5 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ ci:
skip: [poetry-lock]
repos:
- repo: https://github.com/astral-sh/ruff-pre-commit
rev: v0.5.2
rev: v0.7.1
hooks:
- id: ruff
args:
Expand All @@ -15,7 +15,7 @@ repos:
files: ^((gentropy|utils|tests)/.+)?[^/]+\.py$

- repo: https://github.com/pre-commit/pre-commit-hooks
rev: v4.6.0
rev: v5.0.0
hooks:
- id: trailing-whitespace
- id: end-of-file-fixer
Expand Down Expand Up @@ -58,14 +58,14 @@ repos:
exclude: "CHANGELOG.md"

- repo: https://github.com/alessandrojcm/commitlint-pre-commit-hook
rev: v9.16.0
rev: v9.18.0
hooks:
- id: commitlint
additional_dependencies: ["@commitlint/[email protected]"]
stages: [commit-msg]

- repo: https://github.com/pre-commit/mirrors-mypy
rev: "v1.10.1"
rev: "v1.13.0"
hooks:
- id: mypy
args:
Expand Down Expand Up @@ -98,7 +98,7 @@ repos:
- id: beautysh

- repo: https://github.com/jsh9/pydoclint
rev: 0.5.4
rev: 0.5.9
hooks:
- id: pydoclint

Expand Down
37 changes: 18 additions & 19 deletions Makefile
Original file line number Diff line number Diff line change
@@ -1,10 +1,10 @@
PROJECT_ID ?= open-targets-genetics-dev
REGION ?= europe-west1
APP_NAME ?= $$(cat pyproject.toml| grep -m 1 "name" | cut -d" " -f3 | sed 's/"//g')
VERSION_NO ?= $$(poetry version --short)
CLEAN_VERSION_NO := $(shell echo "$(VERSION_NO)" | tr -cd '[:alnum:]')
BUCKET_NAME=gs://genetics_etl_python_playground/initialisation/${VERSION_NO}/
BUCKET_COMPOSER_DAGS=gs://europe-west1-ot-workflows-fe147745-bucket/dags/
APP_NAME ?= $$(cat pyproject.toml | grep -m 1 "name" | cut -d" " -f3 | sed 's/"//g')
REF ?= $$(git rev-parse --abbrev-ref HEAD)
PACKAGE_VERSION ?= $$(poetry version --short)
CLEAN_PACKAGE_VERSION := $(shell echo "$(PACKAGE_VERSION)" | tr -cd '[:alnum:]')
BUCKET_NAME=gs://genetics_etl_python_playground/initialisation/${APP_NAME}/${REF}

.PHONY: $(shell sed -n -e '/^$$/ { n ; /^[^ .\#][^ ]*:/ { s/:.*$$// ; p ; } ; }' $(MAKEFILE_LIST))

Expand Down Expand Up @@ -38,34 +38,33 @@ build-documentation: ## Create local server with documentation
create-dev-cluster: build ## Spin up a simple dataproc cluster with all dependencies for development purposes
@echo "Creating Dataproc Dev Cluster"
@gcloud config set project ${PROJECT_ID}
@gcloud dataproc clusters create "ot-genetics-dev-${CLEAN_VERSION_NO}" \
@gcloud dataproc clusters create "ot-genetics-dev-${CLEAN_PACKAGE_VERSION}-$(USER)" \
--image-version 2.1 \
--region ${REGION} \
--master-machine-type n1-standard-16 \
--initialization-actions=gs://genetics_etl_python_playground/initialisation/${VERSION_NO}/install_dependencies_on_cluster.sh \
--metadata="PACKAGE=gs://genetics_etl_python_playground/initialisation/${VERSION_NO}/gentropy-${VERSION_NO}-py3-none-any.whl,CONFIGTAR=gs://genetics_etl_python_playground/initialisation/${VERSION_NO}/config.tar.gz" \
--primary-worker-type n1-standard-8 \
--initialization-actions=$(BUCKET_NAME)/install_dependencies_on_cluster.sh \
--metadata="PACKAGE=$(BUCKET_NAME)/${APP_NAME}-${PACKAGE_VERSION}-py3-none-any.whl" \
--secondary-worker-type spot \
--worker-machine-type n1-standard-4 \
--worker-boot-disk-size 500 \
--autoscaling_policy=f"projects/${PROJECT_ID}/regions/${REGION}/autoscalingPolicies/eqtl-preprocess", \
--autoscaling-policy="projects/${PROJECT_ID}/regions/${REGION}/autoscalingPolicies/otg-etl" \
--optional-components=JUPYTER \
--enable-component-gateway
--enable-component-gateway \
--max-idle=60m

make update-dev-cluster: build ## Reinstalls the package on the dev-cluster
@echo "Updating Dataproc Dev Cluster"
@gcloud config set project ${PROJECT_ID}
gcloud dataproc jobs submit pig --cluster="ot-genetics-dev-${CLEAN_VERSION_NO}" \
gcloud dataproc jobs submit pig --cluster="ot-genetics-dev-${CLEAN_PACKAGE_VERSION}" \
--region ${REGION} \
--jars=${BUCKET_NAME}/install_dependencies_on_cluster.sh \
-e='sh chmod 750 $${PWD}/install_dependencies_on_cluster.sh; sh $${PWD}/install_dependencies_on_cluster.sh'

build: clean ## Build Python package with dependencies
@gcloud config set project ${PROJECT_ID}
@echo "Packaging Code and Dependencies for ${APP_NAME}-${VERSION_NO}"
@echo "Packaging Code and Dependencies for ${APP_NAME}-${PACKAGE_VERSION}"
@poetry build
@tar -czf dist/config.tar.gz config/
@echo "Uploading to Dataproc"
@gsutil cp src/gentropy/cli.py ${BUCKET_NAME}
@gsutil cp ./dist/${APP_NAME}-${VERSION_NO}-py3-none-any.whl ${BUCKET_NAME}
@gsutil cp ./dist/config.tar.gz ${BUCKET_NAME}
@gsutil cp ./utils/install_dependencies_on_cluster.sh ${BUCKET_NAME}
@echo "Uploading to ${BUCKET_NAME}"
@gsutil cp src/${APP_NAME}/cli.py ${BUCKET_NAME}/
@gsutil cp ./dist/${APP_NAME}-${PACKAGE_VERSION}-py3-none-any.whl ${BUCKET_NAME}/
@gsutil cp ./utils/install_dependencies_on_cluster.sh ${BUCKET_NAME}/
3 changes: 0 additions & 3 deletions config/__init__.py

This file was deleted.

83 changes: 0 additions & 83 deletions config/datasets/ot_gcp.yaml

This file was deleted.

5 changes: 0 additions & 5 deletions config/ot_config.yaml

This file was deleted.

7 changes: 0 additions & 7 deletions config/step/ot_colocalisation_coloc.yaml

This file was deleted.

7 changes: 0 additions & 7 deletions config/step/ot_colocalisation_ecaviar.yaml

This file was deleted.

10 changes: 0 additions & 10 deletions config/step/ot_eqtl_catalogue.yaml

This file was deleted.

7 changes: 0 additions & 7 deletions config/step/ot_finngen_finemapping_ingestion.yaml

This file was deleted.

5 changes: 0 additions & 5 deletions config/step/ot_gene_index.yaml

This file was deleted.

12 changes: 0 additions & 12 deletions config/step/ot_gwas_catalog_ingestion.yaml

This file was deleted.

8 changes: 0 additions & 8 deletions config/step/ot_gwas_catalog_study_curation.yaml

This file was deleted.

12 changes: 0 additions & 12 deletions config/step/ot_gwas_catalog_study_inclusion.yaml

This file was deleted.

7 changes: 0 additions & 7 deletions config/step/ot_ld_based_clumping.yaml

This file was deleted.

20 changes: 0 additions & 20 deletions config/step/ot_ld_index.yaml

This file was deleted.

11 changes: 0 additions & 11 deletions config/step/ot_locus_to_gene_predict.yaml

This file was deleted.

Loading
Loading