From ba5104e527c8fedc74408be10d1326d953f79970 Mon Sep 17 00:00:00 2001 From: Alex Richey Date: Wed, 30 Oct 2024 16:47:08 -0400 Subject: [PATCH 1/3] Update README.md --- README.md | 10 +++------- 1 file changed, 3 insertions(+), 7 deletions(-) diff --git a/README.md b/README.md index 3333280..55ac36e 100644 --- a/README.md +++ b/README.md @@ -4,7 +4,7 @@ Welcome to the Data Engineering code interview! This small data challenge is designed to test out your skills in python, sql, git, and geospatial data processing. The challenge will go from easy to difficult, there's no preassure to finish all the tasks, so try your best and get as far as you can! -To start this challenge, create a new **private** repo under your github username. We would like you to include all the code, notes, visualizations, and data inside of the repo. You will have **48 hours** to complete this data challenge. Once you are done, please provide read access to your repo by inviting `@mbh329`, `@td928` and `@AmandaDoyle` +To start this challenge, create a new **private** repo under your github username. We would like you to include all the code, notes, visualizations, and data inside of the repo. You will have **48 hours** to complete this data challenge. Once you are done, please provide read access to your repo by inviting `@alexrichey`, `@sf-dcp` and `@AmandaDoyle` > ⚠️ Note: **the repo has to be ``private``, otherwise you will be automatically ``disqualified``**. Also we will check your commit timestamp to only account for the first 48 hours of coding activities. @@ -27,7 +27,6 @@ Your code interview will be evaluated based on your repo, so make sure all files - **Project Management**: - We want to see how you manage a multi-part project and how you break down the tasks. - Feel free to open up issues for yourself / make pull requests and etc so that your code progress is captured and documented. - - We highly **discourage** lumpped commits. ## Table of Content @@ -86,14 +85,11 @@ A lot of popular databases have geospatial extensions, which makes spatial data - Perform the same aggregation in **Task 4** and store the result in a table. - **Bonus**: export the table with NTA geometry and complaint count into a shapefile under the `data` folder. -> Note: At this point you might notice that spatial software is not as straight forward as a simple `pip install`. If you are stuck with database installation or pacakge installation, you might consider adopting **[docker](https://www.docker.com/)**. Docker has a steep learning curve, so don't waste too much time on it. +> Note: At this point you might notice that spatial software is not as straight forward as a simple `pip install`. If you are stuck with database installation or pacakge installation, you might consider adopting **[docker](https://www.docker.com/)**. ## Resources -- Reach out to Te (TDu @ planning.nyc.gov) if you have any questions. We love people who ask questions. +- Reach out to Alex (arichey@planning.nyc.gov) if you have any questions. We love people who ask questions. - [PostgreSQL Installation Guide](https://www.postgresql.org/download/) - [Postgis Docker image](https://registry.hub.docker.com/r/postgis/postgis/) - [Postgis Installation Guide](https://postgis.net/workshops/postgis-intro/installation.html) -- [DigitalOcean Managed Database](https://www.digitalocean.com/products/managed-databases/) - -> DigitalOcean is great if you have a lot of trouble with installation, and it offers 100$ of free credit From dd7e2327e436db587e562332f51a6611f864ae6c Mon Sep 17 00:00:00 2001 From: Alex Richey Date: Thu, 31 Oct 2024 14:18:51 -0400 Subject: [PATCH 2/3] Update README.md --- README.md | 6 ++---- 1 file changed, 2 insertions(+), 4 deletions(-) diff --git a/README.md b/README.md index 55ac36e..e9725cd 100644 --- a/README.md +++ b/README.md @@ -6,7 +6,7 @@ Welcome to the Data Engineering code interview! This small data challenge is des To start this challenge, create a new **private** repo under your github username. We would like you to include all the code, notes, visualizations, and data inside of the repo. You will have **48 hours** to complete this data challenge. Once you are done, please provide read access to your repo by inviting `@alexrichey`, `@sf-dcp` and `@AmandaDoyle` -> ⚠️ Note: **the repo has to be ``private``, otherwise you will be automatically ``disqualified``**. Also we will check your commit timestamp to only account for the first 48 hours of coding activities. +> ⚠️ Note: **the repo has to be private, otherwise you will be automatically disqualified**. ## What we are looking for @@ -85,11 +85,9 @@ A lot of popular databases have geospatial extensions, which makes spatial data - Perform the same aggregation in **Task 4** and store the result in a table. - **Bonus**: export the table with NTA geometry and complaint count into a shapefile under the `data` folder. -> Note: At this point you might notice that spatial software is not as straight forward as a simple `pip install`. If you are stuck with database installation or pacakge installation, you might consider adopting **[docker](https://www.docker.com/)**. - ## Resources -- Reach out to Alex (arichey@planning.nyc.gov) if you have any questions. We love people who ask questions. +- Reach out to Alex (arichey@planning.nyc.gov) if you have any questions. - [PostgreSQL Installation Guide](https://www.postgresql.org/download/) - [Postgis Docker image](https://registry.hub.docker.com/r/postgis/postgis/) - [Postgis Installation Guide](https://postgis.net/workshops/postgis-intro/installation.html) From 30cab1978381a2164504aac01d277dcec9cf3059 Mon Sep 17 00:00:00 2001 From: Alex Richey Date: Thu, 31 Oct 2024 14:20:02 -0400 Subject: [PATCH 3/3] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index e9725cd..f750684 100644 --- a/README.md +++ b/README.md @@ -14,7 +14,7 @@ Your code interview will be evaluated based on your repo, so make sure all files - **Project scafolding**: How you name, manage, and organize your files. - **Reproducibility**: - - Ideally if it runs on your machine, it would also run on mine. + - Ideally if it runs on your machine, it would also run on mine. We would recommend that you use Docker. - Make sure you document any software dependency, and installation process. - **Code**: - Clean