Dataset cloud function preprocessing should handle improper datasets. #70

harrykeightley · 2022-10-07T00:55:23Z

Currently the preprocessing workflow blindly attempts to create processing jobs from the files supplied in a dataset.
This would fail if the user selected improper files, not enough files etc.

Instead, after forming a dataset object from the incoming event, the process_dataset function should check dataset validity before batching and pushing jobs to the pubsub queue.

If the dataset is found to be invalid, we should indicate so on the dataset model in firestore (setting status to error or something).

The text was updated successfully, but these errors were encountered:

harrykeightley assigned harrykeightley and unassigned harrykeightley Oct 7, 2022

harrykeightley added the bug Something isn't working label Oct 7, 2022

harrykeightley added the good first issue Good for newcomers label Oct 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dataset cloud function preprocessing should handle improper datasets. #70

Dataset cloud function preprocessing should handle improper datasets. #70

harrykeightley commented Oct 7, 2022 •

edited

Loading

Dataset cloud function preprocessing should handle improper datasets. #70

Dataset cloud function preprocessing should handle improper datasets. #70

Comments

harrykeightley commented Oct 7, 2022 • edited Loading

harrykeightley commented Oct 7, 2022 •

edited

Loading