Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[major] IceProd 3.0 #359

Merged
merged 162 commits into from
Oct 1, 2024
Merged

[major] IceProd 3.0 #359

merged 162 commits into from
Oct 1, 2024

Conversation

dsschult
Copy link
Collaborator

@dsschult dsschult commented Aug 2, 2023

This release removes support for IceProd pilots, and instead relies on HTCondor file transfer and running the module code directly in the HTCondor job. As a consequence, we are able to fully support running in containers via HTCondor.

Breaking changes

  • Data transfer now happens at the task level. Other file transfer is deprecated, and will be merged with the task.
  • Dataset config removals:
    • Module running_class support has been removed. Use the src attribute instead.
    • Data compression support has been removed.
    • Resources have been removed - use data in permanent input mode.
    • Steering/system has been removed.
  • The debugging pilot has limited support, and all data transfer has been removed.
  • The dataset truncated status has been moved to an attribute.
  • Tasks start at idle instead of waiting.
  • Task status waiting is now "ready to queue" and status queued is "on the HTCondor queue" with processing actually being processing in HTCondor.
  • The task reset status has been removed, and tasks will now directly go to idle.

Core Checklist

  • config as json with validation
  • render task as shell script of modules
  • get list of input/output files for a task
  • file execute parameter = maybe => transfer=???

Server Checklist

  • move queued and processing task actions "up" one level to fix queued and processing states
  • remove truncated dataset state in favor of a separate flag
  • add proper state machine for tasks, jobs, datasets
  • update website with new states
  • update rest API with new states
  • update scheduled tasks with new states
  • modify condor submit to use new task shell script as the executable, and condor file transfer as much as possible
  • make condor file transfer plugin for gridftp/http?
  • update condor plugin to get task resources from condor log

Tests

  • new core
  • REST routes
  • website handlers
  • server / batch job submission

@dsschult dsschult self-assigned this Aug 2, 2023
@dsschult dsschult linked an issue Sep 21, 2023 that may be closed by this pull request
@dsschult dsschult marked this pull request as ready for review October 1, 2024 17:37
@dsschult dsschult merged commit 3c3681e into master Oct 1, 2024
12 checks passed
@dsschult dsschult deleted the singularity branch October 1, 2024 17:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

fix deprecations
1 participant