Feature request: ECS graceful shutdown/instance draining #54

itsdalmo · 2018-09-27T13:29:26Z

When running ECS clusters on AWS, a zero-downtime rolling update of the underlying VM's can only be done gracefully by using lifecycle hooks and calling UpdateContainerInstancesState to set the instance state to DRAINING, and then waiting for it to have zero running tasks before completing the lifecycle action.

This pattern is shown here using a Lambda:

I'm wondering if perhaps this is something that could be handled by lifecycled instead. As always there are some pros and cons for users by doing it this way instead of using a Lambda...

Pros

Single way of handling scale-in for regular instances and ECS clusters.
Draining can take longer than 5min (max runtime for Lambda) without needing recursive invocations of the Lambda (or step functions).

Cons

The instance would need additional permissions, which would give any running tasks the same permissions unless users were careful to disallow it.
Instances would fail to launch if lifecycled was installed from the Github releases and GH was down. So it makes the ECS instances more "brittle".

If this is something you think belongs in Lifecycled (and it seems like a good practice), I think we could add a new flag --ecs-cluster and implement a new handler (ECSHandler?) which would drain the instance before completing the lifecycle hook. We could probably hardcode it to run before the FileHandler (aka the handler script).

What do you think @lox?

The text was updated successfully, but these errors were encountered:

lox · 2018-09-27T23:09:11Z

Yup, I love that idea!

lox · 2018-09-27T23:10:16Z

The other thing that would be neat is hibernation support: https://github.com/aws/ec2-hibernate-linux-agent

itsdalmo mentioned this issue Sep 27, 2018

Feature request: Graceful shutdown of ECS workers telia-oss/terraform-aws-ecs#1

Open

itsdalmo mentioned this issue Dec 12, 2018

Seg fault when executing spot termination script #62

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: ECS graceful shutdown/instance draining #54

Feature request: ECS graceful shutdown/instance draining #54

itsdalmo commented Sep 27, 2018

lox commented Sep 27, 2018

lox commented Sep 27, 2018

Feature request: ECS graceful shutdown/instance draining #54

Feature request: ECS graceful shutdown/instance draining #54

Comments

itsdalmo commented Sep 27, 2018

Pros

Cons

lox commented Sep 27, 2018

lox commented Sep 27, 2018