Handle SIGTERM gracefully #48

zerebubuth · 2016-11-08T16:40:27Z

Upon receiving SIGTERM, tileserver should:

Start responding to health check requests with an error.
Wait a configurable grace period, or until all outstanding requests have finished.
Shut down.

This allows tileserver to work with ELB connection draining or HAProxy to terminate while not dropping any requests. If this is done along with staggering shutdowns / upgrades so that only part of the cluster is down at any one time, then no requests are lost.

rmarianski · 2016-11-08T16:53:45Z

This is a good idea. On the one hand we can avoid this by rolling in new instances, but on the other, it's much easier to just run a deploy command in opsworks.

It might be worth considering pushing the scope of this problem outside into an opsworks tools that rolls in the deploy for us. That way it's solved for any service in an opsworks layer. Or, maybe there's a way to not require to roll in deploys, but still handle this mostly outside the actual process. I wonder if we can unregister the instance from the elb, wait until it's unregistered, and then re-register it once it's restarted. I'm assuming the wait step here handles the connection draining for us, and that opsworks wouldn't fight us and try to re-register the instance because it's still in the layer in the interim.

zerebubuth · 2016-11-08T17:04:51Z

I think both mechanisms would be good to have.

Rolling the deploy requires outside tooling, which is great for anything which is compatible with that. But I wouldn't be confident that it covers 100% of all cases that the service could be stopped. Handling SIGTERM internally is then a safety net in those (hopefully rare) cases that tileserver is stopped outside of a rolling deploy.

rmarianski · 2016-11-08T17:07:03Z

From @iandees, http://docs.aws.amazon.com/opsworks/latest/userguide/best-deploy.html#best-deploy-rolling

rmarianski · 2016-11-08T17:28:29Z

But I wouldn't be confident that it covers 100% of all cases that the service could be stopped.

Just curious, what kind of cases would this be?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle SIGTERM gracefully #48

Handle SIGTERM gracefully #48

zerebubuth commented Nov 8, 2016

rmarianski commented Nov 8, 2016

zerebubuth commented Nov 8, 2016

rmarianski commented Nov 8, 2016

rmarianski commented Nov 8, 2016

Handle SIGTERM gracefully #48

Handle SIGTERM gracefully #48

Comments

zerebubuth commented Nov 8, 2016

rmarianski commented Nov 8, 2016

zerebubuth commented Nov 8, 2016

rmarianski commented Nov 8, 2016

rmarianski commented Nov 8, 2016