You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Our main focus would be to make it descriptive enough to not cause any major outages after the migration. Also, add some additional scripts to make it more automated i.e. deployments based on namespaces, automatic configuration of fluetnd, e.t.c.
I am adding all the points that we need to focus on/include in the documentation, so that we don't miss anything.
Add more endpoint checks for the services. Some new ones that I have discovered are for das-server, dbs, and rucio monitor.
Include about nginx settings in the rolling upgrade document.
Create a separate directory for storing secrets for individual cluster. The .pem files can be encrypted (the procedure that was already being followed for DBS cluster).
Improve the procedure for stress testing the cluster.
Remove IT services that are not being used. One particular example is the fluentd service that was causing major issues with the nodes.
I will add more points to this.
The text was updated successfully, but these errors were encountered:
Our main focus would be to make it descriptive enough to not cause any major outages after the migration. Also, add some additional scripts to make it more automated i.e. deployments based on namespaces, automatic configuration of fluetnd, e.t.c.
I am adding all the points that we need to focus on/include in the documentation, so that we don't miss anything.
I will add more points to this.
The text was updated successfully, but these errors were encountered: