-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support C5d and M5d instance types #40
Comments
Probably a good strategy is to let |
Would that mean all execs in a run need to fit under the instance storage max for any of them to use instance storage? (Since there's a single disk space configuration per cluster, if I understand correctly.) It might be nice to let some execs use instance storage while others spill over to EBS. Maybe software RAID could help do this spillover seamlessly? (If it is aware of SSD vs. HDD differences, maybe it can handle this too?) Those sound more complicated, though, so something simple would be great to start with. |
I think that's too complicated. The simplest thing would be to do just one or the other. If you require more storage than is available on instance storage, tough luck... |
Having looked through the instance types with storage, I doubt that there's much value in pursuing this. Many of these instance types with storage have much less CPU/Ram compared to others at similar or cheaper cost. So we are probably better off using a cheaper but beefier or equivalent instance type with attached EBS volumes. We are considering supporting dynamic resizing of EBS volumes in a reflowlet instance Its probably worth exploring for bigmachine, particularly for say machine learning type use-cases where we do repeated IO over the same data. |
@swami-m, which instance types are you looking at? M5, C5, and R5 all have M5d, C5d, and R5d. The price differential doesn't seem huge, for example $0.096/hr for M5 vs. $0.113/hr for M5d, which have the same CPU and memory. The ~20% cost increase for local disk could be worthwhile for some workloads, right? I totally understand if this is low priority / not worth the complexity, though. |
Yeah, but an M5d.large comes with 75GB of local storage at 20% higher cost compared to a M5.large. That's just not enough instance storage for it to be worthwhile. Since reflow's instance-type-choosing logic is primarily driven by price (for the CPU/mem requirements), and since instance types with storage are always more expensive, we are not likely to choose these types (unless the cheaper ones are not available) And purely cost-wise, EBS volumes are cheaper than instance storage (in the above example 75GB of EBS costs $0.01/hr). All that being said, I agree that it could be worthwhile in cases where the user constrains reflow to use certain instance types, particularly those with instance storage. (In that case, since the user is already paying for it, might as well use it and not pay more for EBS) |
The new local storage instance types seem like good fits for reflow workers.
The text was updated successfully, but these errors were encountered: