Week 10: Optimizing the BLOOM inference API (invited talk) Lecture: slides Further reading https://github.com/huggingface/text-generation-inference