Last month, the DeepSpeed Team announced ZeRO-Infinity, a step forward in training models with tens of trillions of parameters. In addition to creating optimizations for scale, our team strives to introduce features that also improve speed, cost, and usability. As the DeepSpeed optimization library evolves, we are listening to the growing DeepSpeed community to learn […]

DeepSpeed - Microsoft Research

LLM（十二）：DeepSpeed Inference 在LLM 推理上的优化探究- 知乎

Introducing Audio Search by Length in Marketplace - Announcements

LLM（十二）：DeepSpeed Inference 在LLM 推理上的优化探究- 知乎

DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression - Microsoft Research

DeepSpeed: Advancing MoE inference and training to power next-generation AI scale - Microsoft Research

GitHub - microsoft/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

DeepSpeed Inference - Enabling Efficient Inference of Transformer Models at Unprecedented Scale, PDF, Graphics Processing Unit

Microsoft's DeepSpeed enables PyTorch to Train Models with 100-Billion-Parameter at mind-blowing speed, by Arun C Thomas, The Ultimate Engineer

ZeRO & DeepSpeed: New system optimizations enable training models with over 100 billion parameters - Microsoft Research

DeepSpeed: Extreme-scale model training for everyone - Microsoft Research