Skip to content

Releases: kubernetes-sigs/lws

LeaderWorkerSet Patch release 0.4.1

18 Oct 20:23
v0.4.1
Compare
Choose a tag to compare

This is a patch release for v0.4.0

Leaderworkerset v0.4.0

14 Sep 00:09
v0.4.0
edc9eac
Compare
Choose a tag to compare

Features:

  • Support specify NetworkConfig
  • Support leader elect for lws controller
  • Add group size as an environment variable
  • Add an example for using llama.cpp to deploy a distributed inference service
  • Update the GPU multi-node inference with vLLM example to serve Llama3.1-405b model

What's Changed

New Contributors

Full Changelog: v0.3.0...v0.4.0

Leaderworkerset v0.3.0

04 Jun 20:42
v0.3.0
f55ce01
Compare
Choose a tag to compare

Features:

  • RollingUpdate with MaxSurge support
  • Subgroup support for disaggregated serving
  • Example for multi-node serving of llama 70B on GPUs with vLLM
  • Add a new start policy API
  • Inject leader address environment variable to every container
  • Spec.rolloutStrategy should be a non-required field

Acknowledgments

Thanks to our contributors in this release, in alphabetic order:
@ahg-g @Edwinhr716 @googs1025 @gujingit @jjk-g @kerthcet @liurupeng @nayihz

Leaderworkerset v0.2.0

19 Apr 18:47
78268be
Compare
Choose a tag to compare

Features:

  • Support RollingUpdate with MaxUnavailable
  • Allow Prometheus to gather metrics gathered by controller-runtime
  • Fix TPU env var assignment when leader pod doesn't request TPU
  • User guide to deploy multi-host inference with Saxml
  • Increase qps limit for pod scheduling
  • Setup E2E test and improve test coverage

Acknowledgments

Thanks to our contributors in this release, in alphabetic order:
@ahg-g @Bslabe123 @Edwinhr716 @googs1025 @kannon92 @kerthcet @liurupeng @nayihz @Zeel-Patel

Leaderworkerset v0.1.0

13 Mar 02:51
v0.1.0
652405d
Compare
Choose a tag to compare

Features:

  • Support creating groups of pods as a unit
  • Support dual-template, one for leader and one for the workers
  • Support autoscaling through HPA
  • Support topology-aware placement
  • Support all-or-nothing restart for failure handling

Acknowledgments

Thanks to our contributors in this release, in no particular order:
@liurupeng @Edwinhr716 @kerthcet @ahg-g