Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there any early proposal or document about integrating with Gateway API ? #165

Open
caozhuozi opened this issue Sep 15, 2024 · 2 comments
Labels
feature Categorizes issue or PR as related to a new feature. needs-priority Indicates a PR lacks a label and requires one. needs-triage Indicates an issue or PR lacks a label and requires one.

Comments

@caozhuozi
Copy link

caozhuozi commented Sep 15, 2024

I came across the roadmap and am particularly interested in the Gateway API section. Will Llamz support advanced traffic management features, such as shadow and canary deployments between different model services? If so, could you share how you plan to implement this?

Thanks in advance!

@InftyAI-Agent InftyAI-Agent added needs-triage Indicates an issue or PR lacks a label and requires one. needs-kind Indicates a PR lacks a label and requires one. needs-priority Indicates a PR lacks a label and requires one. labels Sep 15, 2024
@kerthcet
Copy link
Member

shadow and canary deployments between different model services

Thanks for you concern. The TL;DR is Yes, but no idea yet

I think it's a vital feature for production. Gateway API here means a bunch of things, like token/lora/model related service, canary deployments can also be part of them (maybe later we'll sort them clearly). And what llmaz usually does is we'll have a minimal implementation for out-of-box support, but we'll also provide project integrations considering people usually have lots of projects in their cluster, we don't want to increase the maintenance burden for them. Regarding to canary deployments, there maybe argo workflow, istio, so they're all in plan I think.

About the minimal implementation, I haven't thought too much about that, and we have a bunch of higher priority tasks on hand.

/kind feature

@InftyAI-Agent InftyAI-Agent added feature Categorizes issue or PR as related to a new feature. and removed needs-kind Indicates a PR lacks a label and requires one. labels Sep 18, 2024
@caozhuozi
Copy link
Author

Hi @kerthcet! Really thanks for your great pacience and detailed replay! ❤

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature Categorizes issue or PR as related to a new feature. needs-priority Indicates a PR lacks a label and requires one. needs-triage Indicates an issue or PR lacks a label and requires one.
Projects
None yet
Development

No branches or pull requests

3 participants