
add: ggml graph encoding format #66

Merged
merged 1 commit into from
Apr 10, 2024

Conversation

danbev
Contributor

@danbev danbev commented Mar 4, 2024

This commit adds the `ggml` graph encoding format to the `graph_encoding` enum.

The motivation for this is to allow the `wasi-nn` interface to support models encoded in the `ggml` format, which is the model format used by llama.cpp.

Signed-off-by: Daniel Bevenius <[email protected]>
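The change itself amounts to a one-variant addition to the graph-encoding enum in the wasi-nn interface definition. A minimal sketch in WIT, assuming variant names similar to those in the wasi-nn spec (the surrounding variants and doc comments here are illustrative, not a verbatim copy of the spec file):

```wit
/// The graph encoding describes the format of the model bytes
/// passed to `load`.
enum graph-encoding {
    openvino,
    onnx,
    tensorflow,
    pytorch,
    tensorflowlite,
    /// The change proposed in this PR: the ggml format used by llama.cpp.
    ggml,
}
```

A backend implementation would match on this variant when loading model bytes, dispatching `ggml`-encoded models to a llama.cpp-based loader.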
@danbev danbev changed the title add: gguf graph encoding format add: ggml graph encoding format Mar 4, 2024
@abrown
Collaborator

abrown commented Mar 14, 2024

Thanks for the PR! I had been wondering when this new encoding would come up. Can you attend the ML working group on the 18th of this month (details)? It might be helpful for others interested in wasi-nn to understand the motivation here.

@danbev
Contributor Author

danbev commented Mar 15, 2024

@abrown I'd be happy to attend if I can make the time slot 👍

abrown added a commit to abrown/bytecodealliance-meetings that referenced this pull request Mar 15, 2024
ricochet pushed a commit to bytecodealliance/meetings that referenced this pull request Mar 16, 2024
@devigned
Contributor

devigned commented Mar 21, 2024

@danbev, what wasi-nn backend would be used to load and run the ggml encoded models? I might misunderstand, but I don't think any of the backends currently implemented will be able to load these models. Are you suggesting a new backend implementation?

@danbev
Contributor Author

danbev commented Mar 22, 2024

what wasi-nn backend would be used to load and run the ggml encoded models?

WasmEdge has support for a llama.cpp backend. I have a very basic llama.cpp backend working for Wasmtime, and I would very much like to see support for such a backend in Wasmtime in the future. Having this encoding would hopefully help existing implementations and make creating new ones easier.

Are you suggesting a new backend implementation?

Yes, I'd be interested in seeing support for a llama.cpp backend in Wasmtime (and other wasm runtimes).

Collaborator

@abrown abrown left a comment


OK, I think we've had this PR open for an adequate amount of time and discussed it in the ML meetings. Let's merge it!

@abrown abrown merged commit c0840c2 into WebAssembly:main Apr 10, 2024
1 check passed
3 participants