
Add support for quantized qwen2-0.5b #44

Merged: 1 commit into mlc-ai:main on Jun 26, 2024
Conversation

bil-ash (Contributor) commented Jun 26, 2024

This PR adds support for quantized (q4f16_1) qwen2-0.5b. Solves issue . PR must be merged before merging this.
@Neet-Nestor

1 commit: to add support for quantized (q4f16) qwen2-0.5b
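
For context, registering a new model in a WebLLM-based app amounts to adding a model record to its app config. A minimal sketch of what such an entry might look like, assuming the ModelRecord shape of recent @mlc-ai/web-llm releases (field names may differ in the version this PR targeted); the Hugging Face URL and model_lib value are placeholders, not the actual PR diff:

```ts
import { prebuiltAppConfig, type AppConfig } from "@mlc-ai/web-llm";

// Sketch only: extends WebLLM's prebuilt config with one extra model record.
// Field names follow recent WebLLM ModelRecord conventions.
const appConfig: AppConfig = {
  ...prebuiltAppConfig,
  model_list: [
    ...prebuiltAppConfig.model_list,
    {
      // Placeholder URL: the real weights repo is whatever the PR points at.
      model: "https://huggingface.co/mlc-ai/Qwen2-0.5B-Instruct-q4f16_1-MLC",
      model_id: "Qwen2-0.5B-Instruct-q4f16_1-MLC",
      // Placeholder name for the compiled WebGPU runtime library.
      model_lib: "Qwen2-0.5B-Instruct-q4f16_1-webgpu.wasm",
    },
  ],
};
```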
Neet-Nestor (Collaborator)

Unfortunately, this won't fix the issue since the model Qwen2-0.5B-Instruct-q4f16-MLC you added is not available in WebLLM yet. Instead, we need to compile the model, upload it to Hugging Face, update WebLLM, and finally update our app.
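
This ordering matters because app-side code can only load a model_id that WebLLM itself already knows about. A minimal sketch, assuming WebLLM's CreateMLCEngine API and the model_id quoted above; a call like this fails with a model-not-found error until the entry ships in WebLLM's prebuilt model list (or is supplied via a custom appConfig):

```ts
import { CreateMLCEngine } from "@mlc-ai/web-llm";

// Fails until "Qwen2-0.5B-Instruct-q4f16-MLC" exists in WebLLM's
// prebuilt model list (or is passed in through a custom appConfig).
const engine = await CreateMLCEngine("Qwen2-0.5B-Instruct-q4f16-MLC", {
  initProgressCallback: (report) => console.log(report.text),
});

// OpenAI-style chat completion against the locally loaded model.
const reply = await engine.chat.completions.create({
  messages: [{ role: "user", content: "Hello from WebLLM!" }],
});
console.log(reply.choices[0]?.message.content);
```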

bil-ash (Contributor, Author) commented Jun 26, 2024

> Unfortunately, this won't fix the issue since the model Qwen2-0.5B-Instruct-q4f16-MLC you added is not available in WebLLM yet. Instead, we need to compile the model, upload it to Hugging Face, update WebLLM, and finally update our app.

I saw that, so I have also created the required PR in the web-llm repo.

Neet-Nestor (Collaborator) commented Jun 26, 2024

Thanks! Let me try.

Neet-Nestor reopened this on Jun 26, 2024
Neet-Nestor (Collaborator)

Thanks! It's working perfectly on my end. I will leave the other 2 PRs for my team to review, but I can merge this one and publish a new version of the webapp so that you can use it immediately.

Neet-Nestor merged commit 0617dbb into mlc-ai:main on Jun 26, 2024
1 check passed