Skip loading of the model during conversion to save on RAM usage #83
Helps address #69 somewhat, by preventing coremltools from loading the converted model and compiling it post-conversion.
I don't believe exporters does anything with either the compiled or the loaded model (other than using the spec), so it is probably fine to default to skipping the load. I've also seen a recent example published this month in swift-transformers that also skipped this step: sample code here.
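For context, this is the coremltools option in question, shown as a minimal standalone sketch (the toy module below is just an illustration; in exporters the traced Hugging Face model takes its place):

```python
import torch
import coremltools as ct

# Toy stand-in model; in exporters this would be the traced transformer.
class Tiny(torch.nn.Module):
    def forward(self, x):
        return x * 2.0

traced = torch.jit.trace(Tiny().eval(), torch.rand(1, 4))

# skip_model_load=True prevents coremltools from compiling and loading
# the converted model through the Core ML framework after conversion,
# which is where the extra RAM gets consumed. The protobuf spec is still
# fully populated, so inspecting and saving the model keep working.
mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(name="x", shape=(1, 4))],
    convert_to="mlprogram",
    skip_model_load=True,
)

mlmodel.save("Tiny.mlpackage")  # saving does not require a loaded model
```

The trade-off is that predict() is unavailable on the returned model until it is re-loaded, which, per the above, exporters doesn't rely on during conversion.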
However, if desired, I also considered adding a new argument that would let users control this themselves, in case we don't want to force this default behavior on everyone (see the sketch below).
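If we go that route, it could be as simple as a pass-through flag on the CLI. A hypothetical sketch (the flag name and wiring are my assumptions, not the current exporters API):

```python
import argparse

# Hypothetical flag; exporters does not expose this today.
parser = argparse.ArgumentParser()
parser.add_argument(
    "--load_converted_model",
    action="store_true",
    help="Compile and load the Core ML model after conversion "
         "(uses significantly more RAM; off by default).",
)
args = parser.parse_args()

# The inverse would then be threaded through to coremltools:
#   ct.convert(..., skip_model_load=not args.load_converted_model)
```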
I can also add a section on this specific error to the troubleshooting docs if that would be helpful.
Test setup:
M3 Max MacBook Pro with 36 GB of RAM
Converting model: Undi95/MythoMax-L2-Kimiko-v2-13b (~26 GB)
Current behavior w/o this fix:
Getting a
zsh: killed python3 -m exporters.coreml ...
error due to running out of memory (see the issue for more details).

Behavior with this fix:
Able to successfully convert the provided model and other models of similar size (although models much larger than my system's RAM will still fail). This at least makes the tool more accessible for models that should fit within your available RAM.