
Pruned model is same size as original #29

Open
virentakia opened this issue Dec 5, 2023 · 2 comments

@virentakia

virentakia commented Dec 5, 2023

Great work on the project, really excited to see the outcomes.

However, after running the script below, the pruned model (output) is the same size as the original one (6.38 GB).

!python /content/wanda/main.py \
    --model openlm-research/open_llama_3b_v2 \
    --prune_method wanda \
    --sparsity_ratio 0.5 \
    --sparsity_type unstructured \
    --save_model out/pruned \
    --save out/open_llama_3b_v2/unstructured/wanda/

Is this correct, or am I missing something?!

@Eric-mingjie
Collaborator

Yes, this is correct; it is expected behavior for unstructured pruning. To my understanding, unstructured sparsity won't reduce the memory footprint on modern GPU devices: the zeroed weights are still stored as ordinary values in dense tensors.
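A minimal NumPy sketch (a toy matrix, not the Wanda codebase) illustrating the point: zeroing half the entries of a dense array leaves its storage unchanged, and at only 50% sparsity even an explicit index-plus-value sparse layout takes more space than the dense array.

```python
import numpy as np

# Toy "weight matrix" pruned to ~50% unstructured sparsity:
# zeroed entries are still stored as ordinary float32 values.
rng = np.random.default_rng(0)
w = rng.standard_normal((1024, 1024)).astype(np.float32)
mask = rng.random(w.shape) < 0.5          # prune ~half the weights
pruned = np.where(mask, 0.0, w).astype(np.float32)

# Dense storage: identical footprint before and after pruning.
assert pruned.nbytes == w.nbytes

# Storage only changes if you switch to an explicit sparse layout,
# e.g. keeping just the nonzero values and their flat indices.
idx = np.flatnonzero(pruned)              # int64 indices of survivors
vals = pruned.ravel()[idx]                # surviving float32 values
sparse_bytes = idx.nbytes + vals.nbytes

# At 50% sparsity this layout is actually *larger* than the dense
# array (8-byte indices dominate); sparse formats only pay off at
# much higher sparsity levels.
print(w.nbytes, sparse_bytes)
```

This is why the saved checkpoint from unstructured pruning is the same size: the weights are numerically sparse but physically dense.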

@virentakia
Author

Thanks! Are there any pruning options that do reduce the memory footprint?
