[Help][BUG] KeyError: 'lm_head.weight' on loading llama 3.2 #1920

Open
steveepreston opened this issue Oct 13, 2024 · 2 comments
Labels: Gemma (Gemma model specific issues)

Comments

steveepreston commented Oct 13, 2024

I'm trying to load Llama 3.2 on a TPU VM v3-8 with this:

import keras
import keras_nlp

# 1 x 8 mesh: no data parallelism, 8-way model parallelism across the TPU cores.
device_mesh = keras.distribution.DeviceMesh(
    (1, 8), ["batch", "model"], devices=keras.distribution.list_devices()
)

# Map variable paths (regex keys) to sharding layouts on the mesh.
layout_map = keras.distribution.LayoutMap(device_mesh)
layout_map["token_embedding/embeddings"] = ("model", None)
layout_map["decoder_block.*attention.*(query|key|value)/kernel"] = ("model", None, None)
layout_map["decoder_block.*attention_output/kernel"] = ("model", None, None)
layout_map["decoder_block.*ffw_gating.*/kernel"] = (None, "model")
layout_map["decoder_block.*ffw_linear/kernel"] = ("model", None)

model_parallel = keras.distribution.ModelParallel(layout_map=layout_map, batch_dim_name="batch")
keras.distribution.set_distribution(model_parallel)

model = keras_nlp.models.Llama3CausalLM.from_preset("meta-llama/Llama-3.2-3B-Instruct")

but it throws this error:

KeyError: 'lm_head.weight'

Note: I took the layout_map code from This Example. I don't know whether the problem comes from the layout_map or from Llama3CausalLM.
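
Two things may be worth checking here (my own notes, not from the thread). First, the regex keys above (decoder_block, ffw_gating, ffw_linear) are Gemma-style layer names, and the KerasNLP Llama preset likely exposes different variable paths, so the layout_map may not match anything. Second, the KeyError itself hints that the checkpoint converter expects an lm_head.weight tensor, while Llama 3.2 ties its output projection to the token embedding, so that key may simply be absent from the Hugging Face checkpoint. A minimal sketch to separate the two, assuming enough host memory to load the model undistributed:

import keras_nlp

# Isolation test: load the preset with NO distribution set.
# If the same KeyError: 'lm_head.weight' appears here, the layout_map
# is not the cause and the checkpoint conversion itself is failing.
model = keras_nlp.models.Llama3CausalLM.from_preset(
    "meta-llama/Llama-3.2-3B-Instruct"
)

# If loading succeeds, print the real variable paths so the layout_map
# regexes can be checked against them (Keras 3 variables expose `.path`).
for v in model.weights:
    print(v.path, tuple(v.shape))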

github-actions bot added the Gemma (Gemma model specific issues) label on Oct 13, 2024
steveepreston changed the title from [BUG] KeyError: 'lm_head.weight' on loading llama 3.2 to [Help][BUG] KeyError: 'lm_head.weight' on loading llama 3.2 on Oct 13, 2024
@Gopi-Uppari

Hi @steveepreston,

I was able to execute the code using the Gemma model, and it worked without any issues. For the Llama model, however, could you please reach out to the Llama team for further assistance? Please refer to the Gist file for more details.

Thank you.
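
For reference, the working Gemma path described above amounts to swapping only the model class and preset under the same distribution setup (the preset name below is an assumption, not taken from the thread):

import keras_nlp

# Same DeviceMesh / LayoutMap / ModelParallel setup as in the report;
# only the model class and preset change. "gemma_2b_en" is illustrative.
model = keras_nlp.models.GemmaCausalLM.from_preset("gemma_2b_en")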

@steveepreston (Author)

Thank you for the attention, @Gopi-Uppari.

Yes, Gemma executed successfully in my test too (although gemma-2-9b-it threw an OOM error on the TPU). The problem is specific to the Llama model.

OK, I will try to create another issue there as well.
