Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

llama_7b wikitext perplexity 7.0915350914 #45

Open
xiaopengaia opened this issue Apr 30, 2024 · 3 comments
Open

llama_7b wikitext perplexity 7.0915350914 #45

xiaopengaia opened this issue Apr 30, 2024 · 3 comments

Comments

@xiaopengaia
Copy link

bash scripts/llama_7b.sh

the source model: wikitext perplexity is 5.67702

prune this model, sparsity 50%, get wikitext perplexity is 7.09153509

but the paper is : 50% 7.26

why?

@Eric-mingjie
Copy link
Collaborator

Are you using the c4 calibration dataset or the redpajama? In my experience, using the calibration dataset from redpajama could get numbers as high as around 7.10.

@taratt
Copy link

taratt commented May 8, 2024

Hi. Were you able to resolve the issue?

@xiaopengaia
Copy link
Author

Thanks, I didn't use c4 calibration dataset, I just wanted to reproduce the data in the paper.

def get_loaders(name, nsamples=128, seed=0, seqlen=2048, tokenizer=None):
    if 'wikitext2' in name:
        return get_wikitext2(nsamples, seed, seqlen, tokenizer)
    # if "c4" in name:
        # return get_c4(nsamples, seed, seqlen, tokenizer)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants