LM Contamination Task

This repo works for llm-jp eval-tuning-wg task9: データリークの評価

Introduction

Oscar Sainz, et al. firstly proposed the idea that the model is contaminated if it is able to generate examples of the dataset. However, recent works show that this method can be unreliable and subject to failure. S. Golchin & M. Surdeanu(https://arxiv.org/pdf/2311.06233.pdf) argue that such failures can result either from the sparsity introduced by the request to reproduce the first instances of a dataset split or from the inability to bypass the safety filters set by the model provider when the model is asked to generate copyrighted content like dataset instances.

Osainz has posted the related works on huggingface community

[Time Travel in LLMs: Tracing Data Contamination in Large Language Models (Golchin and Surdeanu, 2023)][reference]
[Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation (Li 2023)][reference] reference
[Detecting Pretraining Data from Large Language Models (Shi et al., 2023)][reference] reference
[Proving Test Set Contamination in Black Box Language Models (Oren et al., 2023)][reference] reference
[Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models (Golchin and Surdeanu, 2023)][reference] reference
[Investigating Data Contamination in Modern Benchmarks for Large Language Models (Deng et al., 2023)][reference] reference
[Rethinking Benchmark and Contamination for Language Models with Rephrased Samples (Yang et al., 2023)][reference] reference

Progress

So far, this repo implementated part of S. Golchin & M. Surdeanu(https://arxiv.org/pdf/2311.06233.pdf)'s work.

Experiment Results

WNLI

GPT3.5

BLUERT:

with guide 0.5124241530895233
without guide 0.22064677874247232 RouGEL:
with guide 0.34238831625188737
without guide 0.09239756877931599

GPT4

BLUERT:

with guide 0.49290904998779295
without guide 0.46190741956233977
with guide 0.32426375556561493
without guide 0.2879418270645807

Name		Name	Last commit message	Last commit date
Latest commit History 1,274 Commits
.idea		.idea
__pycache__		__pycache__
data/nli-task/wnli		data/nli-task/wnli
figures		figures
initial_time_travel		initial_time_travel
legacy_code		legacy_code
llm-jp-test		llm-jp-test
out/nli-task/wnli		out/nli-task/wnli
.DS_Store		.DS_Store
.gitignore		.gitignore
analyse_auc_results.py		analyse_auc_results.py
analyse_by_hypothesis_test.py		analyse_by_hypothesis_test.py
analyse_by_lime.py		analyse_by_lime.py
analyse_by_mem_score.py		analyse_by_mem_score.py
analyse_corpus_similarity.py		analyse_corpus_similarity.py
analyse_entropy.py		analyse_entropy.py
analyse_mem_score.py		analyse_mem_score.py
auc_caculation.py		auc_caculation.py
auc_results_absolute_truncated_all_length_all_model_size.csv		auc_results_absolute_truncated_all_length_all_model_size.csv
auc_results_absolute_truncated_all_length_all_model_size_0.csv		auc_results_absolute_truncated_all_length_all_model_size_0.csv
auc_results_absolute_truncated_all_length_all_model_size_1.csv		auc_results_absolute_truncated_all_length_all_model_size_1.csv
auc_results_absolute_truncated_all_length_all_model_size_2.csv		auc_results_absolute_truncated_all_length_all_model_size_2.csv
auc_results_absolute_untruncated_all_length_all_model_size.csv		auc_results_absolute_untruncated_all_length_all_model_size.csv
auc_results_absolute_untruncated_all_length_all_model_size_0.csv		auc_results_absolute_untruncated_all_length_all_model_size_0.csv
auc_results_absolute_untruncated_all_length_all_model_size_1.csv		auc_results_absolute_untruncated_all_length_all_model_size_1.csv
auc_results_absolute_untruncated_all_length_all_model_size_2.csv		auc_results_absolute_untruncated_all_length_all_model_size_2.csv
auc_results_relative_truncated_all_length_all_model_size.csv		auc_results_relative_truncated_all_length_all_model_size.csv
auc_results_relative_truncated_all_length_all_model_size_0.csv		auc_results_relative_truncated_all_length_all_model_size_0.csv
auc_results_relative_truncated_all_length_all_model_size_1.csv		auc_results_relative_truncated_all_length_all_model_size_1.csv
auc_results_relative_truncated_all_length_all_model_size_2.csv		auc_results_relative_truncated_all_length_all_model_size_2.csv
black_box_mia.py		black_box_mia.py
build_dataset.py		build_dataset.py
corpus_sim.xlsx		corpus_sim.xlsx
data_creation_for_memorization.py		data_creation_for_memorization.py
data_explore.py		data_explore.py
data_random_sample.py		data_random_sample.py
dataset_reform.py		dataset_reform.py
eda_pac_mia.py		eda_pac_mia.py
embedding_features.py		embedding_features.py
embedding_figure_draw.py		embedding_figure_draw.py
embedding_learning.py		embedding_learning.py
eval.py		eval.py
evaluation_results.tex		evaluation_results.tex
generation_features.py		generation_features.py
gray_box_mia.py		gray_box_mia.py
memorization_score_filtering.py		memorization_score_filtering.py
memorization_statistics.py		memorization_statistics.py
obtain_sec_mem_score.py		obtain_sec_mem_score.py
readme.md		readme.md
requirement.txt		requirement.txt
results_check.py		results_check.py
results_plot.png		results_plot.png
results_plot_absolute_truncated.png		results_plot_absolute_truncated.png
results_plot_absolute_untruncated.png		results_plot_absolute_untruncated.png
results_plot_relative_truncated.png		results_plot_relative_truncated.png
results_show.py		results_show.py
runing_both.py		runing_both.py
sbatch_balck_box.sh		sbatch_balck_box.sh
sbatch_pac.sh		sbatch_pac.sh
sbatch_relative_run.sh		sbatch_relative_run.sh
sbatch_truncate_run.sh		sbatch_truncate_run.sh
sbatch_untruncate_run.sh		sbatch_untruncate_run.sh
sem_score_comparision.py		sem_score_comparision.py
utils.py		utils.py
youden_index_validating.py		youden_index_validating.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LM Contamination Task

Introduction

Progress

Experiment Results

WNLI

GPT3.5

GPT4

About

Releases

Packages

Contributors 2

Languages

llm-jp/llm-jp-data-contamination

Folders and files

Latest commit

History

Repository files navigation

LM Contamination Task

Introduction

Progress

Experiment Results

WNLI

GPT3.5

GPT4

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages