[MIEB] Add image classification and zero shot classification tasks #1101

isaac-chung · 2024-07-19T13:48:19Z

First batch of image classification datasets, which covers

Caltech101
- Linear probe
- zero shot
CIFAR10
- Linear probe
- zero shot
CIFAR100
- Linear probe
- zero shot
FGVCAircraft
- Linear probe
- zero shot
Food101
- Linear probe
- zero shot
OxfordPets
- Linear probe
- zero shot

Checklist

Run tests locally to make sure nothing is broken using make test.
Run the formatter to format the code using make lint.

Adding datasets checklist

Reason for dataset addition: ...

I have run the following models on the task (adding the results to the pr). These can be run using the mteb -m {model_name} -t {task_name} command.
- mteb run -m openai/clip-vit-base-patch32 --model_revision 3d74acf9a28c67741b2f4f2ea7635f0aaf6f0268 --output_folder results-mieb -t Caltech101
I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).
If the dataset is too big (e.g. >2048 examples), considering using self.stratified_subsampling() under dataset_transform()
I have filled out the metadata object in the dataset file (find documentation on it here).
Run tests locally to make sure nothing is broken using make test.
Run the formatter to format the code using make lint.

gowitheflow-1998 · 2024-07-19T14:26:50Z

thanks! amazing speed. I think for these ones, we would like to implement evaluation with both AbsTaskImageClassification (linear probe) and also zero-shot ones with AbsTaskZeroshotClassification. Text candidate prompts for zeroshot can be found
p42 here in the CLIP paper, e.g., "a photo of kangaroo" and the rest of 101 for Caltech101. cc @imenelydiaker for information as well.

Also why is test failing for loading caltech101?

isaac-chung · 2024-07-19T15:22:53Z

I think for these ones, we would like to implement evaluation with both AbsTaskImageClassification (linear probe) and also zero-shot ones with AbsTaskZeroshotClassification.

Ooooh no problem. I was confused by the sheet where it said I; or I-T zero shot - thought it meant either-or, not both. I can add the zero-shot version as well though 👍

Also why is test failing for loading caltech101?

~~I am also perplexed by this. The link we use to test works.~~ [edit] think I found a fix. Pushed.

isaac-chung · 2024-07-19T20:22:34Z

~~For CIFAR, the implementation here is 9-20 points short compared to the CLIP paper for both linear probing and zero shot on the B/32 model 🤔 Same for Caltech101.~~ Resolved!

* mieb ZeroshotClassification * mieb docs * mieb implementation demo * model meta; abstask column names; linear probe clf * model meta; abstask column names; linear probe clf * fix: update naming as candidate_labels * Update README.md * Update README.md * i2tretrieval * test load data ignore i2tretrieval * [MIEB] Add image clustering (#1088) * make lint * wip * add TinyImageNet and run * type hints * add accuracy * lint * remove unused & fix typos * T2I Retrieval * Any2AnyRetrieval * fix tests from merge * [MIEB] Add image text pair classification and tests (#1099) * add ImageTextPairClassification abstask and evaluator * dataset transform into sequence of images for each sample * fix processing logic; list of list images compatability * lint and docstrings * make lint * fix failing tests in TaskMetadata * add tests for mieb * skip gated repo --------- Co-authored-by: gowitheflow-1998 <[email protected]> * [MIEB] Add image classification and zero shot classification tasks (#1101) * fix task metadata * use overrideable column names * add CIFAR datasets * add caltech101 dataset * add FGVC aircraft dataset * add food 101 dataset * add OxfordPets dataset * remove comments * correct cifar100 path * update cifar100 classification results * cifar zero shot results * add caltech101 zero shot * matching CLIP paper implementation * add aircraft and food zero shot * add oxford pets zero shot * [MIEB] Add CIFAR clustering (#1104) add CIFAR clustering * [MIEB] Add more image classification and zero shot classification datasets (#1103) * update category to i2t * add MNIST linear probe and zero shot * add FER2013 linear probe and zero shot * add stanford cars linear probe and zero shot * add birdsnap linear probe and zero shot * add eurosat linear probe and zero shot * lint * correct eurosat zero shot labels * add abstask for image multilable and voc2007 * make lint * [MIEB] Add more image classification and zero shot datasets (#1105) * add STL10 linear probe and zero shot * add RESISC45 linear probe and zeor shot * add Describable textures linear probe and zero shot * fix spacing lint * add SUN397 linear probe and zero shot * correct SUN397 zero shot captions * add baai bge vista * add e5-v * linting * memory issues for image linear probe & zeroshot * kknn linear probe arguments * del comments * Add some classification and ZeroShot classification tasks (#1107) * Add Country211 classification task * Add imagenet1k classification task * Add UCF101 classification task * Add PatchCamelyon Classification task * Add GTSRB classification task * Add GSTRB Zero Shot Classification * Add country211 zero shot classification * Add results for classification tasks * Add zero shot classification tasks * Add PatchCamelyon tasks and results * Add linting * Add results and fix prompts for zero shot * Add results * Add results and linting * fix dependency & clip mock test * [MIEB] Add jina clip (#1120) * add jina clip and mscoco i2t and t2i results * make lint * [MIEB] Update `mieb` with the `main` branch and some fixes (#1126) * fix instruction retrival (#1072) * fix instruction retrival * fix test * add points * make nested results * add test * skip instruction test * fix instruction passes * fix unions * move do_length_ablation Co-authored-by: Kenneth Enevoldsen <[email protected]> --------- Co-authored-by: Kenneth Enevoldsen <[email protected]> * Update points table * fix: fix bug-causing spelling error in function name of e5-mistral-instruct (#1106) found bug * 1.12.85 Automatically generated by python-semantic-release * fix: MultilingualSentimentClassification (#1109) * Update points table * fix: Avoid spaces in dataset name for CQADupstack and ignore speed tasks * 1.12.86 Automatically generated by python-semantic-release * fix: Ensure that MLSUMClusteringP2P.v2 use the fast implementation as was intended (#1112) * fix: Ensure that MLSUMClusteringP2P.v2 use the fast implementation as was intended * fix: fixed formatting for cli * docs: improve searchability in the advanced usage documentation * 1.12.87 Automatically generated by python-semantic-release * docs: improve searchability in the advanced usage documentation (#1113) * docs: improve searchability in the advanced usage documentation * docs: update based on corrections * fix: export type for `mteb create_meta` (#1114) * fix export type * fix dataset version too * 1.12.88 Automatically generated by python-semantic-release * fix: Simplify models implementations (#1085) * Merge * Adapt * Simplify * Check for rev again * Rmv cmmnt * Simplify * simplify * Rmv comment Co-authored-by: Kenneth Enevoldsen <[email protected]> * Use logging; change try except; add info * Lint * Rmv results * Update rev * format * Simplify models; Allow instructions * Jobs * Fix merge * Format * Adapt models * fix: ensure that e5 ignores the NQ * format --------- Co-authored-by: Kenneth Enevoldsen <[email protected]> * 1.12.89 Automatically generated by python-semantic-release * fix: nomic models using prefix correctly (#1125) * fix: nomic models using prefix correctly * chore: remove comment * fix: handling in case not torch tensor * Fix typo --------- Co-authored-by: Niklas Muennighoff <[email protected]> * 1.12.90 Automatically generated by python-semantic-release * refactor vista model wrapper to contain lib import * python 38 type hints --------- Co-authored-by: Roman Solomatin <[email protected]> Co-authored-by: Kenneth Enevoldsen <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: anpalmak2003 <[email protected]> Co-authored-by: github-actions <[email protected]> Co-authored-by: Niklas Muennighoff <[email protected]> Co-authored-by: Zach Nussbaum <[email protected]> Co-authored-by: chenghao xiao <[email protected]> * image memoery issues for all retrieval Abstasks * Add CLEVR and SciMMIR Image-Text Understanding tasks (#1127) * Add CLEVER and SciMMIR * Update metadata * remove useless comment * Add linting * fix typo and tests * Add CLEVR count task * add linting * add fashion200k & fashionIQ test passed * clip text max seq truncation * add WebQA, NIGHTS, OVEN * any2any retrieval chunk encoding * add nomic vision model; any2any topk bug * add cv recall * add InfoSeek; VisualNews * [MIEB] Add Stanford Cars i2i Retrieval (#1147) * wip * add results * make lint * change back the order * [MIEB] Add CUB200 i2i retrieval (#1154) * add cub200 and results * add skip_first_result * skipped self and rerun results * consolidate i2t and t2i to any2any * remove abstask and evaluators * remove references from test --------- Co-authored-by: gowitheflow-1998 <[email protected]> Co-authored-by: chenghao xiao <[email protected]> Co-authored-by: Imene Kerboua <[email protected]> Co-authored-by: Roman Solomatin <[email protected]> Co-authored-by: Kenneth Enevoldsen <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: anpalmak2003 <[email protected]> Co-authored-by: github-actions <[email protected]> Co-authored-by: Niklas Muennighoff <[email protected]> Co-authored-by: Zach Nussbaum <[email protected]>

isaac-chung added 9 commits July 19, 2024 13:14

fix task metadata

246a6ab

use overrideable column names

d3f88d9

add CIFAR datasets

9dd0bfb

add caltech101 dataset

e19cbf4

add FGVC aircraft dataset

b828754

add food 101 dataset

2320a2d

add OxfordPets dataset

577c127

Merge branch 'mieb' into add-img-class-tasks

66ac2b6

remove comments

1daa91f

isaac-chung marked this pull request as ready for review July 19, 2024 13:58

isaac-chung requested a review from gowitheflow-1998 July 19, 2024 13:58

isaac-chung added 2 commits July 19, 2024 19:48

correct cifar100 path

01049ca

update cifar100 classification results

1138c19

isaac-chung added 5 commits July 19, 2024 20:23

cifar zero shot results

b12db2c

add caltech101 zero shot

e83c005

matching CLIP paper implementation

1fa9a57

add aircraft and food zero shot

51a66ba

add oxford pets zero shot

f0ae658

isaac-chung changed the title ~~[MIEB] Add image classification tasks~~ [MIEB] Add image classification and zero shot classification tasks Jul 20, 2024

gowitheflow-1998 approved these changes Jul 20, 2024

View reviewed changes

gowitheflow-1998 merged commit 3f888fa into mieb Jul 20, 2024
9 checks passed

gowitheflow-1998 deleted the add-img-class-tasks branch July 20, 2024 14:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MIEB] Add image classification and zero shot classification tasks #1101

[MIEB] Add image classification and zero shot classification tasks #1101

isaac-chung commented Jul 19, 2024 •

edited

Loading

gowitheflow-1998 commented Jul 19, 2024

isaac-chung commented Jul 19, 2024 •

edited

Loading

isaac-chung commented Jul 19, 2024 •

edited

Loading

[MIEB] Add image classification and zero shot classification tasks #1101

[MIEB] Add image classification and zero shot classification tasks #1101

Conversation

isaac-chung commented Jul 19, 2024 • edited Loading

Checklist

Adding datasets checklist

gowitheflow-1998 commented Jul 19, 2024

isaac-chung commented Jul 19, 2024 • edited Loading

isaac-chung commented Jul 19, 2024 • edited Loading

isaac-chung commented Jul 19, 2024 •

edited

Loading

isaac-chung commented Jul 19, 2024 •

edited

Loading

isaac-chung commented Jul 19, 2024 •

edited

Loading