Skip to content

Commit

Permalink
Merge branch 'main' into simplify-models
Browse files Browse the repository at this point in the history
  • Loading branch information
Muennighoff authored Jul 24, 2024
2 parents 027110a + 2e6dd12 commit 886c796
Show file tree
Hide file tree
Showing 28 changed files with 787 additions and 403 deletions.
38 changes: 28 additions & 10 deletions .github/workflows/docs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,8 @@ on:
pull_request:

jobs:
create-table:
create-table-on-pr:
if: github.event_name == 'pull_request'
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
Expand All @@ -23,23 +24,40 @@ jobs:
make install
- name: Create table
run: python docs/create_tasks_table.py
run: |
make build-docs
create-table-and-push:
if: github.ref == 'refs/heads/main'
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
with:
token: ${{ secrets.RELEASE }}

- name: Configure Git (for push only)
if: github.event_name == 'push'
- uses: actions/setup-python@v4
with:
python-version: "3.10"
cache: "pip"

- name: Install dependencies
run: |
git config --global user.email "github-actions[bot]@users.noreply.github.com"
git config --global user.name "github-actions[bot]"
make install
- name: Create table
run: |
make build-docs
- name: Push table
if: github.event_name == 'push'
env:
GITHUB_TOKEN: ${{ secrets.RELEASE }}
run: |
git config --global user.email "github-actions[bot]@users.noreply.github.com"
git config --global user.name "github-actions[bot]"
# Check if changes exist
if git diff --quiet; then
echo "No changes detected"
else
git add docs/tasks.md
git commit -m "Update tasks table"
git push
git push
fi
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -141,3 +141,6 @@ tmp.py
# sandbox
sb.ipynb
tests/create_meta/model_card.md

# removed results from mteb repo they are now available at: https://github.com/embeddings-benchmark/results
results/
6 changes: 6 additions & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -30,3 +30,9 @@ pr:
@echo "--- 🚀 Running requirements for a PR ---"
make lint
make test


build-docs:
@echo "--- 📚 Building documentation ---"
# since we do not have a documentation site, this just build tables for the .md files
python docs/create_tasks_table.py
1 change: 1 addition & 0 deletions docs/mmteb/points/1072.jsonl
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
{"GitHub": "Samoed", "Bug fixes": 2}
2 changes: 1 addition & 1 deletion docs/mmteb/points_table.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,9 +57,9 @@ _Note_: this table is **autogenerated** and should not be edited. It is intended
| sted97 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| PranjalChitale | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| Sakshamrzt | 12 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| Samoed | 0 | 0 | 6 | 0 | 0 | 0 | 9 | 0 | 15 |
| taidnguyen | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| artemsnegirev | 12 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 14 |
| Samoed | 0 | 0 | 4 | 0 | 0 | 0 | 9 | 0 | 13 |
| mariyahendriksen | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 | 12 |
| xhluca | 6 | 2 | 4 | 0 | 0 | 0 | 0 | 0 | 12 |
| anpalmak2003 | 9 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 12 |
Expand Down
5 changes: 3 additions & 2 deletions docs/tasks.md
Original file line number Diff line number Diff line change
Expand Up @@ -52,6 +52,7 @@ The following tables give you an overview of the tasks in MTEB.
| [BlurbsClusteringS2S.v2](https://www.inf.uni-hamburg.de/en/inst/ab/lt/resources/data/germeval-2019-hmc.html) (Steffen Remus, 2019) | ['deu'] | Clustering | s2s | [Fiction, Written] | {'test': 2048} | {'test': 23.02} |
| [BornholmBitextMining](https://aclanthology.org/W19-6138/) | ['dan'] | BitextMining | s2s | [Web, Social, Fiction, Written] | {'test': 500} | {'test': 89.7} |
| [BrazilianToxicTweetsClassification](https://paperswithcode.com/dataset/told-br) (Joao Augusto Leite and Diego F. Silva and Kalina Bontcheva and Carolina Scarton, 2020) | ['por'] | MultilabelClassification | s2s | [Constructed, Written] | {'test': 2048} | {'test': 85.05} |
| [BrightRetrieval](https://huggingface.co/datasets/xlangai/BRIGHT) (Su et al., 2024) | ['eng'] | Retrieval | s2p | [Non-fiction] | {'standard': 1334914, 'long': 7048} | {'standard': 800.3994729248476, 'long': 46527.35839954597} |
| [BulgarianStoreReviewSentimentClassfication](https://doi.org/10.7910/DVN/TXIK9P) (Georgieva-Trifonova et al., 2018) | ['bul'] | Classification | s2s | [Reviews, Written] | {'test': 182} | {'test': 316.7} |
| [CBD](http://2019.poleval.pl/files/poleval2019.pdf) | ['pol'] | Classification | s2s | | {'test': 1000} | {'test': 93.2} |
| [CDSC-E](https://aclanthology.org/P17-1073.pdf) | ['pol'] | PairClassification | s2s | | None | None |
Expand Down Expand Up @@ -850,7 +851,7 @@ The following tables give you an overview of the tasks in MTEB.
| ell | 3 | 6 | 1 | 0 | 1 | 2 | 0 | 3 | 0 | 0 | 0 |
| emi | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| emp | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| eng | 16 | 143 | 16 | 3 | 1 | 8 | 7 | 72 | 13 | 2 | 1 |
| eng | 16 | 143 | 16 | 3 | 1 | 8 | 7 | 73 | 13 | 2 | 1 |
| enq | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| epo | 3 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| eri | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Expand Down Expand Up @@ -1622,7 +1623,7 @@ The following tables give you an overview of the tasks in MTEB.
| zty | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| zul | 2 | 3 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| zyp | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Total | 1394 | 794 | 304 | 3 | 28 | 67 | 47 | 371 | 85 | 2 | 2 |
| Total | 1394 | 794 | 304 | 3 | 28 | 67 | 47 | 372 | 85 | 2 | 2 |
<!-- TASK LANG TABLE END -->

</details>
15 changes: 0 additions & 15 deletions missing_tasks.txt

This file was deleted.

Loading

0 comments on commit 886c796

Please sign in to comment.