In the code below, we used two models of quite different capacities: for bert-score and bertscore-sentence-MNLI, we used RoBERTa-large, which is about 1.6 GB (the default for bert-score as implemented in HF's evaluate library). But for bertscore-sentence, which is built on top of Sentence-BERT, we used all-MiniLM-L6-v2, which is only about 80 MB. This puts our bertscore-sentence approach at a huge disadvantage. Of course, we picked the small model for speed in pilot studies. See https://github.com/SigmaWe/DocAsRef_0/blob/de4de4b4275e661621bebf3b2f92d8676e2f81c2/dar_env.py#L8-L11
I think if we use a large-capacity model for bertscore-sentence, we can further boost our sentence-based pair-wise approach.
There are two directions we can try:
A quick one is to simply use a larger model trained by the Sentence-BERT project. Let's try two: all-mpnet-base-v2 and all-roberta-large-v1. The former is still much smaller than RoBERTa-large but scores higher on the Sentence-BERT leaderboard, while the latter is just RoBERTa-large fine-tuned with Sentence-BERT's dot-product loss. So let's test both of these versions below:
BTW, we can use HF's transformers library for Sentence-BERT as well. That way, we don't have to import both transformers and sentence_transformers, and we can consolidate all code under one framework.
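For reference, this is roughly what that consolidation looks like: the Sentence-BERT checkpoints are ordinary HF models, and their model cards describe mean pooling over token embeddings (masking out padding) followed by normalization. A sketch, assuming all-mpnet-base-v2:

```python
# Sketch: reproducing a Sentence-BERT embedding with plain HF transformers,
# using mean pooling as described on the all-mpnet-base-v2 model card.
import torch
from transformers import AutoModel, AutoTokenizer

name = "sentence-transformers/all-mpnet-base-v2"
tok = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name)

sents = ["The cat sits on the mat.", "It is raining."]
batch = tok(sents, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    out = model(**batch)

# Mean-pool token embeddings, ignoring padding positions.
mask = batch["attention_mask"].unsqueeze(-1).float()
emb = (out.last_hidden_state * mask).sum(dim=1) / mask.sum(dim=1)
emb = torch.nn.functional.normalize(emb, dim=1)
print(emb.shape)
```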
A slower but completely fair approach: we also use RoBERTa-large (generically pretrained, not fine-tuned on MNLI) to embed each sentence and extract the embedding corresponding to the [CLS] token. For how to do it, see here.
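The [CLS] extraction above can be sketched as follows (note that RoBERTa's tokenizer uses `<s>` as its [CLS]-equivalent, and it always sits at position 0):

```python
# Sketch: sentence embeddings from generically pretrained roberta-large,
# taking the hidden state at the first position (<s>, RoBERTa's [CLS]).
import torch
from transformers import AutoModel, AutoTokenizer

name = "roberta-large"
tok = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name)

sents = ["The cat sits on the mat.", "It is raining."]
batch = tok(sents, padding=True, return_tensors="pt")
with torch.no_grad():
    out = model(**batch)

# Position 0 of every sequence is the <s> token for RoBERTa models.
cls_emb = out.last_hidden_state[:, 0, :]
print(cls_emb.shape)
```

Since this uses the exact same backbone as bert-score's default, any score difference is attributable to the sentence-level pairing rather than model capacity.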