How to use this together with wandb? Plus a question about class imbalance #489
-
Hi Kevin, thanks for creating this wonderful repo! I was wondering what the best practice is for using this together with wandb, which I typically use for hyperparameter tuning. Basically, I'll need to log each epoch's loss, metrics, etc. to wandb. I guess I'll probably either not use the built-in trainers, or find some way to hook into them?

Also, as someone new to metric learning, I'm wondering how well the standard metric learning losses, miners, etc. handle imbalanced training data. For example, I have a few classes with tens of samples, while most classes have only one or two samples. This is like a contrastive learning setting where I'll have only one positive sample (or none) and a lot of negatives. Could you recommend some papers that compare the losses, samplers, or miners in those settings? Thanks a lot!
-
Yes, you could pass in an `end_of_iteration_hook` like this:

```python
def hook(trainer):
    for k, v in trainer.losses.items():
        log(k, v)

trainer = MetricLossOnly(..., end_of_iteration_hook=hook)
```

But instead of this, I would write my own training code, or use a framework like PyTorch Lightning, PyTorch Ignite, Catalyst, etc. Their trainer classes offer more features and are better maintained, because the entire focus of those libraries is to make training easier.
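That said, if you do stick with the hook approach and want the values in wandb specifically, here is a minimal, untested sketch (it assumes you've already called `wandb.init` somewhere in your script; `wandb_hook` is just an illustrative name):

```python
import wandb

def wandb_hook(trainer):
    # trainer.losses maps loss names to their current values,
    # so the whole dict can be forwarded to wandb each iteration
    wandb.log({k: v for k, v in trainer.losses.items()})

# then pass it in: MetricLossOnly(..., end_of_iteration_hook=wandb_hook)
```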
A common technique in metric learning papers is to make each batch balanced. For example, if the batch size is 64, you could include 16 classes with 4 samples each. In this library it's called `MPerClassSampler`:

```python
from pytorch_metric_learning.samplers import MPerClassSampler

# pass this into your dataloader
sampler = MPerClassSampler(labels, m=4, batch_size=64)
```

At the moment I can't think of any papers on this specific subject.
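To actually use the sampler, you'd pass it to your `DataLoader` in place of `shuffle=True`. A rough sketch, where `train_dataset` is a placeholder for your own dataset:

```python
from torch.utils.data import DataLoader

# MPerClassSampler expects batch_size to be a multiple of m;
# drop_last=True keeps every batch the same size
train_loader = DataLoader(
    train_dataset,   # placeholder: your own dataset
    batch_size=64,
    sampler=sampler,
    drop_last=True,
)
```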